Dynamic Encoding of Acoustic Features in Neural Responses to Continuous Speech

被引:71
作者
Khalighinejad, Bahar [1 ]
da Silva, Guilherme Cruzatto [1 ]
Mesgarani, Nima [1 ]
机构
[1] Columbia Univ, Dept Elect Engn, New York, NY 10027 USA
基金
美国国家卫生研究院;
关键词
EEG; event-related potential; phonemes; speech; EVENT-RELATED POTENTIALS; SUPERIOR TEMPORAL GYRUS; AUDITORY-EVOKED-POTENTIALS; BRAIN POTENTIALS; BROCAS-APHASIA; PERCEPTION; COMPONENT; LANGUAGE; CORTEX; REPRESENTATIONS;
D O I
10.1523/JNEUROSCI.2383-16.2017
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
Humans are unique in their ability to communicate using spoken language. However, it remains unclear how the speech signal is transformed and represented in the brain at different stages of the auditory pathway. In this study, we characterized electroencephalography responses to continuous speech by obtaining the time-locked responses to phoneme instances (phoneme-related potential). We showed that responses to different phoneme categories are organized by phonetic features. We found that each instance of a phoneme in continuous speech produces multiple distinguishable neural responses occurring as early as 50 ms and as late as 400 ms after the phoneme onset. Comparing the patterns of phoneme similarity in the neural responses and the acoustic signals confirms a repetitive appearance of acoustic distinctions of phonemes in the neural data. Analysis of the phonetic and speaker information in neural activations revealed that different time intervals jointly encode the acoustic similarity of both phonetic and speaker categories. These findings provide evidence for a dynamic neural transformation of low-level speech features as they propagate along the auditory pathway, and form an empirical framework to study the representational changes in learning, attention, and speech disorders.
引用
收藏
页码:2176 / 2185
页数:10
相关论文
共 76 条
[1]   Human cortical responses to the speech envelope [J].
Aiken, Steven J. ;
Picton, Terence W. .
EAR AND HEARING, 2008, 29 (02) :139-157
[2]   How Do Humans Process and Recognize Speech? [J].
Allen, Jont B. .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (04) :567-577
[3]  
[Anonymous], 2009, Encyclopedia of Distances, DOI DOI 10.1007/978-3-642-00234-21
[4]  
[Anonymous], 2010, COURSE PHONETICS
[5]  
[Anonymous], 2010, Modern multidimensional scaling: theory and applications
[6]  
Benjamini Y, 2001, ANN STAT, V29, P1165
[7]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[8]   Maturation of cortical sound processing as indexed by event-related potentials [J].
Ceponiene, R ;
Rinne, T ;
Näätänen, R .
CLINICAL NEUROPHYSIOLOGY, 2002, 113 (06) :870-882
[9]   Speech-Specific Tuning of Neurons in Human Superior Temporal Gyrus [J].
Chan, Alexander M. ;
Dykstra, Andrew R. ;
Jayaram, Vinay ;
Leonard, Matthew K. ;
Travis, Katherine E. ;
Gygi, Brian ;
Baker, Janet M. ;
Eskandar, Emad ;
Hochberg, Leigh R. ;
Halgren, Eric ;
Cash, Sydney S. .
CEREBRAL CORTEX, 2014, 24 (10) :2679-2693
[10]   Multiresolution spectrotemporal analysis of complex sounds [J].
Chi, T ;
Ru, PW ;
Shamma, SA .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2005, 118 (02) :887-906