Phase Alignment of Low-Frequency Neural Activity to the Amplitude Envelope of Speech Reflects Evoked Responses to Acoustic Edges, Not Oscillatory Entrainment

The amplitude envelope of speech is crucial for accurate comprehension. Considered a key stage in speech processing, the phase of neural activity in the theta-delta bands (1-10 Hz) tracks the phase of the speech amplitude envelope during listening. However, the mechanisms underlying this envelope re...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:The Journal of neuroscience 2023-05, Vol.43 (21), p.3909-3921
Hauptverfasser: Oganian, Yulia, Kojima, Katsuaki, Breska, Assaf, Cai, Chang, Findlay, Anne, Chang, Edward, Nagarajan, Srikantan S
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 3921
container_issue 21
container_start_page 3909
container_title The Journal of neuroscience
container_volume 43
creator Oganian, Yulia
Kojima, Katsuaki
Breska, Assaf
Cai, Chang
Findlay, Anne
Chang, Edward
Nagarajan, Srikantan S
description The amplitude envelope of speech is crucial for accurate comprehension. Considered a key stage in speech processing, the phase of neural activity in the theta-delta bands (1-10 Hz) tracks the phase of the speech amplitude envelope during listening. However, the mechanisms underlying this envelope representation have been heavily debated. A dominant model posits that envelope tracking reflects entrainment of endogenous low-frequency oscillations to the speech envelope. Alternatively, envelope tracking reflects a series of evoked responses to acoustic landmarks within the envelope. It has proven challenging to distinguish these two mechanisms. To address this, we recorded MEG while participants ( = 12, 6 female) listened to natural speech, and compared the neural phase patterns to the predictions of two computational models: an oscillatory entrainment model and a model of evoked responses to peaks in the rate of envelope change. Critically, we also presented speech at slowed rates, where the spectro-temporal predictions of the two models diverge. Our analyses revealed transient theta phase-locking in regular speech, as predicted by both models. However, for slow speech, we found transient theta and delta phase-locking, a pattern that was fully compatible with the evoked response model but could not be explained by the oscillatory entrainment model. Furthermore, encoding of acoustic edge magnitudes was invariant to contextual speech rate, demonstrating speech rate normalization of acoustic edge representations. Together, our results suggest that neural phase-locking to the speech envelope is more likely to reflect discrete representation of transient information rather than oscillatory entrainment. This study probes a highly debated topic in speech perception: the neural mechanisms underlying the cortical representation of the temporal envelope of speech. It is well established that the slow intensity profile of the speech signal, its envelope, elicits a robust brain response that "tracks" these envelope fluctuations. The oscillatory entrainment model posits that envelope tracking reflects phase alignment of endogenous neural oscillations. Here the authors provide evidence for a distinct mechanism. They show that neural speech envelope tracking arises from transient evoked neural responses to rapid increases in the speech envelope. Explicit computational modeling provides direct and compelling evidence that evoked responses are the primary mechanism underlyin
doi_str_mv 10.1523/JNEUROSCI.1663-22.2023
format Article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_10218004</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2814523873</sourcerecordid><originalsourceid>FETCH-LOGICAL-c443t-56c0b625efda60076c9d18afb4a13fa64c7dff7c4ae631589cd0833f691d56ae3</originalsourceid><addsrcrecordid>eNpdUl1v0zAUtRCIlcFfmCzxwgMp_oqTPKGqymCoatHGni3XuWk9kjiLnaL-mP3XOWxUwJNl3XOOzrn3IHRByZymjH_6ti5vrzc3y6s5lZInjM0ZYfwFmsVpkTBB6Es0IywjiRSZOENvvL8jhGSEZq_RGc9oHlXyGXr4vtce8KKxu66FLmBX45X7lVwOcD9CZ454DeOgG7wwwR5sOOLgcNhHRts3NowV4LI7QON6mKg3PYDZ42uoGzDB4_LgfkIV_753nQc_sRfGjT5Yg8tqB_4jXruAN97YptHBDceoFwZtf7t5i17VuvHw7vk9R7eX5Y_l12S1-XK1XKwSIwQPSSoN2UqWQl1pGUNKU1Q01_VWaMprLYXJqrrOjNAgOU3zwlQk57yWBa1SqYGfo89Puv24baEyMFloVD_YVg9H5bRV_046u1c7d1CUMJoTIqLCh2eFwcXF-aBa6w3ETB3EuIrlVEwbz3iEvv8PeufGoYv5JlQhcs7yNKLkE8oMzvsB6pMbStRUAXWqgJoqoBhTUwUi8eLvLCfan5vzRxKCsT8</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2819483285</pqid></control><display><type>article</type><title>Phase Alignment of Low-Frequency Neural Activity to the Amplitude Envelope of Speech Reflects Evoked Responses to Acoustic Edges, Not Oscillatory Entrainment</title><source>MEDLINE</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><source>PubMed Central</source><creator>Oganian, Yulia ; Kojima, Katsuaki ; Breska, Assaf ; Cai, Chang ; Findlay, Anne ; Chang, Edward ; Nagarajan, Srikantan S</creator><creatorcontrib>Oganian, Yulia ; Kojima, Katsuaki ; Breska, Assaf ; Cai, Chang ; Findlay, Anne ; Chang, Edward ; Nagarajan, Srikantan S</creatorcontrib><description>The amplitude envelope of speech is crucial for accurate comprehension. Considered a key stage in speech processing, the phase of neural activity in the theta-delta bands (1-10 Hz) tracks the phase of the speech amplitude envelope during listening. However, the mechanisms underlying this envelope representation have been heavily debated. A dominant model posits that envelope tracking reflects entrainment of endogenous low-frequency oscillations to the speech envelope. Alternatively, envelope tracking reflects a series of evoked responses to acoustic landmarks within the envelope. It has proven challenging to distinguish these two mechanisms. To address this, we recorded MEG while participants ( = 12, 6 female) listened to natural speech, and compared the neural phase patterns to the predictions of two computational models: an oscillatory entrainment model and a model of evoked responses to peaks in the rate of envelope change. Critically, we also presented speech at slowed rates, where the spectro-temporal predictions of the two models diverge. Our analyses revealed transient theta phase-locking in regular speech, as predicted by both models. However, for slow speech, we found transient theta and delta phase-locking, a pattern that was fully compatible with the evoked response model but could not be explained by the oscillatory entrainment model. Furthermore, encoding of acoustic edge magnitudes was invariant to contextual speech rate, demonstrating speech rate normalization of acoustic edge representations. Together, our results suggest that neural phase-locking to the speech envelope is more likely to reflect discrete representation of transient information rather than oscillatory entrainment. This study probes a highly debated topic in speech perception: the neural mechanisms underlying the cortical representation of the temporal envelope of speech. It is well established that the slow intensity profile of the speech signal, its envelope, elicits a robust brain response that "tracks" these envelope fluctuations. The oscillatory entrainment model posits that envelope tracking reflects phase alignment of endogenous neural oscillations. Here the authors provide evidence for a distinct mechanism. They show that neural speech envelope tracking arises from transient evoked neural responses to rapid increases in the speech envelope. Explicit computational modeling provides direct and compelling evidence that evoked responses are the primary mechanism underlying cortical speech envelope representations, with no evidence for oscillatory entrainment.</description><identifier>ISSN: 0270-6474</identifier><identifier>EISSN: 1529-2401</identifier><identifier>DOI: 10.1523/JNEUROSCI.1663-22.2023</identifier><identifier>PMID: 37185238</identifier><language>eng</language><publisher>United States: Society for Neuroscience</publisher><subject>Acoustic Stimulation - methods ; Acoustics ; Alignment ; Amplitudes ; Auditory Cortex - physiology ; Auditory Perception ; Computational neuroscience ; Entrainment ; Female ; Frequency ; Frequency dependence ; Humans ; Information processing ; Locking ; Oscillations ; Representations ; Speech ; Speech - physiology ; Speech perception ; Speech Perception - physiology ; Speech processing ; Temporal variations ; Tracking</subject><ispartof>The Journal of neuroscience, 2023-05, Vol.43 (21), p.3909-3921</ispartof><rights>Copyright © 2023 Oganian et al.</rights><rights>Copyright Society for Neuroscience May 24, 2023</rights><rights>Copyright © 2023 Oganian et al. 2023</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c443t-56c0b625efda60076c9d18afb4a13fa64c7dff7c4ae631589cd0833f691d56ae3</citedby><cites>FETCH-LOGICAL-c443t-56c0b625efda60076c9d18afb4a13fa64c7dff7c4ae631589cd0833f691d56ae3</cites><orcidid>0000-0003-2480-4700</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC10218004/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC10218004/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,723,776,780,881,27901,27902,53766,53768</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/37185238$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Oganian, Yulia</creatorcontrib><creatorcontrib>Kojima, Katsuaki</creatorcontrib><creatorcontrib>Breska, Assaf</creatorcontrib><creatorcontrib>Cai, Chang</creatorcontrib><creatorcontrib>Findlay, Anne</creatorcontrib><creatorcontrib>Chang, Edward</creatorcontrib><creatorcontrib>Nagarajan, Srikantan S</creatorcontrib><title>Phase Alignment of Low-Frequency Neural Activity to the Amplitude Envelope of Speech Reflects Evoked Responses to Acoustic Edges, Not Oscillatory Entrainment</title><title>The Journal of neuroscience</title><addtitle>J Neurosci</addtitle><description>The amplitude envelope of speech is crucial for accurate comprehension. Considered a key stage in speech processing, the phase of neural activity in the theta-delta bands (1-10 Hz) tracks the phase of the speech amplitude envelope during listening. However, the mechanisms underlying this envelope representation have been heavily debated. A dominant model posits that envelope tracking reflects entrainment of endogenous low-frequency oscillations to the speech envelope. Alternatively, envelope tracking reflects a series of evoked responses to acoustic landmarks within the envelope. It has proven challenging to distinguish these two mechanisms. To address this, we recorded MEG while participants ( = 12, 6 female) listened to natural speech, and compared the neural phase patterns to the predictions of two computational models: an oscillatory entrainment model and a model of evoked responses to peaks in the rate of envelope change. Critically, we also presented speech at slowed rates, where the spectro-temporal predictions of the two models diverge. Our analyses revealed transient theta phase-locking in regular speech, as predicted by both models. However, for slow speech, we found transient theta and delta phase-locking, a pattern that was fully compatible with the evoked response model but could not be explained by the oscillatory entrainment model. Furthermore, encoding of acoustic edge magnitudes was invariant to contextual speech rate, demonstrating speech rate normalization of acoustic edge representations. Together, our results suggest that neural phase-locking to the speech envelope is more likely to reflect discrete representation of transient information rather than oscillatory entrainment. This study probes a highly debated topic in speech perception: the neural mechanisms underlying the cortical representation of the temporal envelope of speech. It is well established that the slow intensity profile of the speech signal, its envelope, elicits a robust brain response that "tracks" these envelope fluctuations. The oscillatory entrainment model posits that envelope tracking reflects phase alignment of endogenous neural oscillations. Here the authors provide evidence for a distinct mechanism. They show that neural speech envelope tracking arises from transient evoked neural responses to rapid increases in the speech envelope. Explicit computational modeling provides direct and compelling evidence that evoked responses are the primary mechanism underlying cortical speech envelope representations, with no evidence for oscillatory entrainment.</description><subject>Acoustic Stimulation - methods</subject><subject>Acoustics</subject><subject>Alignment</subject><subject>Amplitudes</subject><subject>Auditory Cortex - physiology</subject><subject>Auditory Perception</subject><subject>Computational neuroscience</subject><subject>Entrainment</subject><subject>Female</subject><subject>Frequency</subject><subject>Frequency dependence</subject><subject>Humans</subject><subject>Information processing</subject><subject>Locking</subject><subject>Oscillations</subject><subject>Representations</subject><subject>Speech</subject><subject>Speech - physiology</subject><subject>Speech perception</subject><subject>Speech Perception - physiology</subject><subject>Speech processing</subject><subject>Temporal variations</subject><subject>Tracking</subject><issn>0270-6474</issn><issn>1529-2401</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNpdUl1v0zAUtRCIlcFfmCzxwgMp_oqTPKGqymCoatHGni3XuWk9kjiLnaL-mP3XOWxUwJNl3XOOzrn3IHRByZymjH_6ti5vrzc3y6s5lZInjM0ZYfwFmsVpkTBB6Es0IywjiRSZOENvvL8jhGSEZq_RGc9oHlXyGXr4vtce8KKxu66FLmBX45X7lVwOcD9CZ454DeOgG7wwwR5sOOLgcNhHRts3NowV4LI7QON6mKg3PYDZ42uoGzDB4_LgfkIV_753nQc_sRfGjT5Yg8tqB_4jXruAN97YptHBDceoFwZtf7t5i17VuvHw7vk9R7eX5Y_l12S1-XK1XKwSIwQPSSoN2UqWQl1pGUNKU1Q01_VWaMprLYXJqrrOjNAgOU3zwlQk57yWBa1SqYGfo89Puv24baEyMFloVD_YVg9H5bRV_046u1c7d1CUMJoTIqLCh2eFwcXF-aBa6w3ETB3EuIrlVEwbz3iEvv8PeufGoYv5JlQhcs7yNKLkE8oMzvsB6pMbStRUAXWqgJoqoBhTUwUi8eLvLCfan5vzRxKCsT8</recordid><startdate>20230524</startdate><enddate>20230524</enddate><creator>Oganian, Yulia</creator><creator>Kojima, Katsuaki</creator><creator>Breska, Assaf</creator><creator>Cai, Chang</creator><creator>Findlay, Anne</creator><creator>Chang, Edward</creator><creator>Nagarajan, Srikantan S</creator><general>Society for Neuroscience</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7QG</scope><scope>7QR</scope><scope>7TK</scope><scope>7U7</scope><scope>7U9</scope><scope>8FD</scope><scope>C1K</scope><scope>FR3</scope><scope>H94</scope><scope>P64</scope><scope>7X8</scope><scope>5PM</scope><orcidid>https://orcid.org/0000-0003-2480-4700</orcidid></search><sort><creationdate>20230524</creationdate><title>Phase Alignment of Low-Frequency Neural Activity to the Amplitude Envelope of Speech Reflects Evoked Responses to Acoustic Edges, Not Oscillatory Entrainment</title><author>Oganian, Yulia ; Kojima, Katsuaki ; Breska, Assaf ; Cai, Chang ; Findlay, Anne ; Chang, Edward ; Nagarajan, Srikantan S</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c443t-56c0b625efda60076c9d18afb4a13fa64c7dff7c4ae631589cd0833f691d56ae3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Acoustic Stimulation - methods</topic><topic>Acoustics</topic><topic>Alignment</topic><topic>Amplitudes</topic><topic>Auditory Cortex - physiology</topic><topic>Auditory Perception</topic><topic>Computational neuroscience</topic><topic>Entrainment</topic><topic>Female</topic><topic>Frequency</topic><topic>Frequency dependence</topic><topic>Humans</topic><topic>Information processing</topic><topic>Locking</topic><topic>Oscillations</topic><topic>Representations</topic><topic>Speech</topic><topic>Speech - physiology</topic><topic>Speech perception</topic><topic>Speech Perception - physiology</topic><topic>Speech processing</topic><topic>Temporal variations</topic><topic>Tracking</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Oganian, Yulia</creatorcontrib><creatorcontrib>Kojima, Katsuaki</creatorcontrib><creatorcontrib>Breska, Assaf</creatorcontrib><creatorcontrib>Cai, Chang</creatorcontrib><creatorcontrib>Findlay, Anne</creatorcontrib><creatorcontrib>Chang, Edward</creatorcontrib><creatorcontrib>Nagarajan, Srikantan S</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Animal Behavior Abstracts</collection><collection>Chemoreception Abstracts</collection><collection>Neurosciences Abstracts</collection><collection>Toxicology Abstracts</collection><collection>Virology and AIDS Abstracts</collection><collection>Technology Research Database</collection><collection>Environmental Sciences and Pollution Management</collection><collection>Engineering Research Database</collection><collection>AIDS and Cancer Research Abstracts</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>The Journal of neuroscience</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Oganian, Yulia</au><au>Kojima, Katsuaki</au><au>Breska, Assaf</au><au>Cai, Chang</au><au>Findlay, Anne</au><au>Chang, Edward</au><au>Nagarajan, Srikantan S</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Phase Alignment of Low-Frequency Neural Activity to the Amplitude Envelope of Speech Reflects Evoked Responses to Acoustic Edges, Not Oscillatory Entrainment</atitle><jtitle>The Journal of neuroscience</jtitle><addtitle>J Neurosci</addtitle><date>2023-05-24</date><risdate>2023</risdate><volume>43</volume><issue>21</issue><spage>3909</spage><epage>3921</epage><pages>3909-3921</pages><issn>0270-6474</issn><eissn>1529-2401</eissn><abstract>The amplitude envelope of speech is crucial for accurate comprehension. Considered a key stage in speech processing, the phase of neural activity in the theta-delta bands (1-10 Hz) tracks the phase of the speech amplitude envelope during listening. However, the mechanisms underlying this envelope representation have been heavily debated. A dominant model posits that envelope tracking reflects entrainment of endogenous low-frequency oscillations to the speech envelope. Alternatively, envelope tracking reflects a series of evoked responses to acoustic landmarks within the envelope. It has proven challenging to distinguish these two mechanisms. To address this, we recorded MEG while participants ( = 12, 6 female) listened to natural speech, and compared the neural phase patterns to the predictions of two computational models: an oscillatory entrainment model and a model of evoked responses to peaks in the rate of envelope change. Critically, we also presented speech at slowed rates, where the spectro-temporal predictions of the two models diverge. Our analyses revealed transient theta phase-locking in regular speech, as predicted by both models. However, for slow speech, we found transient theta and delta phase-locking, a pattern that was fully compatible with the evoked response model but could not be explained by the oscillatory entrainment model. Furthermore, encoding of acoustic edge magnitudes was invariant to contextual speech rate, demonstrating speech rate normalization of acoustic edge representations. Together, our results suggest that neural phase-locking to the speech envelope is more likely to reflect discrete representation of transient information rather than oscillatory entrainment. This study probes a highly debated topic in speech perception: the neural mechanisms underlying the cortical representation of the temporal envelope of speech. It is well established that the slow intensity profile of the speech signal, its envelope, elicits a robust brain response that "tracks" these envelope fluctuations. The oscillatory entrainment model posits that envelope tracking reflects phase alignment of endogenous neural oscillations. Here the authors provide evidence for a distinct mechanism. They show that neural speech envelope tracking arises from transient evoked neural responses to rapid increases in the speech envelope. Explicit computational modeling provides direct and compelling evidence that evoked responses are the primary mechanism underlying cortical speech envelope representations, with no evidence for oscillatory entrainment.</abstract><cop>United States</cop><pub>Society for Neuroscience</pub><pmid>37185238</pmid><doi>10.1523/JNEUROSCI.1663-22.2023</doi><tpages>13</tpages><orcidid>https://orcid.org/0000-0003-2480-4700</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0270-6474
ispartof The Journal of neuroscience, 2023-05, Vol.43 (21), p.3909-3921
issn 0270-6474
1529-2401
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_10218004
source MEDLINE; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals; PubMed Central
subjects Acoustic Stimulation - methods
Acoustics
Alignment
Amplitudes
Auditory Cortex - physiology
Auditory Perception
Computational neuroscience
Entrainment
Female
Frequency
Frequency dependence
Humans
Information processing
Locking
Oscillations
Representations
Speech
Speech - physiology
Speech perception
Speech Perception - physiology
Speech processing
Temporal variations
Tracking
title Phase Alignment of Low-Frequency Neural Activity to the Amplitude Envelope of Speech Reflects Evoked Responses to Acoustic Edges, Not Oscillatory Entrainment
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-09T07%3A17%3A25IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Phase%20Alignment%20of%20Low-Frequency%20Neural%20Activity%20to%20the%20Amplitude%20Envelope%20of%20Speech%20Reflects%20Evoked%20Responses%20to%20Acoustic%20Edges,%20Not%20Oscillatory%20Entrainment&rft.jtitle=The%20Journal%20of%20neuroscience&rft.au=Oganian,%20Yulia&rft.date=2023-05-24&rft.volume=43&rft.issue=21&rft.spage=3909&rft.epage=3921&rft.pages=3909-3921&rft.issn=0270-6474&rft.eissn=1529-2401&rft_id=info:doi/10.1523/JNEUROSCI.1663-22.2023&rft_dat=%3Cproquest_pubme%3E2814523873%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2819483285&rft_id=info:pmid/37185238&rfr_iscdi=true