Order matters: Distributional properties of speech to young children bootstraps learning of semantic representations

Some researchers claim that language acquisition is critically dependent on experiencing linguistic input in order of increasing complexity. We set out to test this hypothesis using a simple recurrent neural network (SRN) trained to predict word sequences in CHILDES, a 5-million-word corpus of speec...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Huebner, Philip A, Willits, Jon A
Format:	Artikel
Sprache:	eng
Schlagworte:	Computer Science - Computation and Language
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	Huebner, Philip A Willits, Jon A
description	Some researchers claim that language acquisition is critically dependent on experiencing linguistic input in order of increasing complexity. We set out to test this hypothesis using a simple recurrent neural network (SRN) trained to predict word sequences in CHILDES, a 5-million-word corpus of speech directed to children. First, we demonstrated that age-ordered CHILDES exhibits a gradual increase in linguistic complexity. Next, we compared the performance of two groups of SRNs trained on CHILDES which had either been age-ordered or not. Specifically, we assessed learning of grammatical and semantic structure and showed that training on age-ordered input facilitates learning of semantic, but not of sequential structure. We found that this advantage is eliminated when the models were trained on input with utterance boundary information removed.
doi_str_mv	10.48550/arxiv.1802.00768
format	Article
fullrecord	<record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_1802_00768</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1802_00768</sourcerecordid><originalsourceid>FETCH-LOGICAL-a678-84125691f65c05d70f812c46f16afb5cabda5a9861824d2aff63f41c124b1f073</originalsourceid><addsrcrecordid>eNotj71OwzAUhbMwoMIDMNUvkGAntuOyofIrVerSPbpxrqmlJLauXUTfnqYwneGc80lfUTwIXkmjFH8E-vHflTC8rjhvtbkt8p4GJDZBzkjpib34lMn3p-zDDCOLFCJS9phYcCxFRHtkObBzOM1fzB79OBDOrA8hX34QExsRaPaXctnjBHP2lhFGwoRzhoWb7oobB2PC-_9cFYe318P2o9zt3z-3z7sSdGtKI0Wt9EY4rSxXQ8udEbWV2gkNrlcW-gEUbIwWppZDDc7pxklhRS174XjbrIr1H_aq3UXyE9C5W_S7q37zC7-LWJE</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Order matters: Distributional properties of speech to young children bootstraps learning of semantic representations</title><source>arXiv.org</source><creator>Huebner, Philip A ; Willits, Jon A</creator><creatorcontrib>Huebner, Philip A ; Willits, Jon A</creatorcontrib><description>Some researchers claim that language acquisition is critically dependent on experiencing linguistic input in order of increasing complexity. We set out to test this hypothesis using a simple recurrent neural network (SRN) trained to predict word sequences in CHILDES, a 5-million-word corpus of speech directed to children. First, we demonstrated that age-ordered CHILDES exhibits a gradual increase in linguistic complexity. Next, we compared the performance of two groups of SRNs trained on CHILDES which had either been age-ordered or not. Specifically, we assessed learning of grammatical and semantic structure and showed that training on age-ordered input facilitates learning of semantic, but not of sequential structure. We found that this advantage is eliminated when the models were trained on input with utterance boundary information removed.</description><identifier>DOI: 10.48550/arxiv.1802.00768</identifier><language>eng</language><subject>Computer Science - Computation and Language</subject><creationdate>2018-02</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,885</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/1802.00768$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.1802.00768$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Huebner, Philip A</creatorcontrib><creatorcontrib>Willits, Jon A</creatorcontrib><title>Order matters: Distributional properties of speech to young children bootstraps learning of semantic representations</title><description>Some researchers claim that language acquisition is critically dependent on experiencing linguistic input in order of increasing complexity. We set out to test this hypothesis using a simple recurrent neural network (SRN) trained to predict word sequences in CHILDES, a 5-million-word corpus of speech directed to children. First, we demonstrated that age-ordered CHILDES exhibits a gradual increase in linguistic complexity. Next, we compared the performance of two groups of SRNs trained on CHILDES which had either been age-ordered or not. Specifically, we assessed learning of grammatical and semantic structure and showed that training on age-ordered input facilitates learning of semantic, but not of sequential structure. We found that this advantage is eliminated when the models were trained on input with utterance boundary information removed.</description><subject>Computer Science - Computation and Language</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotj71OwzAUhbMwoMIDMNUvkGAntuOyofIrVerSPbpxrqmlJLauXUTfnqYwneGc80lfUTwIXkmjFH8E-vHflTC8rjhvtbkt8p4GJDZBzkjpib34lMn3p-zDDCOLFCJS9phYcCxFRHtkObBzOM1fzB79OBDOrA8hX34QExsRaPaXctnjBHP2lhFGwoRzhoWb7oobB2PC-_9cFYe318P2o9zt3z-3z7sSdGtKI0Wt9EY4rSxXQ8udEbWV2gkNrlcW-gEUbIwWppZDDc7pxklhRS174XjbrIr1H_aq3UXyE9C5W_S7q37zC7-LWJE</recordid><startdate>20180202</startdate><enddate>20180202</enddate><creator>Huebner, Philip A</creator><creator>Willits, Jon A</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20180202</creationdate><title>Order matters: Distributional properties of speech to young children bootstraps learning of semantic representations</title><author>Huebner, Philip A ; Willits, Jon A</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a678-84125691f65c05d70f812c46f16afb5cabda5a9861824d2aff63f41c124b1f073</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Computer Science - Computation and Language</topic><toplevel>online_resources</toplevel><creatorcontrib>Huebner, Philip A</creatorcontrib><creatorcontrib>Willits, Jon A</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Huebner, Philip A</au><au>Willits, Jon A</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Order matters: Distributional properties of speech to young children bootstraps learning of semantic representations</atitle><date>2018-02-02</date><risdate>2018</risdate><abstract>Some researchers claim that language acquisition is critically dependent on experiencing linguistic input in order of increasing complexity. We set out to test this hypothesis using a simple recurrent neural network (SRN) trained to predict word sequences in CHILDES, a 5-million-word corpus of speech directed to children. First, we demonstrated that age-ordered CHILDES exhibits a gradual increase in linguistic complexity. Next, we compared the performance of two groups of SRNs trained on CHILDES which had either been age-ordered or not. Specifically, we assessed learning of grammatical and semantic structure and showed that training on age-ordered input facilitates learning of semantic, but not of sequential structure. We found that this advantage is eliminated when the models were trained on input with utterance boundary information removed.</abstract><doi>10.48550/arxiv.1802.00768</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier	DOI: 10.48550/arxiv.1802.00768
ispartof
issn
language	eng
recordid	cdi_arxiv_primary_1802_00768
source	arXiv.org
subjects	Computer Science - Computation and Language
title	Order matters: Distributional properties of speech to young children bootstraps learning of semantic representations
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-27T11%3A05%3A34IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Order%20matters:%20Distributional%20properties%20of%20speech%20to%20young%20children%20bootstraps%20learning%20of%20semantic%20representations&rft.au=Huebner,%20Philip%20A&rft.date=2018-02-02&rft_id=info:doi/10.48550/arxiv.1802.00768&rft_dat=%3Carxiv_GOX%3E1802_00768%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true