TutorialBank: A Manually-Collected Corpus for Prerequisite Chains, Survey Extraction and Resource Recommendation

The field of Natural Language Processing (NLP) is growing rapidly, with new research published daily along with an abundance of tutorials, codebases and other online resources. In order to learn this dynamic field or stay up-to-date on the latest research, students as well as educators and researche...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	arXiv.org 2018-05
Hauptverfasser:	Fabbri, Alexander R, Li, Irene, Trairatvorakul, Prawat, He, Yijiao, Wei Tai Ting, Tung, Robert, Westerfield, Caitlin, Radev, Dragomir R
Format:	Artikel
Sprache:	eng
Schlagworte:	Annotations Artificial intelligence Datasets Education Information retrieval Internet resources Machine learning Natural language processing Scientific papers Search engines
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title	arXiv.org
container_volume
creator	Fabbri, Alexander R Li, Irene Trairatvorakul, Prawat He, Yijiao Wei Tai Ting Tung, Robert Westerfield, Caitlin Radev, Dragomir R
description	The field of Natural Language Processing (NLP) is growing rapidly, with new research published daily along with an abundance of tutorials, codebases and other online resources. In order to learn this dynamic field or stay up-to-date on the latest research, students as well as educators and researchers must constantly sift through multiple sources to find valuable, relevant information. To address this situation, we introduce TutorialBank, a new, publicly available dataset which aims to facilitate NLP education and research. We have manually collected and categorized over 6,300 resources on NLP as well as the related fields of Artificial Intelligence (AI), Machine Learning (ML) and Information Retrieval (IR). Our dataset is notably the largest manually-picked corpus of resources intended for NLP education which does not include only academic papers. Additionally, we have created both a search engine and a command-line tool for the resources and have annotated the corpus to include lists of research topics, relevant resources for each topic, prerequisite relations among topics, relevant sub-parts of individual resources, among other annotations. We are releasing the dataset and present several avenues for further research.
format	Article
fullrecord	<record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2073289996</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2073289996</sourcerecordid><originalsourceid>FETCH-proquest_journals_20732899963</originalsourceid><addsrcrecordid>eNqNi8sKwjAQRYMgKOo_DLi1UBNfdadFcSOIupehHTEakzpJRP_eCn6Aq3Ph3NMQbanUMJmNpGyJnvfXNE3lZCrHY9UW1TEGxxrNEu1tDgvYoo1ozDvJnTFUBCohd1xFD2fHsGNiekTtdSDIL6itH8Ah8pPesHoFxiJoZwFtCXvyLnJB9Sjc_U62xK_riuYZjafejx3RX6-O-Sap2D0i-XC61pmt1UmmUyVnWZZN1H-vD-hTSzM</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2073289996</pqid></control><display><type>article</type><title>TutorialBank: A Manually-Collected Corpus for Prerequisite Chains, Survey Extraction and Resource Recommendation</title><source>Free E- Journals</source><creator>Fabbri, Alexander R ; Li, Irene ; Trairatvorakul, Prawat ; He, Yijiao ; Wei Tai Ting ; Tung, Robert ; Westerfield, Caitlin ; Radev, Dragomir R</creator><creatorcontrib>Fabbri, Alexander R ; Li, Irene ; Trairatvorakul, Prawat ; He, Yijiao ; Wei Tai Ting ; Tung, Robert ; Westerfield, Caitlin ; Radev, Dragomir R</creatorcontrib><description>The field of Natural Language Processing (NLP) is growing rapidly, with new research published daily along with an abundance of tutorials, codebases and other online resources. In order to learn this dynamic field or stay up-to-date on the latest research, students as well as educators and researchers must constantly sift through multiple sources to find valuable, relevant information. To address this situation, we introduce TutorialBank, a new, publicly available dataset which aims to facilitate NLP education and research. We have manually collected and categorized over 6,300 resources on NLP as well as the related fields of Artificial Intelligence (AI), Machine Learning (ML) and Information Retrieval (IR). Our dataset is notably the largest manually-picked corpus of resources intended for NLP education which does not include only academic papers. Additionally, we have created both a search engine and a command-line tool for the resources and have annotated the corpus to include lists of research topics, relevant resources for each topic, prerequisite relations among topics, relevant sub-parts of individual resources, among other annotations. We are releasing the dataset and present several avenues for further research.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Annotations ; Artificial intelligence ; Datasets ; Education ; Information retrieval ; Internet resources ; Machine learning ; Natural language processing ; Scientific papers ; Search engines</subject><ispartof>arXiv.org, 2018-05</ispartof><rights>2018. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>776,780</link.rule.ids></links><search><creatorcontrib>Fabbri, Alexander R</creatorcontrib><creatorcontrib>Li, Irene</creatorcontrib><creatorcontrib>Trairatvorakul, Prawat</creatorcontrib><creatorcontrib>He, Yijiao</creatorcontrib><creatorcontrib>Wei Tai Ting</creatorcontrib><creatorcontrib>Tung, Robert</creatorcontrib><creatorcontrib>Westerfield, Caitlin</creatorcontrib><creatorcontrib>Radev, Dragomir R</creatorcontrib><title>TutorialBank: A Manually-Collected Corpus for Prerequisite Chains, Survey Extraction and Resource Recommendation</title><title>arXiv.org</title><description>The field of Natural Language Processing (NLP) is growing rapidly, with new research published daily along with an abundance of tutorials, codebases and other online resources. In order to learn this dynamic field or stay up-to-date on the latest research, students as well as educators and researchers must constantly sift through multiple sources to find valuable, relevant information. To address this situation, we introduce TutorialBank, a new, publicly available dataset which aims to facilitate NLP education and research. We have manually collected and categorized over 6,300 resources on NLP as well as the related fields of Artificial Intelligence (AI), Machine Learning (ML) and Information Retrieval (IR). Our dataset is notably the largest manually-picked corpus of resources intended for NLP education which does not include only academic papers. Additionally, we have created both a search engine and a command-line tool for the resources and have annotated the corpus to include lists of research topics, relevant resources for each topic, prerequisite relations among topics, relevant sub-parts of individual resources, among other annotations. We are releasing the dataset and present several avenues for further research.</description><subject>Annotations</subject><subject>Artificial intelligence</subject><subject>Datasets</subject><subject>Education</subject><subject>Information retrieval</subject><subject>Internet resources</subject><subject>Machine learning</subject><subject>Natural language processing</subject><subject>Scientific papers</subject><subject>Search engines</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><recordid>eNqNi8sKwjAQRYMgKOo_DLi1UBNfdadFcSOIupehHTEakzpJRP_eCn6Aq3Ph3NMQbanUMJmNpGyJnvfXNE3lZCrHY9UW1TEGxxrNEu1tDgvYoo1ozDvJnTFUBCohd1xFD2fHsGNiekTtdSDIL6itH8Ah8pPesHoFxiJoZwFtCXvyLnJB9Sjc_U62xK_riuYZjafejx3RX6-O-Sap2D0i-XC61pmt1UmmUyVnWZZN1H-vD-hTSzM</recordid><startdate>20180511</startdate><enddate>20180511</enddate><creator>Fabbri, Alexander R</creator><creator>Li, Irene</creator><creator>Trairatvorakul, Prawat</creator><creator>He, Yijiao</creator><creator>Wei Tai Ting</creator><creator>Tung, Robert</creator><creator>Westerfield, Caitlin</creator><creator>Radev, Dragomir R</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20180511</creationdate><title>TutorialBank: A Manually-Collected Corpus for Prerequisite Chains, Survey Extraction and Resource Recommendation</title><author>Fabbri, Alexander R ; Li, Irene ; Trairatvorakul, Prawat ; He, Yijiao ; Wei Tai Ting ; Tung, Robert ; Westerfield, Caitlin ; Radev, Dragomir R</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_20732899963</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Annotations</topic><topic>Artificial intelligence</topic><topic>Datasets</topic><topic>Education</topic><topic>Information retrieval</topic><topic>Internet resources</topic><topic>Machine learning</topic><topic>Natural language processing</topic><topic>Scientific papers</topic><topic>Search engines</topic><toplevel>online_resources</toplevel><creatorcontrib>Fabbri, Alexander R</creatorcontrib><creatorcontrib>Li, Irene</creatorcontrib><creatorcontrib>Trairatvorakul, Prawat</creatorcontrib><creatorcontrib>He, Yijiao</creatorcontrib><creatorcontrib>Wei Tai Ting</creatorcontrib><creatorcontrib>Tung, Robert</creatorcontrib><creatorcontrib>Westerfield, Caitlin</creatorcontrib><creatorcontrib>Radev, Dragomir R</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Fabbri, Alexander R</au><au>Li, Irene</au><au>Trairatvorakul, Prawat</au><au>He, Yijiao</au><au>Wei Tai Ting</au><au>Tung, Robert</au><au>Westerfield, Caitlin</au><au>Radev, Dragomir R</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>TutorialBank: A Manually-Collected Corpus for Prerequisite Chains, Survey Extraction and Resource Recommendation</atitle><jtitle>arXiv.org</jtitle><date>2018-05-11</date><risdate>2018</risdate><eissn>2331-8422</eissn><abstract>The field of Natural Language Processing (NLP) is growing rapidly, with new research published daily along with an abundance of tutorials, codebases and other online resources. In order to learn this dynamic field or stay up-to-date on the latest research, students as well as educators and researchers must constantly sift through multiple sources to find valuable, relevant information. To address this situation, we introduce TutorialBank, a new, publicly available dataset which aims to facilitate NLP education and research. We have manually collected and categorized over 6,300 resources on NLP as well as the related fields of Artificial Intelligence (AI), Machine Learning (ML) and Information Retrieval (IR). Our dataset is notably the largest manually-picked corpus of resources intended for NLP education which does not include only academic papers. Additionally, we have created both a search engine and a command-line tool for the resources and have annotated the corpus to include lists of research topics, relevant resources for each topic, prerequisite relations among topics, relevant sub-parts of individual resources, among other annotations. We are releasing the dataset and present several avenues for further research.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	EISSN: 2331-8422
ispartof	arXiv.org, 2018-05
issn	2331-8422
language	eng
recordid	cdi_proquest_journals_2073289996
source	Free E- Journals
subjects	Annotations Artificial intelligence Datasets Education Information retrieval Internet resources Machine learning Natural language processing Scientific papers Search engines
title	TutorialBank: A Manually-Collected Corpus for Prerequisite Chains, Survey Extraction and Resource Recommendation
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-23T15%3A17%3A03IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=TutorialBank:%20A%20Manually-Collected%20Corpus%20for%20Prerequisite%20Chains,%20Survey%20Extraction%20and%20Resource%20Recommendation&rft.jtitle=arXiv.org&rft.au=Fabbri,%20Alexander%20R&rft.date=2018-05-11&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2073289996%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2073289996&rft_id=info:pmid/&rfr_iscdi=true