Categorizing Unknown Words: A Decision Tree-Based Misspelling Identifier

This paper introduces a robust, portable system for categorizing unknown words. It is based on a multi-component architecture where each component is responsible for identifying one class of unknown words. The focus of this paper is the component that identifies spelling errors. The misspelling ide...

Detailed description

Bibliographic details
Main author: Toole, Janine
Format: Conference proceeding
Language: eng
Subjects:
Online access: Full text
container_end_page 133
container_issue
container_start_page 122
container_title
container_volume
creator Toole, Janine
description This paper introduces a robust, portable system for categorizing unknown words. It is based on a multi-component architecture where each component is responsible for identifying one class of unknown words. The focus of this paper is the component that identifies spelling errors. The misspelling identifier uses a decision tree architecture to combine multiple types of evidence about the unknown word. The misspelling identifier is evaluated using data from live closed captions - a genre replete with a wide variety of unknown words.
doi_str_mv 10.1007/3-540-46695-9_11
format Conference Proceeding
fullrecord <record><control><sourceid>pascalfrancis_sprin</sourceid><recordid>TN_cdi_pascalfrancis_primary_1173769</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1173769</sourcerecordid><originalsourceid>FETCH-LOGICAL-p223t-2c2f905fada17ce4330fd0d36907a3ce8380f3036d92175cbde9f7ef4418dace3</originalsourceid><addsrcrecordid>eNotkM1PAjEQxetXIiJ3j3vwWmw73XbrDfEDEowXiMembFtSWbublsTgX-8CzmWS995MXn4I3VEypoTIB8AlJ5gLoUqsNKVn6AZ65SioczSgglIMwNXFyRCiYqy8RAMChGElOVyjUc5fpB9gpBJ8gGZTs3ObNoXfEDfFKm5j-xOLzzbZ_FhMimdXhxzaWCyTc_jJZGeL95Bz55rmcDC3Lu6CDy7doitvmuxG_3uIVq8vy-kMLz7e5tPJAneMwQ6zmnlFSm-sobJ2HIB4SywIRaSB2lVQEQ8EhFWMyrJeW6e8dJ5zWllTOxii-9PfzuTaND6Z2FfUXQrfJu17KhKkUH1sfIrl3okbl_S6bbdZU6IPLDXoHpA-otMHlvAH7_5iBg</addsrcrecordid><sourcetype>Index Database</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Categorizing Unknown Words: A Decision Tree-Based Misspelling Identifier</title><source>Springer Books</source><creator>Toole, Janine</creator><contributor>Foo, Norman</contributor><creatorcontrib>Toole, Janine ; Foo, Norman</creatorcontrib><description>This paper introduces a robust, portable system for categorizing unknown words. It is based on a multi- component architecture where each component is responsible for identifying one class of unknown words. The focus of this paper is the component that identifies spelling errors. The misspelling identifier uses a decision tree architecture to combine multiple types of evidence about the unknown word. 
The misspelling identifier is evaluated using data from live closed captions - a genre replete with a wide variety of unknown words.</description><identifier>ISSN: 0302-9743</identifier><identifier>ISBN: 3540668225</identifier><identifier>ISBN: 9783540668220</identifier><identifier>EISSN: 1611-3349</identifier><identifier>EISBN: 3540466959</identifier><identifier>EISBN: 9783540466956</identifier><identifier>DOI: 10.1007/3-540-46695-9_11</identifier><language>eng</language><publisher>Berlin, Heidelberg: Springer Berlin Heidelberg</publisher><subject>Applied sciences ; Artificial intelligence ; Baseline Case ; Computational Linguistics ; Computer science; control theory; systems ; Edit Distance ; Exact sciences and technology ; Speech and sound recognition and synthesis. Linguistics ; Spelling Error ; Word Length</subject><ispartof>Advanced Topics in Artificial Intelligence, 1999, p.122-133</ispartof><rights>Springer-Verlag Berlin Heidelberg 1999</rights><rights>2000 INIST-CNRS</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/3-540-46695-9_11$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/3-540-46695-9_11$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>309,310,776,777,781,786,787,790,4037,4038,27907,38237,41424,42493</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&amp;idt=1173769$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><contributor>Foo, Norman</contributor><creatorcontrib>Toole, Janine</creatorcontrib><title>Categorizing Unknown Words: A Decision Tree-Based Misspelling Identifier</title><title>Advanced Topics in Artificial Intelligence</title><description>This paper introduces a 
robust, portable system for categorizing unknown words. It is based on a multi- component architecture where each component is responsible for identifying one class of unknown words. The focus of this paper is the component that identifies spelling errors. The misspelling identifier uses a decision tree architecture to combine multiple types of evidence about the unknown word. The misspelling identifier is evaluated using data from live closed captions - a genre replete with a wide variety of unknown words.</description><subject>Applied sciences</subject><subject>Artificial intelligence</subject><subject>Baseline Case</subject><subject>Computational Linguistics</subject><subject>Computer science; control theory; systems</subject><subject>Edit Distance</subject><subject>Exact sciences and technology</subject><subject>Speech and sound recognition and synthesis. Linguistics</subject><subject>Spelling Error</subject><subject>Word Length</subject><issn>0302-9743</issn><issn>1611-3349</issn><isbn>3540668225</isbn><isbn>9783540668220</isbn><isbn>3540466959</isbn><isbn>9783540466956</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>1999</creationdate><recordtype>conference_proceeding</recordtype><recordid>eNotkM1PAjEQxetXIiJ3j3vwWmw73XbrDfEDEowXiMembFtSWbublsTgX-8CzmWS995MXn4I3VEypoTIB8AlJ5gLoUqsNKVn6AZ65SioczSgglIMwNXFyRCiYqy8RAMChGElOVyjUc5fpB9gpBJ8gGZTs3ObNoXfEDfFKm5j-xOLzzbZ_FhMimdXhxzaWCyTc_jJZGeL95Bz55rmcDC3Lu6CDy7doitvmuxG_3uIVq8vy-kMLz7e5tPJAneMwQ6zmnlFSm-sobJ2HIB4SywIRaSB2lVQEQ8EhFWMyrJeW6e8dJ5zWllTOxii-9PfzuTaND6Z2FfUXQrfJu17KhKkUH1sfIrl3okbl_S6bbdZU6IPLDXoHpA-otMHlvAH7_5iBg</recordid><startdate>1999</startdate><enddate>1999</enddate><creator>Toole, Janine</creator><general>Springer Berlin Heidelberg</general><general>Springer</general><scope>IQODW</scope></search><sort><creationdate>1999</creationdate><title>Categorizing Unknown Words: A Decision Tree-Based Misspelling Identifier</title><author>Toole, 
Janine</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-p223t-2c2f905fada17ce4330fd0d36907a3ce8380f3036d92175cbde9f7ef4418dace3</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>1999</creationdate><topic>Applied sciences</topic><topic>Artificial intelligence</topic><topic>Baseline Case</topic><topic>Computational Linguistics</topic><topic>Computer science; control theory; systems</topic><topic>Edit Distance</topic><topic>Exact sciences and technology</topic><topic>Speech and sound recognition and synthesis. Linguistics</topic><topic>Spelling Error</topic><topic>Word Length</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Toole, Janine</creatorcontrib><collection>Pascal-Francis</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Toole, Janine</au><au>Foo, Norman</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Categorizing Unknown Words: A Decision Tree-Based Misspelling Identifier</atitle><btitle>Advanced Topics in Artificial Intelligence</btitle><date>1999</date><risdate>1999</risdate><spage>122</spage><epage>133</epage><pages>122-133</pages><issn>0302-9743</issn><eissn>1611-3349</eissn><isbn>3540668225</isbn><isbn>9783540668220</isbn><eisbn>3540466959</eisbn><eisbn>9783540466956</eisbn><abstract>This paper introduces a robust, portable system for categorizing unknown words. It is based on a multi- component architecture where each component is responsible for identifying one class of unknown words. The focus of this paper is the component that identifies spelling errors. The misspelling identifier uses a decision tree architecture to combine multiple types of evidence about the unknown word. 
The misspelling identifier is evaluated using data from live closed captions - a genre replete with a wide variety of unknown words.</abstract><cop>Berlin, Heidelberg</cop><pub>Springer Berlin Heidelberg</pub><doi>10.1007/3-540-46695-9_11</doi><tpages>12</tpages></addata></record>
fulltext fulltext
identifier ISSN: 0302-9743
ispartof Advanced Topics in Artificial Intelligence, 1999, p.122-133
issn 0302-9743
1611-3349
language eng
recordid cdi_pascalfrancis_primary_1173769
source Springer Books
subjects Applied sciences
Artificial intelligence
Baseline Case
Computational Linguistics
Computer science; control theory; systems
Edit Distance
Exact sciences and technology
Speech and sound recognition and synthesis. Linguistics
Spelling Error
Word Length
title Categorizing Unknown Words: A Decision Tree-Based Misspelling Identifier
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-17T09%3A29%3A52IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-pascalfrancis_sprin&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Categorizing%20Unknown%20Words:%20A%20Decision%20Tree-Based%20Misspelling%20Identifier&rft.btitle=Advanced%20Topics%20in%20Artificial%20Intelligence&rft.au=Toole,%20Janine&rft.date=1999&rft.spage=122&rft.epage=133&rft.pages=122-133&rft.issn=0302-9743&rft.eissn=1611-3349&rft.isbn=3540668225&rft.isbn_list=9783540668220&rft_id=info:doi/10.1007/3-540-46695-9_11&rft_dat=%3Cpascalfrancis_sprin%3E1173769%3C/pascalfrancis_sprin%3E%3Curl%3E%3C/url%3E&rft.eisbn=3540466959&rft.eisbn_list=9783540466956&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true