To stop or not to stop - Experiments on stopword elimination for information retrieval of Gujarati text documents

Words that frequently occur in a document but carry less significant meaning are called stopwords. Identification and removal of stopwords can result in effective indexing of documents. Mean average precision (MAP) is the metric used to measure the efficiency of information retrieval (IR) tasks. In...
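As a rough illustration of the two ideas the abstract names, the sketch below filters stopwords from a token list and computes mean average precision over ranked results. It is not the authors' implementation: the function names, the data layout (`runs`, `qrels`), and the sample tokens and document ids are all assumptions made for this example, and the uninterpolated definition of average precision is assumed.

```python
from typing import Dict, Iterable, List, Sequence, Set


def remove_stopwords(tokens: Iterable[str], stopwords: Set[str]) -> List[str]:
    """Keep only the tokens that are not in the stopword set."""
    return [token for token in tokens if token not in stopwords]


def average_precision(ranked_ids: Sequence[str], relevant_ids: Set[str]) -> float:
    """Sum precision@k at each rank k holding a relevant document,
    then divide by the total number of relevant documents.
    Assumes ranked_ids contains no duplicate ids."""
    if not relevant_ids:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, doc_id in enumerate(ranked_ids, start=1):
        if doc_id in relevant_ids:
            hits += 1
            precision_sum += hits / rank
    return precision_sum / len(relevant_ids)


def mean_average_precision(
    runs: Dict[str, Sequence[str]], qrels: Dict[str, Set[str]]
) -> float:
    """Mean of per-query average precision over every query in qrels.
    A query with no ranked results contributes 0."""
    if not qrels:
        return 0.0
    total = sum(
        average_precision(runs.get(query_id, []), relevant)
        for query_id, relevant in qrels.items()
    )
    return total / len(qrels)


# Hypothetical data: placeholder English tokens and made-up document ids.
stopwords = {"the", "of", "and"}
query_terms = remove_stopwords(["the", "history", "of", "cricket"], stopwords)

runs = {"q1": ["d1", "d2", "d3"], "q2": ["d5", "d4"]}
qrels = {"q1": {"d1", "d3"}, "q2": {"d4"}}
map_score = mean_average_precision(runs, qrels)
```

To compare retrieval with and without stopword elimination in this setup, one would index and query twice, once with `remove_stopwords` applied and once without, and compare the two `map_score` values.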

Detailed description

Bibliographic details
Main authors: Joshi, H., Pareek, J., Patel, R., Chauhan, K.
Format: Conference proceeding
Language: eng
Subjects:
Online access: Order full text
container_end_page 4
container_issue
container_start_page 1
container_title
container_volume
creator Joshi, H.
Pareek, J.
Patel, R.
Chauhan, K.
description Words that frequently occur in a document but carry less significant meaning are called stopwords. Identification and removal of stopwords can result in effective indexing of documents. Mean average precision (MAP) is the metric used to measure the efficiency of information retrieval (IR) tasks. In this paper, we have experimented with elimination of Gujarati stopwords to measure the improvements in ad hoc monolingual information retrieval of Gujarati text documents. Results show that elimination of stopwords improves the MAP values of Gujarati IR.
doi_str_mv 10.1109/NUICONE.2012.6493219
format Conference Proceeding
fullrecord <record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_6493219</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>6493219</ieee_id><sourcerecordid>6493219</sourcerecordid><originalsourceid>FETCH-LOGICAL-i175t-bbc0c4d9d6ca2a47e935f015c9ca5825dc44464178d90dca35b2895575b905e63</originalsourceid><addsrcrecordid>eNotkN1qAjEQRlPaQq31CdqLvMDaTH42yWWRrRVEb_RaskkWIrqx2djat--iezPDOTAfzIfQG5ApANHvq-1itl5VU0qATkuuGQV9hyZaKuClZCBBwz16HoAS_YBGlElRAFX0CU26bk8IYaCUJuUIfW8i7nI84ZhwGzPOAxa4upx8Ckff5g7H9mp_Y3LYH8IxtCaHXjb9VWj7ebxx8jkF_2MOODZ4ft6b1Huc_SVjF-35GvaCHhtz6Pxk2GO0_aw2s69iuZ4vZh_LIoAUuahrSyx32pXWUMOl10w0BITV1ghFhbOc85KDVE4TZw0TNVVaCClqTYQv2Ri93nKD93536l8x6W83NMb-AUcqX70</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>To stop or not to stop - Experiments on stopword elimination for information retrieval of Gujarati text documents</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Joshi, H. ; Pareek, J. ; Patel, R. ; Chauhan, K.</creator><creatorcontrib>Joshi, H. ; Pareek, J. ; Patel, R. ; Chauhan, K.</creatorcontrib><description>Words that frequently occur in a document but carry less significant meaning are called stopwords. Identification and removal of stopwords can result in effective indexing of documents. Mean average precision (MAP) is the metric used to measure the efficiency of information retrieval (IR) tasks. In this paper, we have experimented with elimination of Gujarati stopwords to measure the improvements in Adhoc monolingual information retrieval of Gujarati text documents. 
Results show that elimination of stopwords improve the MAP values of Gujarati IR.</description><identifier>ISSN: 2375-1282</identifier><identifier>ISBN: 1467317209</identifier><identifier>ISBN: 9781467317207</identifier><identifier>EISBN: 9781467317191</identifier><identifier>EISBN: 1467317187</identifier><identifier>EISBN: 1467317195</identifier><identifier>EISBN: 9781467317184</identifier><identifier>DOI: 10.1109/NUICONE.2012.6493219</identifier><language>eng</language><publisher>IEEE</publisher><subject>Automatic Indexing ; Corpus ; Gujarati Information Retrieval ; Mean Average Precision ; Stopwords</subject><ispartof>2012 Nirma University International Conference on Engineering (NUiCONE), 2012, p.1-4</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/6493219$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,778,782,787,788,2054,27908,54903</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/6493219$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Joshi, H.</creatorcontrib><creatorcontrib>Pareek, J.</creatorcontrib><creatorcontrib>Patel, R.</creatorcontrib><creatorcontrib>Chauhan, K.</creatorcontrib><title>To stop or not to stop - Experiments on stopword elimination for information retrieval of Gujarati text documents</title><title>2012 Nirma University International Conference on Engineering (NUiCONE)</title><addtitle>NUICONE</addtitle><description>Words that frequently occur in a document but carry less significant meaning are called stopwords. Identification and removal of stopwords can result in effective indexing of documents. Mean average precision (MAP) is the metric used to measure the efficiency of information retrieval (IR) tasks. 
In this paper, we have experimented with elimination of Gujarati stopwords to measure the improvements in Adhoc monolingual information retrieval of Gujarati text documents. Results show that elimination of stopwords improve the MAP values of Gujarati IR.</description><subject>Automatic Indexing</subject><subject>Corpus</subject><subject>Gujarati Information Retrieval</subject><subject>Mean Average Precision</subject><subject>Stopwords</subject><issn>2375-1282</issn><isbn>1467317209</isbn><isbn>9781467317207</isbn><isbn>9781467317191</isbn><isbn>1467317187</isbn><isbn>1467317195</isbn><isbn>9781467317184</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2012</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNotkN1qAjEQRlPaQq31CdqLvMDaTH42yWWRrRVEb_RaskkWIrqx2djat--iezPDOTAfzIfQG5ApANHvq-1itl5VU0qATkuuGQV9hyZaKuClZCBBwz16HoAS_YBGlElRAFX0CU26bk8IYaCUJuUIfW8i7nI84ZhwGzPOAxa4upx8Ckff5g7H9mp_Y3LYH8IxtCaHXjb9VWj7ebxx8jkF_2MOODZ4ft6b1Huc_SVjF-35GvaCHhtz6Pxk2GO0_aw2s69iuZ4vZh_LIoAUuahrSyx32pXWUMOl10w0BITV1ghFhbOc85KDVE4TZw0TNVVaCClqTYQv2Ri93nKD93536l8x6W83NMb-AUcqX70</recordid><startdate>201212</startdate><enddate>201212</enddate><creator>Joshi, H.</creator><creator>Pareek, J.</creator><creator>Patel, R.</creator><creator>Chauhan, K.</creator><general>IEEE</general><scope>6IE</scope><scope>6IL</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIL</scope></search><sort><creationdate>201212</creationdate><title>To stop or not to stop - Experiments on stopword elimination for information retrieval of Gujarati text documents</title><author>Joshi, H. ; Pareek, J. ; Patel, R. 
; Chauhan, K.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i175t-bbc0c4d9d6ca2a47e935f015c9ca5825dc44464178d90dca35b2895575b905e63</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2012</creationdate><topic>Automatic Indexing</topic><topic>Corpus</topic><topic>Gujarati Information Retrieval</topic><topic>Mean Average Precision</topic><topic>Stopwords</topic><toplevel>online_resources</toplevel><creatorcontrib>Joshi, H.</creatorcontrib><creatorcontrib>Pareek, J.</creatorcontrib><creatorcontrib>Patel, R.</creatorcontrib><creatorcontrib>Chauhan, K.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP All) 1998-Present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Joshi, H.</au><au>Pareek, J.</au><au>Patel, R.</au><au>Chauhan, K.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>To stop or not to stop - Experiments on stopword elimination for information retrieval of Gujarati text documents</atitle><btitle>2012 Nirma University International Conference on Engineering (NUiCONE)</btitle><stitle>NUICONE</stitle><date>2012-12</date><risdate>2012</risdate><spage>1</spage><epage>4</epage><pages>1-4</pages><issn>2375-1282</issn><isbn>1467317209</isbn><isbn>9781467317207</isbn><eisbn>9781467317191</eisbn><eisbn>1467317187</eisbn><eisbn>1467317195</eisbn><eisbn>9781467317184</eisbn><abstract>Words that frequently occur in a document but carry less significant meaning are called stopwords. 
Identification and removal of stopwords can result in effective indexing of documents. Mean average precision (MAP) is the metric used to measure the efficiency of information retrieval (IR) tasks. In this paper, we have experimented with elimination of Gujarati stopwords to measure the improvements in Adhoc monolingual information retrieval of Gujarati text documents. Results show that elimination of stopwords improve the MAP values of Gujarati IR.</abstract><pub>IEEE</pub><doi>10.1109/NUICONE.2012.6493219</doi><tpages>4</tpages></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 2375-1282
ispartof 2012 Nirma University International Conference on Engineering (NUiCONE), 2012, p.1-4
issn 2375-1282
language eng
recordid cdi_ieee_primary_6493219
source IEEE Electronic Library (IEL) Conference Proceedings
subjects Automatic Indexing
Corpus
Gujarati Information Retrieval
Mean Average Precision
Stopwords
title To stop or not to stop - Experiments on stopword elimination for information retrieval of Gujarati text documents
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-16T14%3A59%3A46IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=To%20stop%20or%20not%20to%20stop%20-%20Experiments%20on%20stopword%20elimination%20for%20information%20retrieval%20of%20Gujarati%20text%20documents&rft.btitle=2012%20Nirma%20University%20International%20Conference%20on%20Engineering%20(NUiCONE)&rft.au=Joshi,%20H.&rft.date=2012-12&rft.spage=1&rft.epage=4&rft.pages=1-4&rft.issn=2375-1282&rft.isbn=1467317209&rft.isbn_list=9781467317207&rft_id=info:doi/10.1109/NUICONE.2012.6493219&rft_dat=%3Cieee_6IE%3E6493219%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&rft.eisbn=9781467317191&rft.eisbn_list=1467317187&rft.eisbn_list=1467317195&rft.eisbn_list=9781467317184&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=6493219&rfr_iscdi=true