To stop or not to stop - Experiments on stopword elimination for information retrieval of Gujarati text documents
Words that frequently occur in a document but carry less significant meaning are called stopwords. Identification and removal of stopwords can result in effective indexing of documents. Mean average precision (MAP) is the metric used to measure the efficiency of information retrieval (IR) tasks. In...
Main authors: | Joshi, H., Pareek, J., Patel, R., Chauhan, K. |
---|---|
Format: | Conference proceeding |
Language: | eng |
Subjects: | Automatic Indexing, Corpus, Gujarati Information Retrieval, Mean Average Precision, Stopwords |
Online access: | Order full text |
container_end_page | 4 |
---|---|
container_issue | |
container_start_page | 1 |
container_title | |
container_volume | |
creator | Joshi, H. ; Pareek, J. ; Patel, R. ; Chauhan, K. |
description | Words that frequently occur in a document but carry less significant meaning are called stopwords. Identification and removal of stopwords can result in effective indexing of documents. Mean average precision (MAP) is the metric used to measure the efficiency of information retrieval (IR) tasks. In this paper, we have experimented with elimination of Gujarati stopwords to measure the improvements in ad hoc monolingual information retrieval of Gujarati text documents. Results show that elimination of stopwords improves the MAP values of Gujarati IR. |
doi_str_mv | 10.1109/NUICONE.2012.6493219 |
format | Conference Proceeding |
fullrecord | <record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_6493219</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>6493219</ieee_id><sourcerecordid>6493219</sourcerecordid><originalsourceid>FETCH-LOGICAL-i175t-bbc0c4d9d6ca2a47e935f015c9ca5825dc44464178d90dca35b2895575b905e63</originalsourceid><addsrcrecordid>eNotkN1qAjEQRlPaQq31CdqLvMDaTH42yWWRrRVEb_RaskkWIrqx2djat--iezPDOTAfzIfQG5ApANHvq-1itl5VU0qATkuuGQV9hyZaKuClZCBBwz16HoAS_YBGlElRAFX0CU26bk8IYaCUJuUIfW8i7nI84ZhwGzPOAxa4upx8Ckff5g7H9mp_Y3LYH8IxtCaHXjb9VWj7ebxx8jkF_2MOODZ4ft6b1Huc_SVjF-35GvaCHhtz6Pxk2GO0_aw2s69iuZ4vZh_LIoAUuahrSyx32pXWUMOl10w0BITV1ghFhbOc85KDVE4TZw0TNVVaCClqTYQv2Ri93nKD93536l8x6W83NMb-AUcqX70</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>To stop or not to stop - Experiments on stopword elimination for information retrieval of Gujarati text documents</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Joshi, H. ; Pareek, J. ; Patel, R. ; Chauhan, K.</creator><creatorcontrib>Joshi, H. ; Pareek, J. ; Patel, R. ; Chauhan, K.</creatorcontrib><description>Words that frequently occur in a document but carry less significant meaning are called stopwords. Identification and removal of stopwords can result in effective indexing of documents. Mean average precision (MAP) is the metric used to measure the efficiency of information retrieval (IR) tasks. In this paper, we have experimented with elimination of Gujarati stopwords to measure the improvements in Adhoc monolingual information retrieval of Gujarati text documents. 
Results show that elimination of stopwords improve the MAP values of Gujarati IR.</description><identifier>ISSN: 2375-1282</identifier><identifier>ISBN: 1467317209</identifier><identifier>ISBN: 9781467317207</identifier><identifier>EISBN: 9781467317191</identifier><identifier>EISBN: 1467317187</identifier><identifier>EISBN: 1467317195</identifier><identifier>EISBN: 9781467317184</identifier><identifier>DOI: 10.1109/NUICONE.2012.6493219</identifier><language>eng</language><publisher>IEEE</publisher><subject>Automatic Indexing ; Corpus ; Gujarati Information Retrieval ; Mean Average Precision ; Stopwords</subject><ispartof>2012 Nirma University International Conference on Engineering (NUiCONE), 2012, p.1-4</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/6493219$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,778,782,787,788,2054,27908,54903</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/6493219$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Joshi, H.</creatorcontrib><creatorcontrib>Pareek, J.</creatorcontrib><creatorcontrib>Patel, R.</creatorcontrib><creatorcontrib>Chauhan, K.</creatorcontrib><title>To stop or not to stop - Experiments on stopword elimination for information retrieval of Gujarati text documents</title><title>2012 Nirma University International Conference on Engineering (NUiCONE)</title><addtitle>NUICONE</addtitle><description>Words that frequently occur in a document but carry less significant meaning are called stopwords. Identification and removal of stopwords can result in effective indexing of documents. Mean average precision (MAP) is the metric used to measure the efficiency of information retrieval (IR) tasks. 
In this paper, we have experimented with elimination of Gujarati stopwords to measure the improvements in Adhoc monolingual information retrieval of Gujarati text documents. Results show that elimination of stopwords improve the MAP values of Gujarati IR.</description><subject>Automatic Indexing</subject><subject>Corpus</subject><subject>Gujarati Information Retrieval</subject><subject>Mean Average Precision</subject><subject>Stopwords</subject><issn>2375-1282</issn><isbn>1467317209</isbn><isbn>9781467317207</isbn><isbn>9781467317191</isbn><isbn>1467317187</isbn><isbn>1467317195</isbn><isbn>9781467317184</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2012</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNotkN1qAjEQRlPaQq31CdqLvMDaTH42yWWRrRVEb_RaskkWIrqx2djat--iezPDOTAfzIfQG5ApANHvq-1itl5VU0qATkuuGQV9hyZaKuClZCBBwz16HoAS_YBGlElRAFX0CU26bk8IYaCUJuUIfW8i7nI84ZhwGzPOAxa4upx8Ckff5g7H9mp_Y3LYH8IxtCaHXjb9VWj7ebxx8jkF_2MOODZ4ft6b1Huc_SVjF-35GvaCHhtz6Pxk2GO0_aw2s69iuZ4vZh_LIoAUuahrSyx32pXWUMOl10w0BITV1ghFhbOc85KDVE4TZw0TNVVaCClqTYQv2Ri93nKD93536l8x6W83NMb-AUcqX70</recordid><startdate>201212</startdate><enddate>201212</enddate><creator>Joshi, H.</creator><creator>Pareek, J.</creator><creator>Patel, R.</creator><creator>Chauhan, K.</creator><general>IEEE</general><scope>6IE</scope><scope>6IL</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIL</scope></search><sort><creationdate>201212</creationdate><title>To stop or not to stop - Experiments on stopword elimination for information retrieval of Gujarati text documents</title><author>Joshi, H. ; Pareek, J. ; Patel, R. 
; Chauhan, K.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i175t-bbc0c4d9d6ca2a47e935f015c9ca5825dc44464178d90dca35b2895575b905e63</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2012</creationdate><topic>Automatic Indexing</topic><topic>Corpus</topic><topic>Gujarati Information Retrieval</topic><topic>Mean Average Precision</topic><topic>Stopwords</topic><toplevel>online_resources</toplevel><creatorcontrib>Joshi, H.</creatorcontrib><creatorcontrib>Pareek, J.</creatorcontrib><creatorcontrib>Patel, R.</creatorcontrib><creatorcontrib>Chauhan, K.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP All) 1998-Present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Joshi, H.</au><au>Pareek, J.</au><au>Patel, R.</au><au>Chauhan, K.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>To stop or not to stop - Experiments on stopword elimination for information retrieval of Gujarati text documents</atitle><btitle>2012 Nirma University International Conference on Engineering (NUiCONE)</btitle><stitle>NUICONE</stitle><date>2012-12</date><risdate>2012</risdate><spage>1</spage><epage>4</epage><pages>1-4</pages><issn>2375-1282</issn><isbn>1467317209</isbn><isbn>9781467317207</isbn><eisbn>9781467317191</eisbn><eisbn>1467317187</eisbn><eisbn>1467317195</eisbn><eisbn>9781467317184</eisbn><abstract>Words that frequently occur in a document but carry less significant meaning are called stopwords. 
Identification and removal of stopwords can result in effective indexing of documents. Mean average precision (MAP) is the metric used to measure the efficiency of information retrieval (IR) tasks. In this paper, we have experimented with elimination of Gujarati stopwords to measure the improvements in Adhoc monolingual information retrieval of Gujarati text documents. Results show that elimination of stopwords improve the MAP values of Gujarati IR.</abstract><pub>IEEE</pub><doi>10.1109/NUICONE.2012.6493219</doi><tpages>4</tpages></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | ISSN: 2375-1282 |
ispartof | 2012 Nirma University International Conference on Engineering (NUiCONE), 2012, p.1-4 |
issn | 2375-1282 |
language | eng |
recordid | cdi_ieee_primary_6493219 |
source | IEEE Electronic Library (IEL) Conference Proceedings |
subjects | Automatic Indexing ; Corpus ; Gujarati Information Retrieval ; Mean Average Precision ; Stopwords |
title | To stop or not to stop - Experiments on stopword elimination for information retrieval of Gujarati text documents |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-16T14%3A59%3A46IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=To%20stop%20or%20not%20to%20stop%20-%20Experiments%20on%20stopword%20elimination%20for%20information%20retrieval%20of%20Gujarati%20text%20documents&rft.btitle=2012%20Nirma%20University%20International%20Conference%20on%20Engineering%20(NUiCONE)&rft.au=Joshi,%20H.&rft.date=2012-12&rft.spage=1&rft.epage=4&rft.pages=1-4&rft.issn=2375-1282&rft.isbn=1467317209&rft.isbn_list=9781467317207&rft_id=info:doi/10.1109/NUICONE.2012.6493219&rft_dat=%3Cieee_6IE%3E6493219%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&rft.eisbn=9781467317191&rft.eisbn_list=1467317187&rft.eisbn_list=1467317195&rft.eisbn_list=9781467317184&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=6493219&rfr_iscdi=true |