Question retrieval using combined queries in community question answering
Community question answering (cQA) has emerged as a popular service on the web; users can use it to ask and answer questions and access historical question-answer (QA) pairs. cQA retrieval, as an alternative to general web searches, has several advantages. First, user can register a query in the for...
Gespeichert in:
Veröffentlicht in: | Journal of intelligent information systems 2020-10, Vol.55 (2), p.307-327 |
---|---|
Hauptverfasser: | , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 327 |
---|---|
container_issue | 2 |
container_start_page | 307 |
container_title | Journal of intelligent information systems |
container_volume | 55 |
creator | Khushhal, Saquib Majid, Abdul Abbas, Syed Ali Nadeem, Malik Sajjad Ahmed Shah, Saeed Arif |
description | Community question answering (cQA) has emerged as a popular service on the web; users can use it to ask and answer questions and access historical question-answer (QA) pairs. cQA retrieval, as an alternative to general web searches, has several advantages. First, user can register a query in the form of natural language sentences instead of a set of keywords; thus, they can present the required information more clearly and comprehensively. Second, the system returns several possible answers instead of a long list of ranked documents, thereby enhancing the efficient location of the desired answers. Question retrieval from a cQA archive, an essential function of cQA retrieval services, aims to retrieve historical QA pairs relevant to the query question. In this study, combined queries (combined inverted and nextword indexes) are proposed for question retrieval in cQA. The method performance is investigated for two different scenarios: (a) when only questions from QA pairs are used as documents, and (b) when QA pairs are used as documents. In the proposed method, combined indexes are first created for both queries and documents; then, different information retrieval (IR) models are used to retrieve relevant questions from the cQA archive. Evaluation is performed on a public Yahoo! Answers dataset; the results thereby obtained show that using combined queries for all three IR models (vector space model, Okapi model, and language model) improves performance in terms of the retrieval precision and ranking effectiveness. Notably, by using combined indexes when both QA pairs are used as documents, the retrieval and ranking effectiveness of these cQA retrieval models increases significantly. |
doi_str_mv | 10.1007/s10844-020-00612-x |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2435627999</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2435627999</sourcerecordid><originalsourceid>FETCH-LOGICAL-c362t-379a7b2cb7d8d7c76d99d85f7162ac4efc5524164b07365f021ea4aa3b98d6093</originalsourceid><addsrcrecordid>eNp9UEtLAzEQDqJgrf4BTwueo5P35ijFR6Eggp5DNpstKW22Jrta_727XcGbp4HvNTMfQtcEbgmAussESs4xUMAAklB8OEEzIhTDSipximagqcBaAz1HFzlvAECXEmZo-dr73IU2Fsl3KfhPuy36HOK6cO2uCtHXxUfvByIXIY7Yro-h-x7ByWZj_hr4uL5EZ43dZn_1O-fo_fHhbfGMVy9Py8X9CjsmaYeZ0lZV1FWqLmvllKy1rkvRKCKpddw3TgjKieQVKCZFA5R4y61llS5rCZrN0c2Uu0_t8QqzafsUh5WGciYkVVqPKjqpXGpzTr4x-xR2Nn0bAmaszEyVmaEyc6zMHAYTm0x5P37k01_0P64fEllwGA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2435627999</pqid></control><display><type>article</type><title>Question retrieval using combined queries in community question answering</title><source>SpringerLink Journals - AutoHoldings</source><creator>Khushhal, Saquib ; Majid, Abdul ; Abbas, Syed Ali ; Nadeem, Malik Sajjad Ahmed ; Shah, Saeed Arif</creator><creatorcontrib>Khushhal, Saquib ; Majid, Abdul ; Abbas, Syed Ali ; Nadeem, Malik Sajjad Ahmed ; Shah, Saeed Arif</creatorcontrib><description>Community question answering (cQA) has emerged as a popular service on the web; users can use it to ask and answer questions and access historical question-answer (QA) pairs. cQA retrieval, as an alternative to general web searches, has several advantages. First, user can register a query in the form of natural language sentences instead of a set of keywords; thus, they can present the required information more clearly and comprehensively. Second, the system returns several possible answers instead of a long list of ranked documents, thereby enhancing the efficient location of the desired answers. Question retrieval from a cQA archive, an essential function of cQA retrieval services, aims to retrieve historical QA pairs relevant to the query question. In this study, combined queries (combined inverted and nextword indexes) are proposed for question retrieval in cQA. The method performance is investigated for two different scenarios: (a) when only questions from QA pairs are used as documents, and (b) when QA pairs are used as documents. In the proposed method, combined indexes are first created for both queries and documents; then, different information retrieval (IR) models are used to retrieve relevant questions from the cQA archive. Evaluation is performed on a public Yahoo! Answers dataset; the results thereby obtained show that using combined queries for all three IR models (vector space model, Okapi model, and language model) improves performance in terms of the retrieval precision and ranking effectiveness. Notably, by using combined indexes when both QA pairs are used as documents, the retrieval and ranking effectiveness of these cQA retrieval models increases significantly.</description><identifier>ISSN: 0925-9902</identifier><identifier>EISSN: 1573-7675</identifier><identifier>DOI: 10.1007/s10844-020-00612-x</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Archives & records ; Artificial Intelligence ; Computer Science ; Data Structures and Information Theory ; Information retrieval ; Information Storage and Retrieval ; IT in Business ; Natural Language Processing (NLP) ; Performance enhancement ; Performance indices ; Queries ; Query languages ; Questions ; Ranking ; Sentences</subject><ispartof>Journal of intelligent information systems, 2020-10, Vol.55 (2), p.307-327</ispartof><rights>Springer Science+Business Media, LLC, part of Springer Nature 2020</rights><rights>Springer Science+Business Media, LLC, part of Springer Nature 2020.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c362t-379a7b2cb7d8d7c76d99d85f7162ac4efc5524164b07365f021ea4aa3b98d6093</citedby><cites>FETCH-LOGICAL-c362t-379a7b2cb7d8d7c76d99d85f7162ac4efc5524164b07365f021ea4aa3b98d6093</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s10844-020-00612-x$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s10844-020-00612-x$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,776,780,27901,27902,41464,42533,51294</link.rule.ids></links><search><creatorcontrib>Khushhal, Saquib</creatorcontrib><creatorcontrib>Majid, Abdul</creatorcontrib><creatorcontrib>Abbas, Syed Ali</creatorcontrib><creatorcontrib>Nadeem, Malik Sajjad Ahmed</creatorcontrib><creatorcontrib>Shah, Saeed Arif</creatorcontrib><title>Question retrieval using combined queries in community question answering</title><title>Journal of intelligent information systems</title><addtitle>J Intell Inf Syst</addtitle><description>Community question answering (cQA) has emerged as a popular service on the web; users can use it to ask and answer questions and access historical question-answer (QA) pairs. cQA retrieval, as an alternative to general web searches, has several advantages. First, user can register a query in the form of natural language sentences instead of a set of keywords; thus, they can present the required information more clearly and comprehensively. Second, the system returns several possible answers instead of a long list of ranked documents, thereby enhancing the efficient location of the desired answers. Question retrieval from a cQA archive, an essential function of cQA retrieval services, aims to retrieve historical QA pairs relevant to the query question. In this study, combined queries (combined inverted and nextword indexes) are proposed for question retrieval in cQA. The method performance is investigated for two different scenarios: (a) when only questions from QA pairs are used as documents, and (b) when QA pairs are used as documents. In the proposed method, combined indexes are first created for both queries and documents; then, different information retrieval (IR) models are used to retrieve relevant questions from the cQA archive. Evaluation is performed on a public Yahoo! Answers dataset; the results thereby obtained show that using combined queries for all three IR models (vector space model, Okapi model, and language model) improves performance in terms of the retrieval precision and ranking effectiveness. Notably, by using combined indexes when both QA pairs are used as documents, the retrieval and ranking effectiveness of these cQA retrieval models increases significantly.</description><subject>Archives & records</subject><subject>Artificial Intelligence</subject><subject>Computer Science</subject><subject>Data Structures and Information Theory</subject><subject>Information retrieval</subject><subject>Information Storage and Retrieval</subject><subject>IT in Business</subject><subject>Natural Language Processing (NLP)</subject><subject>Performance enhancement</subject><subject>Performance indices</subject><subject>Queries</subject><subject>Query languages</subject><subject>Questions</subject><subject>Ranking</subject><subject>Sentences</subject><issn>0925-9902</issn><issn>1573-7675</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>BENPR</sourceid><recordid>eNp9UEtLAzEQDqJgrf4BTwueo5P35ijFR6Eggp5DNpstKW22Jrta_727XcGbp4HvNTMfQtcEbgmAussESs4xUMAAklB8OEEzIhTDSipximagqcBaAz1HFzlvAECXEmZo-dr73IU2Fsl3KfhPuy36HOK6cO2uCtHXxUfvByIXIY7Yro-h-x7ByWZj_hr4uL5EZ43dZn_1O-fo_fHhbfGMVy9Py8X9CjsmaYeZ0lZV1FWqLmvllKy1rkvRKCKpddw3TgjKieQVKCZFA5R4y61llS5rCZrN0c2Uu0_t8QqzafsUh5WGciYkVVqPKjqpXGpzTr4x-xR2Nn0bAmaszEyVmaEyc6zMHAYTm0x5P37k01_0P64fEllwGA</recordid><startdate>20201001</startdate><enddate>20201001</enddate><creator>Khushhal, Saquib</creator><creator>Majid, Abdul</creator><creator>Abbas, Syed Ali</creator><creator>Nadeem, Malik Sajjad Ahmed</creator><creator>Shah, Saeed Arif</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7SC</scope><scope>7WY</scope><scope>7WZ</scope><scope>7XB</scope><scope>87Z</scope><scope>8AL</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>8FL</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BEZIV</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FRNLG</scope><scope>F~G</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K60</scope><scope>K6~</scope><scope>K7-</scope><scope>L.-</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>M0C</scope><scope>M0N</scope><scope>P5Z</scope><scope>P62</scope><scope>PQBIZ</scope><scope>PQBZA</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>Q9U</scope></search><sort><creationdate>20201001</creationdate><title>Question retrieval using combined queries in community question answering</title><author>Khushhal, Saquib ; Majid, Abdul ; Abbas, Syed Ali ; Nadeem, Malik Sajjad Ahmed ; Shah, Saeed Arif</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c362t-379a7b2cb7d8d7c76d99d85f7162ac4efc5524164b07365f021ea4aa3b98d6093</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Archives & records</topic><topic>Artificial Intelligence</topic><topic>Computer Science</topic><topic>Data Structures and Information Theory</topic><topic>Information retrieval</topic><topic>Information Storage and Retrieval</topic><topic>IT in Business</topic><topic>Natural Language Processing (NLP)</topic><topic>Performance enhancement</topic><topic>Performance indices</topic><topic>Queries</topic><topic>Query languages</topic><topic>Questions</topic><topic>Ranking</topic><topic>Sentences</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Khushhal, Saquib</creatorcontrib><creatorcontrib>Majid, Abdul</creatorcontrib><creatorcontrib>Abbas, Syed Ali</creatorcontrib><creatorcontrib>Nadeem, Malik Sajjad Ahmed</creatorcontrib><creatorcontrib>Shah, Saeed Arif</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Computer and Information Systems Abstracts</collection><collection>ABI/INFORM Collection</collection><collection>ABI/INFORM Global (PDF only)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>ABI/INFORM Global (Alumni Edition)</collection><collection>Computing Database (Alumni Edition)</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ABI/INFORM Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Business Premium Collection</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Business Premium Collection (Alumni)</collection><collection>ABI/INFORM Global (Corporate)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>ProQuest Business Collection (Alumni Edition)</collection><collection>ProQuest Business Collection</collection><collection>Computer Science Database</collection><collection>ABI/INFORM Professional Advanced</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>ABI/INFORM Global</collection><collection>Computing Database</collection><collection>Advanced Technologies & Aerospace Database</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>ProQuest One Business</collection><collection>ProQuest One Business (Alumni)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central Basic</collection><jtitle>Journal of intelligent information systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Khushhal, Saquib</au><au>Majid, Abdul</au><au>Abbas, Syed Ali</au><au>Nadeem, Malik Sajjad Ahmed</au><au>Shah, Saeed Arif</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Question retrieval using combined queries in community question answering</atitle><jtitle>Journal of intelligent information systems</jtitle><stitle>J Intell Inf Syst</stitle><date>2020-10-01</date><risdate>2020</risdate><volume>55</volume><issue>2</issue><spage>307</spage><epage>327</epage><pages>307-327</pages><issn>0925-9902</issn><eissn>1573-7675</eissn><abstract>Community question answering (cQA) has emerged as a popular service on the web; users can use it to ask and answer questions and access historical question-answer (QA) pairs. cQA retrieval, as an alternative to general web searches, has several advantages. First, user can register a query in the form of natural language sentences instead of a set of keywords; thus, they can present the required information more clearly and comprehensively. Second, the system returns several possible answers instead of a long list of ranked documents, thereby enhancing the efficient location of the desired answers. Question retrieval from a cQA archive, an essential function of cQA retrieval services, aims to retrieve historical QA pairs relevant to the query question. In this study, combined queries (combined inverted and nextword indexes) are proposed for question retrieval in cQA. The method performance is investigated for two different scenarios: (a) when only questions from QA pairs are used as documents, and (b) when QA pairs are used as documents. In the proposed method, combined indexes are first created for both queries and documents; then, different information retrieval (IR) models are used to retrieve relevant questions from the cQA archive. Evaluation is performed on a public Yahoo! Answers dataset; the results thereby obtained show that using combined queries for all three IR models (vector space model, Okapi model, and language model) improves performance in terms of the retrieval precision and ranking effectiveness. Notably, by using combined indexes when both QA pairs are used as documents, the retrieval and ranking effectiveness of these cQA retrieval models increases significantly.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s10844-020-00612-x</doi><tpages>21</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0925-9902 |
ispartof | Journal of intelligent information systems, 2020-10, Vol.55 (2), p.307-327 |
issn | 0925-9902 1573-7675 |
language | eng |
recordid | cdi_proquest_journals_2435627999 |
source | SpringerLink Journals - AutoHoldings |
subjects | Archives & records Artificial Intelligence Computer Science Data Structures and Information Theory Information retrieval Information Storage and Retrieval IT in Business Natural Language Processing (NLP) Performance enhancement Performance indices Queries Query languages Questions Ranking Sentences |
title | Question retrieval using combined queries in community question answering |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-03T20%3A18%3A00IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Question%20retrieval%20using%20combined%20queries%20in%20community%20question%20answering&rft.jtitle=Journal%20of%20intelligent%20information%20systems&rft.au=Khushhal,%20Saquib&rft.date=2020-10-01&rft.volume=55&rft.issue=2&rft.spage=307&rft.epage=327&rft.pages=307-327&rft.issn=0925-9902&rft.eissn=1573-7675&rft_id=info:doi/10.1007/s10844-020-00612-x&rft_dat=%3Cproquest_cross%3E2435627999%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2435627999&rft_id=info:pmid/&rfr_iscdi=true |