Question retrieval using combined queries in community question answering

Community question answering (cQA) has emerged as a popular service on the web; users can use it to ask and answer questions and access historical question-answer (QA) pairs. cQA retrieval, as an alternative to general web searches, has several advantages. First, user can register a query in the for...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of intelligent information systems 2020-10, Vol.55 (2), p.307-327
Hauptverfasser: Khushhal, Saquib, Majid, Abdul, Abbas, Syed Ali, Nadeem, Malik Sajjad Ahmed, Shah, Saeed Arif
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 327
container_issue 2
container_start_page 307
container_title Journal of intelligent information systems
container_volume 55
creator Khushhal, Saquib
Majid, Abdul
Abbas, Syed Ali
Nadeem, Malik Sajjad Ahmed
Shah, Saeed Arif
description Community question answering (cQA) has emerged as a popular service on the web; users can use it to ask and answer questions and access historical question-answer (QA) pairs. cQA retrieval, as an alternative to general web searches, has several advantages. First, user can register a query in the form of natural language sentences instead of a set of keywords; thus, they can present the required information more clearly and comprehensively. Second, the system returns several possible answers instead of a long list of ranked documents, thereby enhancing the efficient location of the desired answers. Question retrieval from a cQA archive, an essential function of cQA retrieval services, aims to retrieve historical QA pairs relevant to the query question. In this study, combined queries (combined inverted and nextword indexes) are proposed for question retrieval in cQA. The method performance is investigated for two different scenarios: (a) when only questions from QA pairs are used as documents, and (b) when QA pairs are used as documents. In the proposed method, combined indexes are first created for both queries and documents; then, different information retrieval (IR) models are used to retrieve relevant questions from the cQA archive. Evaluation is performed on a public Yahoo! Answers dataset; the results thereby obtained show that using combined queries for all three IR models (vector space model, Okapi model, and language model) improves performance in terms of the retrieval precision and ranking effectiveness. Notably, by using combined indexes when both QA pairs are used as documents, the retrieval and ranking effectiveness of these cQA retrieval models increases significantly.
doi_str_mv 10.1007/s10844-020-00612-x
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2435627999</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2435627999</sourcerecordid><originalsourceid>FETCH-LOGICAL-c362t-379a7b2cb7d8d7c76d99d85f7162ac4efc5524164b07365f021ea4aa3b98d6093</originalsourceid><addsrcrecordid>eNp9UEtLAzEQDqJgrf4BTwueo5P35ijFR6Eggp5DNpstKW22Jrta_727XcGbp4HvNTMfQtcEbgmAussESs4xUMAAklB8OEEzIhTDSipximagqcBaAz1HFzlvAECXEmZo-dr73IU2Fsl3KfhPuy36HOK6cO2uCtHXxUfvByIXIY7Yro-h-x7ByWZj_hr4uL5EZ43dZn_1O-fo_fHhbfGMVy9Py8X9CjsmaYeZ0lZV1FWqLmvllKy1rkvRKCKpddw3TgjKieQVKCZFA5R4y61llS5rCZrN0c2Uu0_t8QqzafsUh5WGciYkVVqPKjqpXGpzTr4x-xR2Nn0bAmaszEyVmaEyc6zMHAYTm0x5P37k01_0P64fEllwGA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2435627999</pqid></control><display><type>article</type><title>Question retrieval using combined queries in community question answering</title><source>SpringerLink Journals - AutoHoldings</source><creator>Khushhal, Saquib ; Majid, Abdul ; Abbas, Syed Ali ; Nadeem, Malik Sajjad Ahmed ; Shah, Saeed Arif</creator><creatorcontrib>Khushhal, Saquib ; Majid, Abdul ; Abbas, Syed Ali ; Nadeem, Malik Sajjad Ahmed ; Shah, Saeed Arif</creatorcontrib><description>Community question answering (cQA) has emerged as a popular service on the web; users can use it to ask and answer questions and access historical question-answer (QA) pairs. cQA retrieval, as an alternative to general web searches, has several advantages. First, user can register a query in the form of natural language sentences instead of a set of keywords; thus, they can present the required information more clearly and comprehensively. Second, the system returns several possible answers instead of a long list of ranked documents, thereby enhancing the efficient location of the desired answers. Question retrieval from a cQA archive, an essential function of cQA retrieval services, aims to retrieve historical QA pairs relevant to the query question. In this study, combined queries (combined inverted and nextword indexes) are proposed for question retrieval in cQA. The method performance is investigated for two different scenarios: (a) when only questions from QA pairs are used as documents, and (b) when QA pairs are used as documents. In the proposed method, combined indexes are first created for both queries and documents; then, different information retrieval (IR) models are used to retrieve relevant questions from the cQA archive. Evaluation is performed on a public Yahoo! Answers dataset; the results thereby obtained show that using combined queries for all three IR models (vector space model, Okapi model, and language model) improves performance in terms of the retrieval precision and ranking effectiveness. Notably, by using combined indexes when both QA pairs are used as documents, the retrieval and ranking effectiveness of these cQA retrieval models increases significantly.</description><identifier>ISSN: 0925-9902</identifier><identifier>EISSN: 1573-7675</identifier><identifier>DOI: 10.1007/s10844-020-00612-x</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Archives &amp; records ; Artificial Intelligence ; Computer Science ; Data Structures and Information Theory ; Information retrieval ; Information Storage and Retrieval ; IT in Business ; Natural Language Processing (NLP) ; Performance enhancement ; Performance indices ; Queries ; Query languages ; Questions ; Ranking ; Sentences</subject><ispartof>Journal of intelligent information systems, 2020-10, Vol.55 (2), p.307-327</ispartof><rights>Springer Science+Business Media, LLC, part of Springer Nature 2020</rights><rights>Springer Science+Business Media, LLC, part of Springer Nature 2020.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c362t-379a7b2cb7d8d7c76d99d85f7162ac4efc5524164b07365f021ea4aa3b98d6093</citedby><cites>FETCH-LOGICAL-c362t-379a7b2cb7d8d7c76d99d85f7162ac4efc5524164b07365f021ea4aa3b98d6093</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s10844-020-00612-x$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s10844-020-00612-x$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,776,780,27901,27902,41464,42533,51294</link.rule.ids></links><search><creatorcontrib>Khushhal, Saquib</creatorcontrib><creatorcontrib>Majid, Abdul</creatorcontrib><creatorcontrib>Abbas, Syed Ali</creatorcontrib><creatorcontrib>Nadeem, Malik Sajjad Ahmed</creatorcontrib><creatorcontrib>Shah, Saeed Arif</creatorcontrib><title>Question retrieval using combined queries in community question answering</title><title>Journal of intelligent information systems</title><addtitle>J Intell Inf Syst</addtitle><description>Community question answering (cQA) has emerged as a popular service on the web; users can use it to ask and answer questions and access historical question-answer (QA) pairs. cQA retrieval, as an alternative to general web searches, has several advantages. First, user can register a query in the form of natural language sentences instead of a set of keywords; thus, they can present the required information more clearly and comprehensively. Second, the system returns several possible answers instead of a long list of ranked documents, thereby enhancing the efficient location of the desired answers. Question retrieval from a cQA archive, an essential function of cQA retrieval services, aims to retrieve historical QA pairs relevant to the query question. In this study, combined queries (combined inverted and nextword indexes) are proposed for question retrieval in cQA. The method performance is investigated for two different scenarios: (a) when only questions from QA pairs are used as documents, and (b) when QA pairs are used as documents. In the proposed method, combined indexes are first created for both queries and documents; then, different information retrieval (IR) models are used to retrieve relevant questions from the cQA archive. Evaluation is performed on a public Yahoo! Answers dataset; the results thereby obtained show that using combined queries for all three IR models (vector space model, Okapi model, and language model) improves performance in terms of the retrieval precision and ranking effectiveness. Notably, by using combined indexes when both QA pairs are used as documents, the retrieval and ranking effectiveness of these cQA retrieval models increases significantly.</description><subject>Archives &amp; records</subject><subject>Artificial Intelligence</subject><subject>Computer Science</subject><subject>Data Structures and Information Theory</subject><subject>Information retrieval</subject><subject>Information Storage and Retrieval</subject><subject>IT in Business</subject><subject>Natural Language Processing (NLP)</subject><subject>Performance enhancement</subject><subject>Performance indices</subject><subject>Queries</subject><subject>Query languages</subject><subject>Questions</subject><subject>Ranking</subject><subject>Sentences</subject><issn>0925-9902</issn><issn>1573-7675</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>BENPR</sourceid><recordid>eNp9UEtLAzEQDqJgrf4BTwueo5P35ijFR6Eggp5DNpstKW22Jrta_727XcGbp4HvNTMfQtcEbgmAussESs4xUMAAklB8OEEzIhTDSipximagqcBaAz1HFzlvAECXEmZo-dr73IU2Fsl3KfhPuy36HOK6cO2uCtHXxUfvByIXIY7Yro-h-x7ByWZj_hr4uL5EZ43dZn_1O-fo_fHhbfGMVy9Py8X9CjsmaYeZ0lZV1FWqLmvllKy1rkvRKCKpddw3TgjKieQVKCZFA5R4y61llS5rCZrN0c2Uu0_t8QqzafsUh5WGciYkVVqPKjqpXGpzTr4x-xR2Nn0bAmaszEyVmaEyc6zMHAYTm0x5P37k01_0P64fEllwGA</recordid><startdate>20201001</startdate><enddate>20201001</enddate><creator>Khushhal, Saquib</creator><creator>Majid, Abdul</creator><creator>Abbas, Syed Ali</creator><creator>Nadeem, Malik Sajjad Ahmed</creator><creator>Shah, Saeed Arif</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7SC</scope><scope>7WY</scope><scope>7WZ</scope><scope>7XB</scope><scope>87Z</scope><scope>8AL</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>8FL</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BEZIV</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FRNLG</scope><scope>F~G</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K60</scope><scope>K6~</scope><scope>K7-</scope><scope>L.-</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>M0C</scope><scope>M0N</scope><scope>P5Z</scope><scope>P62</scope><scope>PQBIZ</scope><scope>PQBZA</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>Q9U</scope></search><sort><creationdate>20201001</creationdate><title>Question retrieval using combined queries in community question answering</title><author>Khushhal, Saquib ; Majid, Abdul ; Abbas, Syed Ali ; Nadeem, Malik Sajjad Ahmed ; Shah, Saeed Arif</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c362t-379a7b2cb7d8d7c76d99d85f7162ac4efc5524164b07365f021ea4aa3b98d6093</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Archives &amp; records</topic><topic>Artificial Intelligence</topic><topic>Computer Science</topic><topic>Data Structures and Information Theory</topic><topic>Information retrieval</topic><topic>Information Storage and Retrieval</topic><topic>IT in Business</topic><topic>Natural Language Processing (NLP)</topic><topic>Performance enhancement</topic><topic>Performance indices</topic><topic>Queries</topic><topic>Query languages</topic><topic>Questions</topic><topic>Ranking</topic><topic>Sentences</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Khushhal, Saquib</creatorcontrib><creatorcontrib>Majid, Abdul</creatorcontrib><creatorcontrib>Abbas, Syed Ali</creatorcontrib><creatorcontrib>Nadeem, Malik Sajjad Ahmed</creatorcontrib><creatorcontrib>Shah, Saeed Arif</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Computer and Information Systems Abstracts</collection><collection>ABI/INFORM Collection</collection><collection>ABI/INFORM Global (PDF only)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>ABI/INFORM Global (Alumni Edition)</collection><collection>Computing Database (Alumni Edition)</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ABI/INFORM Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Business Premium Collection</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Business Premium Collection (Alumni)</collection><collection>ABI/INFORM Global (Corporate)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>ProQuest Business Collection (Alumni Edition)</collection><collection>ProQuest Business Collection</collection><collection>Computer Science Database</collection><collection>ABI/INFORM Professional Advanced</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>ABI/INFORM Global</collection><collection>Computing Database</collection><collection>Advanced Technologies &amp; Aerospace Database</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest One Business</collection><collection>ProQuest One Business (Alumni)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central Basic</collection><jtitle>Journal of intelligent information systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Khushhal, Saquib</au><au>Majid, Abdul</au><au>Abbas, Syed Ali</au><au>Nadeem, Malik Sajjad Ahmed</au><au>Shah, Saeed Arif</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Question retrieval using combined queries in community question answering</atitle><jtitle>Journal of intelligent information systems</jtitle><stitle>J Intell Inf Syst</stitle><date>2020-10-01</date><risdate>2020</risdate><volume>55</volume><issue>2</issue><spage>307</spage><epage>327</epage><pages>307-327</pages><issn>0925-9902</issn><eissn>1573-7675</eissn><abstract>Community question answering (cQA) has emerged as a popular service on the web; users can use it to ask and answer questions and access historical question-answer (QA) pairs. cQA retrieval, as an alternative to general web searches, has several advantages. First, user can register a query in the form of natural language sentences instead of a set of keywords; thus, they can present the required information more clearly and comprehensively. Second, the system returns several possible answers instead of a long list of ranked documents, thereby enhancing the efficient location of the desired answers. Question retrieval from a cQA archive, an essential function of cQA retrieval services, aims to retrieve historical QA pairs relevant to the query question. In this study, combined queries (combined inverted and nextword indexes) are proposed for question retrieval in cQA. The method performance is investigated for two different scenarios: (a) when only questions from QA pairs are used as documents, and (b) when QA pairs are used as documents. In the proposed method, combined indexes are first created for both queries and documents; then, different information retrieval (IR) models are used to retrieve relevant questions from the cQA archive. Evaluation is performed on a public Yahoo! Answers dataset; the results thereby obtained show that using combined queries for all three IR models (vector space model, Okapi model, and language model) improves performance in terms of the retrieval precision and ranking effectiveness. Notably, by using combined indexes when both QA pairs are used as documents, the retrieval and ranking effectiveness of these cQA retrieval models increases significantly.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s10844-020-00612-x</doi><tpages>21</tpages></addata></record>
fulltext fulltext
identifier ISSN: 0925-9902
ispartof Journal of intelligent information systems, 2020-10, Vol.55 (2), p.307-327
issn 0925-9902
1573-7675
language eng
recordid cdi_proquest_journals_2435627999
source SpringerLink Journals - AutoHoldings
subjects Archives & records
Artificial Intelligence
Computer Science
Data Structures and Information Theory
Information retrieval
Information Storage and Retrieval
IT in Business
Natural Language Processing (NLP)
Performance enhancement
Performance indices
Queries
Query languages
Questions
Ranking
Sentences
title Question retrieval using combined queries in community question answering
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-03T20%3A18%3A00IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Question%20retrieval%20using%20combined%20queries%20in%20community%20question%20answering&rft.jtitle=Journal%20of%20intelligent%20information%20systems&rft.au=Khushhal,%20Saquib&rft.date=2020-10-01&rft.volume=55&rft.issue=2&rft.spage=307&rft.epage=327&rft.pages=307-327&rft.issn=0925-9902&rft.eissn=1573-7675&rft_id=info:doi/10.1007/s10844-020-00612-x&rft_dat=%3Cproquest_cross%3E2435627999%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2435627999&rft_id=info:pmid/&rfr_iscdi=true