Experiments in Query Paraphrasing for Information Retrieval

We investigate the effect of paraphrase generation on document retrieval performance. Specifically, we describe experiments where three information sources are used to generate lexical paraphrases of queries posed to the Internet. These information sources are: WordNet, a Webster-based thesaurus, an...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Zukerman, Ingrid, Raskutti, Bhavani, Wen, Yingying
Format: Buchkapitel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:We investigate the effect of paraphrase generation on document retrieval performance. Specifically, we describe experiments where three information sources are used to generate lexical paraphrases of queries posed to the Internet. These information sources are: WordNet, a Webster-based thesaurus, and a combination of Webster and WordNet. Corpus-based information and wordsimilarity information are then used to rank the paraphrases. We evaluated our mechanism using 404 queries whose answers reside in the LA Times subset of the TREC-9 corpus. Our experiments show that query paraphrasing improves retrieval performance, and that performance is influenced both by the number of paraphrases generated for a query and by their quality. Specifically, the best performance was obtained usingWordNet, which improves document recall by 14% and increases the number of questions that can be answered by 8%.
ISSN:0302-9743
1611-3349
DOI:10.1007/3-540-36187-1_3