Performance Evaluation of Compressed Inverted Index in Lucene

Inverted index is the most popular index structure in search engine. Applying index compression can reduce storage space on inverted index, and improve the search performance. In this paper, we compare three typical index compression schemes in Lucene-the open source information retrieval system. Fi...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Jian Wan, Shengyi Pan
Format: Tagungsbericht
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Inverted index is the most popular index structure in search engine. Applying index compression can reduce storage space on inverted index, and improve the search performance. In this paper, we compare three typical index compression schemes in Lucene-the open source information retrieval system. First, index compression schemes are realized in Lucene. Then we present the comparison results of these compression schemes in compression ratio, decompression speed, and scalability. In different algorithms, the impact caused by whether index file is interleaving has remarkable discrepancies in compression ratio and decompression speed, and the scale of data also influences the algorithm's efficiency.
DOI:10.1109/ICRCCS.2009.53