Are two document clusters better than one? The Cluster Performance Question for information retrieval
Published in: | Journal of the American Society for Information Science and Technology 2005-01, Vol.56 (1), p.106-108 |
---|---|
Main authors: | , |
Format: | Article |
Language: | English |
Subjects: | |
Online access: | Full text |
Summary: | When do information retrieval systems using two document clusters provide better retrieval performance than systems using no clustering? We answer this question for one set of assumptions and suggest how this may be studied with other assumptions. The “Cluster Hypothesis” asks an empirical question about the relationships between documents and user‐supplied relevance judgments, while the “Cluster Performance Question” proposed here focuses on the when and why of information retrieval or digital library performance for clustered and unclustered text databases. This may be generalized to study the relative performance of m versus n clusters. |
ISSN: | 1532-2882 2330-1635 1532-2890 2330-1643 |
DOI: | 10.1002/asi.20068 |