A note on measuring overlap

Bibliographic details
Published in:	Journal of information science 2007-04, Vol.33 (2), p.189-195
Main authors:	Egghe, L., Goovaerts, M.
Format:	Article
Language:	English
Subjects:	Bibliometrics. Scientometrics. Evaluation; Confidence intervals; Exact sciences and technology; Information and communication sciences; Information science. Documentation; Informetrics; Libraries; Library and information science. General aspects; Mathematics; Overlap; Sciences and techniques of general use
Online access:	Full text

Description
Abstract:	In measuring the overlap between two sets A and B (e.g. libraries, databases) one is obliged to calculate the overlap O(A|B) of A with respect to B (i.e. the fraction of elements of B that are also in A) and the overlap O(B|A) of B with respect to A (i.e. the fraction of elements of A that are also in B). Theoretically this requires two samples. In this paper we explain that one sample can suffice to determine confidence intervals for both O(A|B) and O(B|A). The paper closes with the example of measuring the overlap between the secondary sources in mathematics MathSciNet and Zentralblatt MATH and with a remark on the estimation of the Jaccard index.
ISSN:	0165-5515 1741-6485
DOI:	10.1177/0165551506075325
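
The two overlap measures defined in the abstract, and the Jaccard index it mentions, can be illustrated on small, fully known sets. The following is a minimal sketch only: the function names `overlap` and `jaccard` and the toy sets are hypothetical, and it does not reproduce the paper's single-sample confidence-interval method, which the abstract does not spell out.

```python
from fractions import Fraction


def overlap(a: set, b: set) -> Fraction:
    """O(A|B): the fraction of the elements of B that are also in A."""
    if not b:
        raise ValueError("B must be non-empty")
    return Fraction(len(a & b), len(b))


def jaccard(a: set, b: set) -> Fraction:
    """Jaccard index: size of the intersection divided by size of the union."""
    union = a | b
    if not union:
        raise ValueError("A and B must not both be empty")
    return Fraction(len(a & b), len(union))


# Toy data (hypothetical): two small "databases" sharing two records.
A = {1, 2, 3, 4}
B = {3, 4, 5, 6, 7, 8}

o_ab = overlap(A, B)  # share of B that is also in A
o_ba = overlap(B, A)  # share of A that is also in B
j = jaccard(A, B)

print(o_ab, o_ba, j)

# When the intersection is non-empty, the Jaccard index follows from the
# two overlaps: 1/J = 1/O(A|B) + 1/O(B|A) - 1.
assert 1 / j == 1 / o_ab + 1 / o_ba - 1
```

The example shows why both directions have to be reported: the same two shared records make up a different share of each set whenever A and B differ in size.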