Three social distance measures for film rankings

We describe the use of three alternative methods for ranking films for information retrieval (IR). A large film‐person incidence matrix is generated using the principle cast, directors, producers and screenwriters for each film. These attributes are used to measure film‐film distances by creating a...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Proceedings of the ASIS annual meeting 2003-10, Vol.40 (1), p.21-27
Hauptverfasser: Leazer, Gregory H., Furner, Jonathan, Napper, Rachel
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:We describe the use of three alternative methods for ranking films for information retrieval (IR). A large film‐person incidence matrix is generated using the principle cast, directors, producers and screenwriters for each film. These attributes are used to measure film‐film distances by creating a distance matrix: two films are considered to be adjacent if there is any overlap in the people associated with each film. The distance between any two films is measured by the shortest path used to connect them through their adjacent members. The second and third methods involve the creation of a similarity matrix that expresses the amount of overlap in the people associated with any two films using Dice's coefficient. A “product distance” matrix is then derived that express the distances between any two films based on the product of the similarity weights on a path that connects those films. The highest value is chosen when alternate paths connect the two films. We also describe an “accumulative difference distance” matrix that also expresses the distances among pairs of films. The distance, product distance and accumulative difference distance matrices are used to generate rankings for a random sample of films.
ISSN:0044-7870
1550-8390
1550-8390
DOI:10.1002/meet.1450400103