Incremental author name disambiguation by exploiting domain‐specific heuristics

Bibliographic details
Published in:	Journal of the Association for Information Science and Technology 2017-04, Vol.68 (4), p.931-945
Main authors:	Santana, Alan Filipe; Gonçalves, Marcos André; Laender, Alberto H. F.; Ferreira, Anderson A.
Format:	Article
Language:	English
Subjects:	Digital systems; Electronic Libraries; Heuristic; Manuals; Names; Proposals; Repositories; State of the art
Online access:	Full text

Description
Abstract:	The vast majority of current author name disambiguation solutions are designed to disambiguate a whole digital library (DL) at once, considering the entire repository. However, besides being very expensive and having scalability problems, these solutions may not benefit from eventual manual corrections, as these may be lost whenever the whole repository has to be disambiguated again. In the real world, in which repositories are updated on a daily basis, incremental solutions that disambiguate only the newly introduced citation records are likely to produce improved results in the long run. However, the problem of incremental author name disambiguation has been largely neglected in the literature. In this article, we present a new author name disambiguation method specially designed for the incremental scenario. In our experiments, our new method largely outperforms recent incremental proposals reported in the literature as well as the current state‐of‐the‐art non‐incremental method.
ISSN:	2330-1635 2330-1643
DOI:	10.1002/asi.23726