A supervised machine learning approach to trace doctorate recipients’ employment trajectories

Bibliographic details
Published in: Quantitative science studies 2020-02, Vol.1 (1), p.94-116
Main authors: Heinisch, Dominik P., Koenig, Johannes, Otto, Anne
Format: Article
Language: English
Description
Abstract: Only scarce information is available on doctorate recipients’ career outcomes. With the current information base, graduate students cannot make an informed decision on whether to start a doctorate or not. However, administrative labor market data, which could provide the necessary information, are incomplete in this respect. In this paper, we describe the record linkage of two data sets to close this information gap: data on doctorate recipients collected in the catalog of the German National Library (DNB), and the German labor market biographies (IEB) from the German Institute of Employment Research. We use a machine learning-based methodology, which (a) improves the record linkage of data sets without unique identifiers, and (b) evaluates the quality of the record linkage. The machine learning algorithms are trained on a synthetic training and evaluation data set. In an exemplary analysis, we compare the evolution of the employment status of female and male doctorate recipients in Germany.
ISSN: 2641-3337
DOI: 10.1162/qss_a_00001