A community detection approach based on network representation learning for repository mining

In this paper, we propose a novel heterogeneous graph-based model for capturing and handling all the complex and strongly-correlated information of a software Developer Social Network (DSN) to support several analytic tasks. In particular, we challenge the problem of automatically discovering commun...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Expert systems with applications 2023-11, Vol.231, p.120597, Article 120597
Hauptverfasser: De Luca, Marco, Fasolino, Anna Rita, Ferraro, Antonino, Moscato, Vincenzo, Sperlí, Giancarlo, Tramontana, Porfirio
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:In this paper, we propose a novel heterogeneous graph-based model for capturing and handling all the complex and strongly-correlated information of a software Developer Social Network (DSN) to support several analytic tasks. In particular, we challenge the problem of automatically discovering communities of software developers sharing interests for similar projects by relying on Social Network Analysis (SNA) findings. To overcome the huge graph-size issue, we leverage different graph embedding techniques. Eventually, we evaluate the proposed approach with respect to state-of-the-art approaches from an efficiency and an effectiveness point of view by carrying out an experiment involving the GitHub dataset. •Heterogeneous graph has been designed to model developers interaction in projects.•Community detection algorithm has been designed by using graph embedding techniques.•Graph embedding techniques have been used to extract information from DSN;•Effectiveness and efficiency evaluation have been made on real dataset (Github).
ISSN:0957-4174
1873-6793
DOI:10.1016/j.eswa.2023.120597