Investigating relationships within and between category networks in Wikipedia

► Topology of Wikipedia citation network is not uniform. ► Connectivity patterns inside each category are different among themselves. ► The growth mechanisms of the categories are not equal. ► Full Wikipedia network analysis cannot predict the behaviour of isolated categories. This work maps and ana...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of informetrics 2011-07, Vol.5 (3), p.431-438
Hauptverfasser: Silva, F.N., Viana, M.P., Travençolo, B.A.N., Costa, L. da F.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:► Topology of Wikipedia citation network is not uniform. ► Connectivity patterns inside each category are different among themselves. ► The growth mechanisms of the categories are not equal. ► Full Wikipedia network analysis cannot predict the behaviour of isolated categories. This work maps and analyses cross-citations in the areas of Biology, Mathematics, Physics and Medicine in the English version of Wikipedia, which are represented as an undirected complex network where the entries correspond to nodes and the citations among the entries are mapped as edges. We found a high value of clustering coefficient for the areas of Biology and Medicine, and a small value for Mathematics and Physics. The topological organization is also different for each network, including a modular structure for Biology and Medicine, a sparse structure for Mathematics and a dense core for Physics. The networks have degree distributions that can be approximated by a power-law with a cut-off. The assortativity of the isolated networks has also been investigated and the results indicate distinct patterns for each subject. We estimated the betweenness centrality of each node considering the full Wikipedia network, which contains the nodes of the four subjects and the edges between them. In addition, the average shortest path length between the subjects revealed a close relationship between the subjects of Biology and Physics, and also between Medicine and Physics. Our results indicate that the analysis of the full Wikipedia network cannot predict the behavior of the isolated categories since their properties can be very different from those observed in the full network.
ISSN:1751-1577
1875-5879
DOI:10.1016/j.joi.2011.03.003