Investigating relationships within and between category networks in Wikipedia

► Topology of Wikipedia citation network is not uniform. ► Connectivity patterns inside each category are different among themselves. ► The growth mechanisms of the categories are not equal. ► Full Wikipedia network analysis cannot predict the behaviour of isolated categories. This work maps and ana...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of informetrics 2011-07, Vol.5 (3), p.431-438
Hauptverfasser: Silva, F.N., Viana, M.P., Travençolo, B.A.N., Costa, L. da F.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 438
container_issue 3
container_start_page 431
container_title Journal of informetrics
container_volume 5
creator Silva, F.N.
Viana, M.P.
Travençolo, B.A.N.
Costa, L. da F.
description ► Topology of Wikipedia citation network is not uniform. ► Connectivity patterns inside each category are different among themselves. ► The growth mechanisms of the categories are not equal. ► Full Wikipedia network analysis cannot predict the behaviour of isolated categories. This work maps and analyses cross-citations in the areas of Biology, Mathematics, Physics and Medicine in the English version of Wikipedia, which are represented as an undirected complex network where the entries correspond to nodes and the citations among the entries are mapped as edges. We found a high value of clustering coefficient for the areas of Biology and Medicine, and a small value for Mathematics and Physics. The topological organization is also different for each network, including a modular structure for Biology and Medicine, a sparse structure for Mathematics and a dense core for Physics. The networks have degree distributions that can be approximated by a power-law with a cut-off. The assortativity of the isolated networks has also been investigated and the results indicate distinct patterns for each subject. We estimated the betweenness centrality of each node considering the full Wikipedia network, which contains the nodes of the four subjects and the edges between them. In addition, the average shortest path length between the subjects revealed a close relationship between the subjects of Biology and Physics, and also between Medicine and Physics. Our results indicate that the analysis of the full Wikipedia network cannot predict the behavior of the isolated categories since their properties can be very different from those observed in the full network.
doi_str_mv 10.1016/j.joi.2011.03.003
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_902064352</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S1751157711000344</els_id><sourcerecordid>902064352</sourcerecordid><originalsourceid>FETCH-LOGICAL-c329t-3af05149fd6d59245548c065cef8946071380ef4de0588ee532333ca1dc974783</originalsourceid><addsrcrecordid>eNp1kM1OwzAQhC0EEqXwANxy45SwjuPYESdU8VOpiAuIo2WcTes0dYKdturb46pcOe1oNbPa-Qi5pZBRoOV9m7W9zXKgNAOWAbAzMqFS8JRLUZ1HLThNKRfiklyF0ALwsqTVhLzN3Q7DaJd6tG6ZeOyi6F1Y2SEkezuurEu0q5NvHPeILjF6xGXvD4mLi96vQxINX3ZtB6ytviYXje4C3vzNKfl8fvqYvaaL95f57HGRGpZXY8p0A5wWVVOXNa_ygvNCGii5wUZWRQmCMgnYFDUClxKRs5wxZjStTSUKIdmU3J3uDr7_2cb_1cYGg12nHfbboCrIoSwYz6OTnpzG9yF4bNTg7Ub7g6KgjuRUqyI5dSSngKlILmYeThmMFXYWvQrGojOxoUczqjr6_0__Al5ydro</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>902064352</pqid></control><display><type>article</type><title>Investigating relationships within and between category networks in Wikipedia</title><source>ScienceDirect Journals (5 years ago - present)</source><creator>Silva, F.N. ; Viana, M.P. ; Travençolo, B.A.N. ; Costa, L. da F.</creator><creatorcontrib>Silva, F.N. ; Viana, M.P. ; Travençolo, B.A.N. ; Costa, L. da F.</creatorcontrib><description>► Topology of Wikipedia citation network is not uniform. ► Connectivity patterns inside each category are different among themselves. ► The growth mechanisms of the categories are not equal. ► Full Wikipedia network analysis cannot predict the behaviour of isolated categories. This work maps and analyses cross-citations in the areas of Biology, Mathematics, Physics and Medicine in the English version of Wikipedia, which are represented as an undirected complex network where the entries correspond to nodes and the citations among the entries are mapped as edges. We found a high value of clustering coefficient for the areas of Biology and Medicine, and a small value for Mathematics and Physics. The topological organization is also different for each network, including a modular structure for Biology and Medicine, a sparse structure for Mathematics and a dense core for Physics. The networks have degree distributions that can be approximated by a power-law with a cut-off. The assortativity of the isolated networks has also been investigated and the results indicate distinct patterns for each subject. We estimated the betweenness centrality of each node considering the full Wikipedia network, which contains the nodes of the four subjects and the edges between them. In addition, the average shortest path length between the subjects revealed a close relationship between the subjects of Biology and Physics, and also between Medicine and Physics. Our results indicate that the analysis of the full Wikipedia network cannot predict the behavior of the isolated categories since their properties can be very different from those observed in the full network.</description><identifier>ISSN: 1751-1577</identifier><identifier>EISSN: 1875-5879</identifier><identifier>DOI: 10.1016/j.joi.2011.03.003</identifier><language>eng</language><publisher>Elsevier Ltd</publisher><subject>Citation analysis ; Complex network ; Encyclopaedias ; Map of science ; Science ; Wikipedia ; Wikis</subject><ispartof>Journal of informetrics, 2011-07, Vol.5 (3), p.431-438</ispartof><rights>2011 Elsevier Ltd</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c329t-3af05149fd6d59245548c065cef8946071380ef4de0588ee532333ca1dc974783</citedby><cites>FETCH-LOGICAL-c329t-3af05149fd6d59245548c065cef8946071380ef4de0588ee532333ca1dc974783</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.joi.2011.03.003$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,778,782,3539,27913,27914,45984</link.rule.ids></links><search><creatorcontrib>Silva, F.N.</creatorcontrib><creatorcontrib>Viana, M.P.</creatorcontrib><creatorcontrib>Travençolo, B.A.N.</creatorcontrib><creatorcontrib>Costa, L. da F.</creatorcontrib><title>Investigating relationships within and between category networks in Wikipedia</title><title>Journal of informetrics</title><description>► Topology of Wikipedia citation network is not uniform. ► Connectivity patterns inside each category are different among themselves. ► The growth mechanisms of the categories are not equal. ► Full Wikipedia network analysis cannot predict the behaviour of isolated categories. This work maps and analyses cross-citations in the areas of Biology, Mathematics, Physics and Medicine in the English version of Wikipedia, which are represented as an undirected complex network where the entries correspond to nodes and the citations among the entries are mapped as edges. We found a high value of clustering coefficient for the areas of Biology and Medicine, and a small value for Mathematics and Physics. The topological organization is also different for each network, including a modular structure for Biology and Medicine, a sparse structure for Mathematics and a dense core for Physics. The networks have degree distributions that can be approximated by a power-law with a cut-off. The assortativity of the isolated networks has also been investigated and the results indicate distinct patterns for each subject. We estimated the betweenness centrality of each node considering the full Wikipedia network, which contains the nodes of the four subjects and the edges between them. In addition, the average shortest path length between the subjects revealed a close relationship between the subjects of Biology and Physics, and also between Medicine and Physics. Our results indicate that the analysis of the full Wikipedia network cannot predict the behavior of the isolated categories since their properties can be very different from those observed in the full network.</description><subject>Citation analysis</subject><subject>Complex network</subject><subject>Encyclopaedias</subject><subject>Map of science</subject><subject>Science</subject><subject>Wikipedia</subject><subject>Wikis</subject><issn>1751-1577</issn><issn>1875-5879</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2011</creationdate><recordtype>article</recordtype><recordid>eNp1kM1OwzAQhC0EEqXwANxy45SwjuPYESdU8VOpiAuIo2WcTes0dYKdturb46pcOe1oNbPa-Qi5pZBRoOV9m7W9zXKgNAOWAbAzMqFS8JRLUZ1HLThNKRfiklyF0ALwsqTVhLzN3Q7DaJd6tG6ZeOyi6F1Y2SEkezuurEu0q5NvHPeILjF6xGXvD4mLi96vQxINX3ZtB6ytviYXje4C3vzNKfl8fvqYvaaL95f57HGRGpZXY8p0A5wWVVOXNa_ygvNCGii5wUZWRQmCMgnYFDUClxKRs5wxZjStTSUKIdmU3J3uDr7_2cb_1cYGg12nHfbboCrIoSwYz6OTnpzG9yF4bNTg7Ub7g6KgjuRUqyI5dSSngKlILmYeThmMFXYWvQrGojOxoUczqjr6_0__Al5ydro</recordid><startdate>20110701</startdate><enddate>20110701</enddate><creator>Silva, F.N.</creator><creator>Viana, M.P.</creator><creator>Travençolo, B.A.N.</creator><creator>Costa, L. da F.</creator><general>Elsevier Ltd</general><scope>AAYXX</scope><scope>CITATION</scope><scope>E3H</scope><scope>F2A</scope></search><sort><creationdate>20110701</creationdate><title>Investigating relationships within and between category networks in Wikipedia</title><author>Silva, F.N. ; Viana, M.P. ; Travençolo, B.A.N. ; Costa, L. da F.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c329t-3af05149fd6d59245548c065cef8946071380ef4de0588ee532333ca1dc974783</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2011</creationdate><topic>Citation analysis</topic><topic>Complex network</topic><topic>Encyclopaedias</topic><topic>Map of science</topic><topic>Science</topic><topic>Wikipedia</topic><topic>Wikis</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Silva, F.N.</creatorcontrib><creatorcontrib>Viana, M.P.</creatorcontrib><creatorcontrib>Travençolo, B.A.N.</creatorcontrib><creatorcontrib>Costa, L. da F.</creatorcontrib><collection>CrossRef</collection><collection>Library &amp; Information Sciences Abstracts (LISA)</collection><collection>Library &amp; Information Science Abstracts (LISA)</collection><jtitle>Journal of informetrics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Silva, F.N.</au><au>Viana, M.P.</au><au>Travençolo, B.A.N.</au><au>Costa, L. da F.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Investigating relationships within and between category networks in Wikipedia</atitle><jtitle>Journal of informetrics</jtitle><date>2011-07-01</date><risdate>2011</risdate><volume>5</volume><issue>3</issue><spage>431</spage><epage>438</epage><pages>431-438</pages><issn>1751-1577</issn><eissn>1875-5879</eissn><abstract>► Topology of Wikipedia citation network is not uniform. ► Connectivity patterns inside each category are different among themselves. ► The growth mechanisms of the categories are not equal. ► Full Wikipedia network analysis cannot predict the behaviour of isolated categories. This work maps and analyses cross-citations in the areas of Biology, Mathematics, Physics and Medicine in the English version of Wikipedia, which are represented as an undirected complex network where the entries correspond to nodes and the citations among the entries are mapped as edges. We found a high value of clustering coefficient for the areas of Biology and Medicine, and a small value for Mathematics and Physics. The topological organization is also different for each network, including a modular structure for Biology and Medicine, a sparse structure for Mathematics and a dense core for Physics. The networks have degree distributions that can be approximated by a power-law with a cut-off. The assortativity of the isolated networks has also been investigated and the results indicate distinct patterns for each subject. We estimated the betweenness centrality of each node considering the full Wikipedia network, which contains the nodes of the four subjects and the edges between them. In addition, the average shortest path length between the subjects revealed a close relationship between the subjects of Biology and Physics, and also between Medicine and Physics. Our results indicate that the analysis of the full Wikipedia network cannot predict the behavior of the isolated categories since their properties can be very different from those observed in the full network.</abstract><pub>Elsevier Ltd</pub><doi>10.1016/j.joi.2011.03.003</doi><tpages>8</tpages></addata></record>
fulltext fulltext
identifier ISSN: 1751-1577
ispartof Journal of informetrics, 2011-07, Vol.5 (3), p.431-438
issn 1751-1577
1875-5879
language eng
recordid cdi_proquest_miscellaneous_902064352
source ScienceDirect Journals (5 years ago - present)
subjects Citation analysis
Complex network
Encyclopaedias
Map of science
Science
Wikipedia
Wikis
title Investigating relationships within and between category networks in Wikipedia
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-15T09%3A42%3A20IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Investigating%20relationships%20within%20and%20between%20category%20networks%20in%20Wikipedia&rft.jtitle=Journal%20of%20informetrics&rft.au=Silva,%20F.N.&rft.date=2011-07-01&rft.volume=5&rft.issue=3&rft.spage=431&rft.epage=438&rft.pages=431-438&rft.issn=1751-1577&rft.eissn=1875-5879&rft_id=info:doi/10.1016/j.joi.2011.03.003&rft_dat=%3Cproquest_cross%3E902064352%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=902064352&rft_id=info:pmid/&rft_els_id=S1751157711000344&rfr_iscdi=true