PLDLS: A novel parallel label diffusion and label Selection-based community detection algorithm based on Spark in social networks
•A novel fast and accurate Spark-based parallel community detection algorithm is proposed.•The proposed PLDLS algorithm uses label diffusion of core nodes along with a new label selection method.•Multi-factor criteria for computing nodes importance is used to select core nodes.•A fast and parallel m...
Gespeichert in:
Veröffentlicht in: | Expert systems with applications 2021-11, Vol.183, p.115377, Article 115377 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | 115377 |
container_title | Expert systems with applications |
container_volume | 183 |
creator | Roghani, Hamid Bouyer, Asgarali Nourani, Esmaeil |
description | •A novel fast and accurate Spark-based parallel community detection algorithm is proposed.•The proposed PLDLS algorithm uses label diffusion of core nodes along with a new label selection method.•Multi-factor criteria for computing nodes importance is used to select core nodes.•A fast and parallel merge phase is utilized to obtain more dense and accurate communities.•The result of PLDLS completely is robust, stable and scalable in comparison with other examined methods.
Parallel and distributed community detection in large-scale complex networks, such as social networks, is a challenging task. Parallel and distributed algorithm with high accuracy and low computational complexity is one of the essential issues in the community detection field. In this paper, we propose a novel fast, and accurate Spark-based parallel label diffusion and label selection-based (PLDLS) community detection algorithm with two-step of label diffusion of core nodes along with a new label selection (propagation) method. We have used multi-factor criteria for computing node's importance and adopted a new method for selecting core nodes. In the first phase, utilizing the fact that nodes forming triangles, tend to be in the same community, parallel label diffusion of core nodes is performed to diffuse labels up to two levels. In the second phase, through an iterative and parallel process, the most appropriate labels are assigned to the remaining nodes. PLDLS proposes an improved robust version of LPA by putting aside randomness parameter tuning. Furthermore, we utilize a fast and parallel merge phase to get even more dense and accurate communities. Conducted experiments on real-world and artificial networks, indicates the better accuracy and low execution time of PLDLS in comparison with other examined methods. |
doi_str_mv | 10.1016/j.eswa.2021.115377 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2579413735</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0957417421008034</els_id><sourcerecordid>2579413735</sourcerecordid><originalsourceid>FETCH-LOGICAL-c328t-a001f9b63346c553f5748a462b96c1251c738002ed7ed5cf97af37c46c403ec43</originalsourceid><addsrcrecordid>eNp9kM1OwzAQhC0EEqXwApwscU6w4zhuEJeq_EqVQCqcLdfZgFsnLnbSqkfeHFfpmct6NZ7ZXX0IXVOSUkKL21UKYafSjGQ0pZQzIU7QiE4ESwpRslM0IiUXSU5Ffo4uQlgRQgUhYoR-3-cP88UdnuLWbcHijfLK2thYtYy1MnXdB-NarNrqqC3Agu6ilixVgApr1zR9a7o9rqAbfrCyX86b7rvBgydKizh6jU2Lg9NGWdxCt3N-HS7RWa1sgKvjO0afT48fs5dk_vb8OpvOE82ySZeoeHJdLgvG8kJzzmou8onKi2xZFppmnGrBJoRkUAmouK5LoWomdDTnhIHO2RjdDHM33v30EDq5cr1v40qZcVHmlAnGoysbXNq7EDzUcuNNo_xeUiIPqOVKHlDLA2o5oI6h-yEE8f6tAS-DNtBqqIyPQGTlzH_xP-MeiBY</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2579413735</pqid></control><display><type>article</type><title>PLDLS: A novel parallel label diffusion and label Selection-based community detection algorithm based on Spark in social networks</title><source>Elsevier ScienceDirect Journals</source><creator>Roghani, Hamid ; Bouyer, Asgarali ; Nourani, Esmaeil</creator><creatorcontrib>Roghani, Hamid ; Bouyer, Asgarali ; Nourani, Esmaeil</creatorcontrib><description>•A novel fast and accurate Spark-based parallel community detection algorithm is proposed.•The proposed PLDLS algorithm uses label diffusion of core nodes along with a new label selection method.•Multi-factor criteria for computing nodes importance is used to select core nodes.•A fast and parallel merge phase is utilized to obtain more dense and accurate communities.•The result of PLDLS completely is robust, stable and scalable in comparison with other examined methods.
Parallel and distributed community detection in large-scale complex networks, such as social networks, is a challenging task. Parallel and distributed algorithm with high accuracy and low computational complexity is one of the essential issues in the community detection field. In this paper, we propose a novel fast, and accurate Spark-based parallel label diffusion and label selection-based (PLDLS) community detection algorithm with two-step of label diffusion of core nodes along with a new label selection (propagation) method. We have used multi-factor criteria for computing node's importance and adopted a new method for selecting core nodes. In the first phase, utilizing the fact that nodes forming triangles, tend to be in the same community, parallel label diffusion of core nodes is performed to diffuse labels up to two levels. In the second phase, through an iterative and parallel process, the most appropriate labels are assigned to the remaining nodes. PLDLS proposes an improved robust version of LPA by putting aside randomness parameter tuning. Furthermore, we utilize a fast and parallel merge phase to get even more dense and accurate communities. Conducted experiments on real-world and artificial networks, indicates the better accuracy and low execution time of PLDLS in comparison with other examined methods.</description><identifier>ISSN: 0957-4174</identifier><identifier>EISSN: 1873-6793</identifier><identifier>DOI: 10.1016/j.eswa.2021.115377</identifier><language>eng</language><publisher>New York: Elsevier Ltd</publisher><subject>Algorithms ; Complexity ; Diffusion rate ; Iterative methods ; Label diffusion ; Label selection ; Labels ; Local similarity ; Nodes ; Parallel community detection ; Social networks ; Spark</subject><ispartof>Expert systems with applications, 2021-11, Vol.183, p.115377, Article 115377</ispartof><rights>2021 Elsevier Ltd</rights><rights>Copyright Elsevier BV Nov 30, 2021</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c328t-a001f9b63346c553f5748a462b96c1251c738002ed7ed5cf97af37c46c403ec43</citedby><cites>FETCH-LOGICAL-c328t-a001f9b63346c553f5748a462b96c1251c738002ed7ed5cf97af37c46c403ec43</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S0957417421008034$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,776,780,3537,27901,27902,65534</link.rule.ids></links><search><creatorcontrib>Roghani, Hamid</creatorcontrib><creatorcontrib>Bouyer, Asgarali</creatorcontrib><creatorcontrib>Nourani, Esmaeil</creatorcontrib><title>PLDLS: A novel parallel label diffusion and label Selection-based community detection algorithm based on Spark in social networks</title><title>Expert systems with applications</title><description>•A novel fast and accurate Spark-based parallel community detection algorithm is proposed.•The proposed PLDLS algorithm uses label diffusion of core nodes along with a new label selection method.•Multi-factor criteria for computing nodes importance is used to select core nodes.•A fast and parallel merge phase is utilized to obtain more dense and accurate communities.•The result of PLDLS completely is robust, stable and scalable in comparison with other examined methods.
Parallel and distributed community detection in large-scale complex networks, such as social networks, is a challenging task. Parallel and distributed algorithm with high accuracy and low computational complexity is one of the essential issues in the community detection field. In this paper, we propose a novel fast, and accurate Spark-based parallel label diffusion and label selection-based (PLDLS) community detection algorithm with two-step of label diffusion of core nodes along with a new label selection (propagation) method. We have used multi-factor criteria for computing node's importance and adopted a new method for selecting core nodes. In the first phase, utilizing the fact that nodes forming triangles, tend to be in the same community, parallel label diffusion of core nodes is performed to diffuse labels up to two levels. In the second phase, through an iterative and parallel process, the most appropriate labels are assigned to the remaining nodes. PLDLS proposes an improved robust version of LPA by putting aside randomness parameter tuning. Furthermore, we utilize a fast and parallel merge phase to get even more dense and accurate communities. Conducted experiments on real-world and artificial networks, indicates the better accuracy and low execution time of PLDLS in comparison with other examined methods.</description><subject>Algorithms</subject><subject>Complexity</subject><subject>Diffusion rate</subject><subject>Iterative methods</subject><subject>Label diffusion</subject><subject>Label selection</subject><subject>Labels</subject><subject>Local similarity</subject><subject>Nodes</subject><subject>Parallel community detection</subject><subject>Social networks</subject><subject>Spark</subject><issn>0957-4174</issn><issn>1873-6793</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><recordid>eNp9kM1OwzAQhC0EEqXwApwscU6w4zhuEJeq_EqVQCqcLdfZgFsnLnbSqkfeHFfpmct6NZ7ZXX0IXVOSUkKL21UKYafSjGQ0pZQzIU7QiE4ESwpRslM0IiUXSU5Ffo4uQlgRQgUhYoR-3-cP88UdnuLWbcHijfLK2thYtYy1MnXdB-NarNrqqC3Agu6ilixVgApr1zR9a7o9rqAbfrCyX86b7rvBgydKizh6jU2Lg9NGWdxCt3N-HS7RWa1sgKvjO0afT48fs5dk_vb8OpvOE82ySZeoeHJdLgvG8kJzzmou8onKi2xZFppmnGrBJoRkUAmouK5LoWomdDTnhIHO2RjdDHM33v30EDq5cr1v40qZcVHmlAnGoysbXNq7EDzUcuNNo_xeUiIPqOVKHlDLA2o5oI6h-yEE8f6tAS-DNtBqqIyPQGTlzH_xP-MeiBY</recordid><startdate>20211130</startdate><enddate>20211130</enddate><creator>Roghani, Hamid</creator><creator>Bouyer, Asgarali</creator><creator>Nourani, Esmaeil</creator><general>Elsevier Ltd</general><general>Elsevier BV</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20211130</creationdate><title>PLDLS: A novel parallel label diffusion and label Selection-based community detection algorithm based on Spark in social networks</title><author>Roghani, Hamid ; Bouyer, Asgarali ; Nourani, Esmaeil</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c328t-a001f9b63346c553f5748a462b96c1251c738002ed7ed5cf97af37c46c403ec43</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Algorithms</topic><topic>Complexity</topic><topic>Diffusion rate</topic><topic>Iterative methods</topic><topic>Label diffusion</topic><topic>Label selection</topic><topic>Labels</topic><topic>Local similarity</topic><topic>Nodes</topic><topic>Parallel community detection</topic><topic>Social networks</topic><topic>Spark</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Roghani, Hamid</creatorcontrib><creatorcontrib>Bouyer, Asgarali</creatorcontrib><creatorcontrib>Nourani, Esmaeil</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Expert systems with applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Roghani, Hamid</au><au>Bouyer, Asgarali</au><au>Nourani, Esmaeil</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>PLDLS: A novel parallel label diffusion and label Selection-based community detection algorithm based on Spark in social networks</atitle><jtitle>Expert systems with applications</jtitle><date>2021-11-30</date><risdate>2021</risdate><volume>183</volume><spage>115377</spage><pages>115377-</pages><artnum>115377</artnum><issn>0957-4174</issn><eissn>1873-6793</eissn><abstract>•A novel fast and accurate Spark-based parallel community detection algorithm is proposed.•The proposed PLDLS algorithm uses label diffusion of core nodes along with a new label selection method.•Multi-factor criteria for computing nodes importance is used to select core nodes.•A fast and parallel merge phase is utilized to obtain more dense and accurate communities.•The result of PLDLS completely is robust, stable and scalable in comparison with other examined methods.
Parallel and distributed community detection in large-scale complex networks, such as social networks, is a challenging task. Parallel and distributed algorithm with high accuracy and low computational complexity is one of the essential issues in the community detection field. In this paper, we propose a novel fast, and accurate Spark-based parallel label diffusion and label selection-based (PLDLS) community detection algorithm with two-step of label diffusion of core nodes along with a new label selection (propagation) method. We have used multi-factor criteria for computing node's importance and adopted a new method for selecting core nodes. In the first phase, utilizing the fact that nodes forming triangles, tend to be in the same community, parallel label diffusion of core nodes is performed to diffuse labels up to two levels. In the second phase, through an iterative and parallel process, the most appropriate labels are assigned to the remaining nodes. PLDLS proposes an improved robust version of LPA by putting aside randomness parameter tuning. Furthermore, we utilize a fast and parallel merge phase to get even more dense and accurate communities. Conducted experiments on real-world and artificial networks, indicates the better accuracy and low execution time of PLDLS in comparison with other examined methods.</abstract><cop>New York</cop><pub>Elsevier Ltd</pub><doi>10.1016/j.eswa.2021.115377</doi></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0957-4174 |
ispartof | Expert systems with applications, 2021-11, Vol.183, p.115377, Article 115377 |
issn | 0957-4174 1873-6793 |
language | eng |
recordid | cdi_proquest_journals_2579413735 |
source | Elsevier ScienceDirect Journals |
subjects | Algorithms Complexity Diffusion rate Iterative methods Label diffusion Label selection Labels Local similarity Nodes Parallel community detection Social networks Spark |
title | PLDLS: A novel parallel label diffusion and label Selection-based community detection algorithm based on Spark in social networks |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-21T00%3A43%3A22IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=PLDLS:%20A%20novel%20parallel%20label%20diffusion%20and%20label%20Selection-based%20community%20detection%20algorithm%20based%20on%20Spark%20in%20social%20networks&rft.jtitle=Expert%20systems%20with%20applications&rft.au=Roghani,%20Hamid&rft.date=2021-11-30&rft.volume=183&rft.spage=115377&rft.pages=115377-&rft.artnum=115377&rft.issn=0957-4174&rft.eissn=1873-6793&rft_id=info:doi/10.1016/j.eswa.2021.115377&rft_dat=%3Cproquest_cross%3E2579413735%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2579413735&rft_id=info:pmid/&rft_els_id=S0957417421008034&rfr_iscdi=true |