BotFinder: a novel framework for social bots detection in online social networks based on graph embedding and community detection

With the widespread popularity of online social networks (OSNs), the number of users has also increased exponentially in recent years. At the same time, Social bots, i.e. accounts that controlled by program, are also on the rise. Service providers of OSNs often use them to keep social networks activ...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:World wide web (Bussum) 2023-07, Vol.26 (4), p.1793-1809
Hauptverfasser: Li, Shudong, Zhao, Chuanyu, Li, Qing, Huang, Jiuming, Zhao, Dawei, Zhu, Peican
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 1809
container_issue 4
container_start_page 1793
container_title World wide web (Bussum)
container_volume 26
creator Li, Shudong
Zhao, Chuanyu
Li, Qing
Huang, Jiuming
Zhao, Dawei
Zhu, Peican
description With the widespread popularity of online social networks (OSNs), the number of users has also increased exponentially in recent years. At the same time, Social bots, i.e. accounts that controlled by program, are also on the rise. Service providers of OSNs often use them to keep social networks active. Meanwhile, some social bots are also registered for malicious purposes. It is necessary to detect these malicious social bots to present a real public opinion environment. We propose BotFinder, a framework to detect malicious social bots in OSNs. Specifically, it combines machine learning and graph methods so that the potential features of social bots can be effectively extracted. Regarding the feature engineering, we generate second order features and use coding methods to encode variables that have high cardinality. These features make full use of both labelled and unlabeled samples. With respect to the graphs, we firstly generate node vectors through embedding method, following which the similarity between vectors of humans and bots can be further calculated; Then, we use an unsupervised method to diffuse labels and thus the performance can be improved again. To valid the performance of the proposed method, we conduct extensive experiments on the dataset provided by an artificial intelligence contest which is composed of over eight million records of users. Results show that our approach reaches a F1-score of 0.8850, which is much better compared to the state of the art.
doi_str_mv 10.1007/s11280-022-01114-2
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2842280421</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2842280421</sourcerecordid><originalsourceid>FETCH-LOGICAL-c363t-1c89ca97eb1979e1dfd727a809a9b8934a3480e51229612ba799e790073b715d3</originalsourceid><addsrcrecordid>eNp9kE9LwzAYxosoOKdfwFPAczVv0jaNNxWnwsCLgreQNm9nZ5vMpFN29JubWWU3T-8Dz58XfklyCvQcKBUXAYCVNKWMpRQAspTtJRPIBU8hA74fNS-LqPOXw-QohCWltOASJsnXtRtmrTXoL4km1n1gRxqve_x0_o00zpPg6lZ3pHJDIAYHrIfWWdJa4mzXWvzzLQ7bSiCVDmiiSRZer14J9hUa09oF0daQ2vX92rbDZjd1nBw0ugt48nunyfPs9unmPp0_3j3cXM3Tmhd8SKEuZa2lwAqkkAimMYIJXVKpZVVKnmmelRRzYEwWwCotpEQhIxxeCcgNnyZn4-7Ku_c1hkEt3drb-FKxMmMRX8YgptiYqr0LwWOjVr7ttd8ooGqLWo2oVUStflArFkt8LIUYtgv0u-l_Wt9QVYJh</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2842280421</pqid></control><display><type>article</type><title>BotFinder: a novel framework for social bots detection in online social networks based on graph embedding and community detection</title><source>SpringerLink Journals</source><creator>Li, Shudong ; Zhao, Chuanyu ; Li, Qing ; Huang, Jiuming ; Zhao, Dawei ; Zhu, Peican</creator><creatorcontrib>Li, Shudong ; Zhao, Chuanyu ; Li, Qing ; Huang, Jiuming ; Zhao, Dawei ; Zhu, Peican</creatorcontrib><description>With the widespread popularity of online social networks (OSNs), the number of users has also increased exponentially in recent years. At the same time, Social bots, i.e. accounts that controlled by program, are also on the rise. Service providers of OSNs often use them to keep social networks active. Meanwhile, some social bots are also registered for malicious purposes. It is necessary to detect these malicious social bots to present a real public opinion environment. We propose BotFinder, a framework to detect malicious social bots in OSNs. Specifically, it combines machine learning and graph methods so that the potential features of social bots can be effectively extracted. Regarding the feature engineering, we generate second order features and use coding methods to encode variables that have high cardinality. These features make full use of both labelled and unlabeled samples. With respect to the graphs, we firstly generate node vectors through embedding method, following which the similarity between vectors of humans and bots can be further calculated; Then, we use an unsupervised method to diffuse labels and thus the performance can be improved again. To valid the performance of the proposed method, we conduct extensive experiments on the dataset provided by an artificial intelligence contest which is composed of over eight million records of users. Results show that our approach reaches a F1-score of 0.8850, which is much better compared to the state of the art.</description><identifier>ISSN: 1386-145X</identifier><identifier>EISSN: 1573-1413</identifier><identifier>DOI: 10.1007/s11280-022-01114-2</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Algorithms ; Artificial intelligence ; Computer Science ; Database Management ; Datasets ; Deep learning ; Embedding ; Engineering ; Information Systems Applications (incl.Internet) ; Machine learning ; Operating Systems ; Social networks ; Software agents ; User behavior ; World Wide Web</subject><ispartof>World wide web (Bussum), 2023-07, Vol.26 (4), p.1793-1809</ispartof><rights>The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c363t-1c89ca97eb1979e1dfd727a809a9b8934a3480e51229612ba799e790073b715d3</citedby><cites>FETCH-LOGICAL-c363t-1c89ca97eb1979e1dfd727a809a9b8934a3480e51229612ba799e790073b715d3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s11280-022-01114-2$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s11280-022-01114-2$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,780,784,27924,27925,41488,42557,51319</link.rule.ids></links><search><creatorcontrib>Li, Shudong</creatorcontrib><creatorcontrib>Zhao, Chuanyu</creatorcontrib><creatorcontrib>Li, Qing</creatorcontrib><creatorcontrib>Huang, Jiuming</creatorcontrib><creatorcontrib>Zhao, Dawei</creatorcontrib><creatorcontrib>Zhu, Peican</creatorcontrib><title>BotFinder: a novel framework for social bots detection in online social networks based on graph embedding and community detection</title><title>World wide web (Bussum)</title><addtitle>World Wide Web</addtitle><description>With the widespread popularity of online social networks (OSNs), the number of users has also increased exponentially in recent years. At the same time, Social bots, i.e. accounts that controlled by program, are also on the rise. Service providers of OSNs often use them to keep social networks active. Meanwhile, some social bots are also registered for malicious purposes. It is necessary to detect these malicious social bots to present a real public opinion environment. We propose BotFinder, a framework to detect malicious social bots in OSNs. Specifically, it combines machine learning and graph methods so that the potential features of social bots can be effectively extracted. Regarding the feature engineering, we generate second order features and use coding methods to encode variables that have high cardinality. These features make full use of both labelled and unlabeled samples. With respect to the graphs, we firstly generate node vectors through embedding method, following which the similarity between vectors of humans and bots can be further calculated; Then, we use an unsupervised method to diffuse labels and thus the performance can be improved again. To valid the performance of the proposed method, we conduct extensive experiments on the dataset provided by an artificial intelligence contest which is composed of over eight million records of users. Results show that our approach reaches a F1-score of 0.8850, which is much better compared to the state of the art.</description><subject>Algorithms</subject><subject>Artificial intelligence</subject><subject>Computer Science</subject><subject>Database Management</subject><subject>Datasets</subject><subject>Deep learning</subject><subject>Embedding</subject><subject>Engineering</subject><subject>Information Systems Applications (incl.Internet)</subject><subject>Machine learning</subject><subject>Operating Systems</subject><subject>Social networks</subject><subject>Software agents</subject><subject>User behavior</subject><subject>World Wide Web</subject><issn>1386-145X</issn><issn>1573-1413</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><recordid>eNp9kE9LwzAYxosoOKdfwFPAczVv0jaNNxWnwsCLgreQNm9nZ5vMpFN29JubWWU3T-8Dz58XfklyCvQcKBUXAYCVNKWMpRQAspTtJRPIBU8hA74fNS-LqPOXw-QohCWltOASJsnXtRtmrTXoL4km1n1gRxqve_x0_o00zpPg6lZ3pHJDIAYHrIfWWdJa4mzXWvzzLQ7bSiCVDmiiSRZer14J9hUa09oF0daQ2vX92rbDZjd1nBw0ugt48nunyfPs9unmPp0_3j3cXM3Tmhd8SKEuZa2lwAqkkAimMYIJXVKpZVVKnmmelRRzYEwWwCotpEQhIxxeCcgNnyZn4-7Ku_c1hkEt3drb-FKxMmMRX8YgptiYqr0LwWOjVr7ttd8ooGqLWo2oVUStflArFkt8LIUYtgv0u-l_Wt9QVYJh</recordid><startdate>20230701</startdate><enddate>20230701</enddate><creator>Li, Shudong</creator><creator>Zhao, Chuanyu</creator><creator>Li, Qing</creator><creator>Huang, Jiuming</creator><creator>Zhao, Dawei</creator><creator>Zhu, Peican</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7SC</scope><scope>7XB</scope><scope>8AL</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>M0N</scope><scope>P5Z</scope><scope>P62</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>Q9U</scope></search><sort><creationdate>20230701</creationdate><title>BotFinder: a novel framework for social bots detection in online social networks based on graph embedding and community detection</title><author>Li, Shudong ; Zhao, Chuanyu ; Li, Qing ; Huang, Jiuming ; Zhao, Dawei ; Zhu, Peican</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c363t-1c89ca97eb1979e1dfd727a809a9b8934a3480e51229612ba799e790073b715d3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Algorithms</topic><topic>Artificial intelligence</topic><topic>Computer Science</topic><topic>Database Management</topic><topic>Datasets</topic><topic>Deep learning</topic><topic>Embedding</topic><topic>Engineering</topic><topic>Information Systems Applications (incl.Internet)</topic><topic>Machine learning</topic><topic>Operating Systems</topic><topic>Social networks</topic><topic>Software agents</topic><topic>User behavior</topic><topic>World Wide Web</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Li, Shudong</creatorcontrib><creatorcontrib>Zhao, Chuanyu</creatorcontrib><creatorcontrib>Li, Qing</creatorcontrib><creatorcontrib>Huang, Jiuming</creatorcontrib><creatorcontrib>Zhao, Dawei</creatorcontrib><creatorcontrib>Zhu, Peican</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Computer and Information Systems Abstracts</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Computing Database (Alumni Edition)</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Computing Database</collection><collection>Advanced Technologies &amp; Aerospace Database</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central Basic</collection><jtitle>World wide web (Bussum)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Li, Shudong</au><au>Zhao, Chuanyu</au><au>Li, Qing</au><au>Huang, Jiuming</au><au>Zhao, Dawei</au><au>Zhu, Peican</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>BotFinder: a novel framework for social bots detection in online social networks based on graph embedding and community detection</atitle><jtitle>World wide web (Bussum)</jtitle><stitle>World Wide Web</stitle><date>2023-07-01</date><risdate>2023</risdate><volume>26</volume><issue>4</issue><spage>1793</spage><epage>1809</epage><pages>1793-1809</pages><issn>1386-145X</issn><eissn>1573-1413</eissn><abstract>With the widespread popularity of online social networks (OSNs), the number of users has also increased exponentially in recent years. At the same time, Social bots, i.e. accounts that controlled by program, are also on the rise. Service providers of OSNs often use them to keep social networks active. Meanwhile, some social bots are also registered for malicious purposes. It is necessary to detect these malicious social bots to present a real public opinion environment. We propose BotFinder, a framework to detect malicious social bots in OSNs. Specifically, it combines machine learning and graph methods so that the potential features of social bots can be effectively extracted. Regarding the feature engineering, we generate second order features and use coding methods to encode variables that have high cardinality. These features make full use of both labelled and unlabeled samples. With respect to the graphs, we firstly generate node vectors through embedding method, following which the similarity between vectors of humans and bots can be further calculated; Then, we use an unsupervised method to diffuse labels and thus the performance can be improved again. To valid the performance of the proposed method, we conduct extensive experiments on the dataset provided by an artificial intelligence contest which is composed of over eight million records of users. Results show that our approach reaches a F1-score of 0.8850, which is much better compared to the state of the art.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s11280-022-01114-2</doi><tpages>17</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1386-145X
ispartof World wide web (Bussum), 2023-07, Vol.26 (4), p.1793-1809
issn 1386-145X
1573-1413
language eng
recordid cdi_proquest_journals_2842280421
source SpringerLink Journals
subjects Algorithms
Artificial intelligence
Computer Science
Database Management
Datasets
Deep learning
Embedding
Engineering
Information Systems Applications (incl.Internet)
Machine learning
Operating Systems
Social networks
Software agents
User behavior
World Wide Web
title BotFinder: a novel framework for social bots detection in online social networks based on graph embedding and community detection
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-27T01%3A29%3A27IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=BotFinder:%20a%20novel%20framework%20for%20social%20bots%20detection%20in%20online%20social%20networks%20based%20on%20graph%20embedding%20and%20community%20detection&rft.jtitle=World%20wide%20web%20(Bussum)&rft.au=Li,%20Shudong&rft.date=2023-07-01&rft.volume=26&rft.issue=4&rft.spage=1793&rft.epage=1809&rft.pages=1793-1809&rft.issn=1386-145X&rft.eissn=1573-1413&rft_id=info:doi/10.1007/s11280-022-01114-2&rft_dat=%3Cproquest_cross%3E2842280421%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2842280421&rft_id=info:pmid/&rfr_iscdi=true