BotFinder: a novel framework for social bots detection in online social networks based on graph embedding and community detection

With the widespread popularity of online social networks (OSNs), the number of users has also increased exponentially in recent years. At the same time, Social bots, i.e. accounts that controlled by program, are also on the rise. Service providers of OSNs often use them to keep social networks activ...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	World wide web (Bussum) 2023-07, Vol.26 (4), p.1793-1809
Hauptverfasser:	Li, Shudong, Zhao, Chuanyu, Li, Qing, Huang, Jiuming, Zhao, Dawei, Zhu, Peican
Format:	Artikel
Sprache:	eng
Schlagworte:	Algorithms Artificial intelligence Computer Science Database Management Datasets Deep learning Embedding Engineering Information Systems Applications (incl.Internet) Machine learning Operating Systems Social networks Software agents User behavior World Wide Web
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	1809
container_issue	4
container_start_page	1793
container_title	World wide web (Bussum)
container_volume	26
creator	Li, Shudong Zhao, Chuanyu Li, Qing Huang, Jiuming Zhao, Dawei Zhu, Peican
description	With the widespread popularity of online social networks (OSNs), the number of users has also increased exponentially in recent years. At the same time, Social bots, i.e. accounts that controlled by program, are also on the rise. Service providers of OSNs often use them to keep social networks active. Meanwhile, some social bots are also registered for malicious purposes. It is necessary to detect these malicious social bots to present a real public opinion environment. We propose BotFinder, a framework to detect malicious social bots in OSNs. Specifically, it combines machine learning and graph methods so that the potential features of social bots can be effectively extracted. Regarding the feature engineering, we generate second order features and use coding methods to encode variables that have high cardinality. These features make full use of both labelled and unlabeled samples. With respect to the graphs, we firstly generate node vectors through embedding method, following which the similarity between vectors of humans and bots can be further calculated; Then, we use an unsupervised method to diffuse labels and thus the performance can be improved again. To valid the performance of the proposed method, we conduct extensive experiments on the dataset provided by an artificial intelligence contest which is composed of over eight million records of users. Results show that our approach reaches a F1-score of 0.8850, which is much better compared to the state of the art.
doi_str_mv	10.1007/s11280-022-01114-2
format	Article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2842280421</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2842280421</sourcerecordid><originalsourceid>FETCH-LOGICAL-c363t-1c89ca97eb1979e1dfd727a809a9b8934a3480e51229612ba799e790073b715d3</originalsourceid><addsrcrecordid>eNp9kE9LwzAYxosoOKdfwFPAczVv0jaNNxWnwsCLgreQNm9nZ5vMpFN29JubWWU3T-8Dz58XfklyCvQcKBUXAYCVNKWMpRQAspTtJRPIBU8hA74fNS-LqPOXw-QohCWltOASJsnXtRtmrTXoL4km1n1gRxqve_x0_o00zpPg6lZ3pHJDIAYHrIfWWdJa4mzXWvzzLQ7bSiCVDmiiSRZer14J9hUa09oF0daQ2vX92rbDZjd1nBw0ugt48nunyfPs9unmPp0_3j3cXM3Tmhd8SKEuZa2lwAqkkAimMYIJXVKpZVVKnmmelRRzYEwWwCotpEQhIxxeCcgNnyZn4-7Ku_c1hkEt3drb-FKxMmMRX8YgptiYqr0LwWOjVr7ttd8ooGqLWo2oVUStflArFkt8LIUYtgv0u-l_Wt9QVYJh</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2842280421</pqid></control><display><type>article</type><title>BotFinder: a novel framework for social bots detection in online social networks based on graph embedding and community detection</title><source>SpringerLink Journals</source><creator>Li, Shudong ; Zhao, Chuanyu ; Li, Qing ; Huang, Jiuming ; Zhao, Dawei ; Zhu, Peican</creator><creatorcontrib>Li, Shudong ; Zhao, Chuanyu ; Li, Qing ; Huang, Jiuming ; Zhao, Dawei ; Zhu, Peican</creatorcontrib><description>With the widespread popularity of online social networks (OSNs), the number of users has also increased exponentially in recent years. At the same time, Social bots, i.e. accounts that controlled by program, are also on the rise. Service providers of OSNs often use them to keep social networks active. Meanwhile, some social bots are also registered for malicious purposes. It is necessary to detect these malicious social bots to present a real public opinion environment. We propose BotFinder, a framework to detect malicious social bots in OSNs. Specifically, it combines machine learning and graph methods so that the potential features of social bots can be effectively extracted. Regarding the feature engineering, we generate second order features and use coding methods to encode variables that have high cardinality. These features make full use of both labelled and unlabeled samples. With respect to the graphs, we firstly generate node vectors through embedding method, following which the similarity between vectors of humans and bots can be further calculated; Then, we use an unsupervised method to diffuse labels and thus the performance can be improved again. To valid the performance of the proposed method, we conduct extensive experiments on the dataset provided by an artificial intelligence contest which is composed of over eight million records of users. Results show that our approach reaches a F1-score of 0.8850, which is much better compared to the state of the art.</description><identifier>ISSN: 1386-145X</identifier><identifier>EISSN: 1573-1413</identifier><identifier>DOI: 10.1007/s11280-022-01114-2</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Algorithms ; Artificial intelligence ; Computer Science ; Database Management ; Datasets ; Deep learning ; Embedding ; Engineering ; Information Systems Applications (incl.Internet) ; Machine learning ; Operating Systems ; Social networks ; Software agents ; User behavior ; World Wide Web</subject><ispartof>World wide web (Bussum), 2023-07, Vol.26 (4), p.1793-1809</ispartof><rights>The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c363t-1c89ca97eb1979e1dfd727a809a9b8934a3480e51229612ba799e790073b715d3</citedby><cites>FETCH-LOGICAL-c363t-1c89ca97eb1979e1dfd727a809a9b8934a3480e51229612ba799e790073b715d3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s11280-022-01114-2$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s11280-022-01114-2$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,780,784,27924,27925,41488,42557,51319</link.rule.ids></links><search><creatorcontrib>Li, Shudong</creatorcontrib><creatorcontrib>Zhao, Chuanyu</creatorcontrib><creatorcontrib>Li, Qing</creatorcontrib><creatorcontrib>Huang, Jiuming</creatorcontrib><creatorcontrib>Zhao, Dawei</creatorcontrib><creatorcontrib>Zhu, Peican</creatorcontrib><title>BotFinder: a novel framework for social bots detection in online social networks based on graph embedding and community detection</title><title>World wide web (Bussum)</title><addtitle>World Wide Web</addtitle><description>With the widespread popularity of online social networks (OSNs), the number of users has also increased exponentially in recent years. At the same time, Social bots, i.e. accounts that controlled by program, are also on the rise. Service providers of OSNs often use them to keep social networks active. Meanwhile, some social bots are also registered for malicious purposes. It is necessary to detect these malicious social bots to present a real public opinion environment. We propose BotFinder, a framework to detect malicious social bots in OSNs. Specifically, it combines machine learning and graph methods so that the potential features of social bots can be effectively extracted. Regarding the feature engineering, we generate second order features and use coding methods to encode variables that have high cardinality. These features make full use of both labelled and unlabeled samples. With respect to the graphs, we firstly generate node vectors through embedding method, following which the similarity between vectors of humans and bots can be further calculated; Then, we use an unsupervised method to diffuse labels and thus the performance can be improved again. To valid the performance of the proposed method, we conduct extensive experiments on the dataset provided by an artificial intelligence contest which is composed of over eight million records of users. Results show that our approach reaches a F1-score of 0.8850, which is much better compared to the state of the art.</description><subject>Algorithms</subject><subject>Artificial intelligence</subject><subject>Computer Science</subject><subject>Database Management</subject><subject>Datasets</subject><subject>Deep learning</subject><subject>Embedding</subject><subject>Engineering</subject><subject>Information Systems Applications (incl.Internet)</subject><subject>Machine learning</subject><subject>Operating Systems</subject><subject>Social networks</subject><subject>Software agents</subject><subject>User behavior</subject><subject>World Wide Web</subject><issn>1386-145X</issn><issn>1573-1413</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><recordid>eNp9kE9LwzAYxosoOKdfwFPAczVv0jaNNxWnwsCLgreQNm9nZ5vMpFN29JubWWU3T-8Dz58XfklyCvQcKBUXAYCVNKWMpRQAspTtJRPIBU8hA74fNS-LqPOXw-QohCWltOASJsnXtRtmrTXoL4km1n1gRxqve_x0_o00zpPg6lZ3pHJDIAYHrIfWWdJa4mzXWvzzLQ7bSiCVDmiiSRZer14J9hUa09oF0daQ2vX92rbDZjd1nBw0ugt48nunyfPs9unmPp0_3j3cXM3Tmhd8SKEuZa2lwAqkkAimMYIJXVKpZVVKnmmelRRzYEwWwCotpEQhIxxeCcgNnyZn4-7Ku_c1hkEt3drb-FKxMmMRX8YgptiYqr0LwWOjVr7ttd8ooGqLWo2oVUStflArFkt8LIUYtgv0u-l_Wt9QVYJh</recordid><startdate>20230701</startdate><enddate>20230701</enddate><creator>Li, Shudong</creator><creator>Zhao, Chuanyu</creator><creator>Li, Qing</creator><creator>Huang, Jiuming</creator><creator>Zhao, Dawei</creator><creator>Zhu, Peican</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7SC</scope><scope>7XB</scope><scope>8AL</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>M0N</scope><scope>P5Z</scope><scope>P62</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>Q9U</scope></search><sort><creationdate>20230701</creationdate><title>BotFinder: a novel framework for social bots detection in online social networks based on graph embedding and community detection</title><author>Li, Shudong ; Zhao, Chuanyu ; Li, Qing ; Huang, Jiuming ; Zhao, Dawei ; Zhu, Peican</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c363t-1c89ca97eb1979e1dfd727a809a9b8934a3480e51229612ba799e790073b715d3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Algorithms</topic><topic>Artificial intelligence</topic><topic>Computer Science</topic><topic>Database Management</topic><topic>Datasets</topic><topic>Deep learning</topic><topic>Embedding</topic><topic>Engineering</topic><topic>Information Systems Applications (incl.Internet)</topic><topic>Machine learning</topic><topic>Operating Systems</topic><topic>Social networks</topic><topic>Software agents</topic><topic>User behavior</topic><topic>World Wide Web</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Li, Shudong</creatorcontrib><creatorcontrib>Zhao, Chuanyu</creatorcontrib><creatorcontrib>Li, Qing</creatorcontrib><creatorcontrib>Huang, Jiuming</creatorcontrib><creatorcontrib>Zhao, Dawei</creatorcontrib><creatorcontrib>Zhu, Peican</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Computer and Information Systems Abstracts</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Computing Database (Alumni Edition)</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Computing Database</collection><collection>Advanced Technologies & Aerospace Database</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central Basic</collection><jtitle>World wide web (Bussum)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Li, Shudong</au><au>Zhao, Chuanyu</au><au>Li, Qing</au><au>Huang, Jiuming</au><au>Zhao, Dawei</au><au>Zhu, Peican</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>BotFinder: a novel framework for social bots detection in online social networks based on graph embedding and community detection</atitle><jtitle>World wide web (Bussum)</jtitle><stitle>World Wide Web</stitle><date>2023-07-01</date><risdate>2023</risdate><volume>26</volume><issue>4</issue><spage>1793</spage><epage>1809</epage><pages>1793-1809</pages><issn>1386-145X</issn><eissn>1573-1413</eissn><abstract>With the widespread popularity of online social networks (OSNs), the number of users has also increased exponentially in recent years. At the same time, Social bots, i.e. accounts that controlled by program, are also on the rise. Service providers of OSNs often use them to keep social networks active. Meanwhile, some social bots are also registered for malicious purposes. It is necessary to detect these malicious social bots to present a real public opinion environment. We propose BotFinder, a framework to detect malicious social bots in OSNs. Specifically, it combines machine learning and graph methods so that the potential features of social bots can be effectively extracted. Regarding the feature engineering, we generate second order features and use coding methods to encode variables that have high cardinality. These features make full use of both labelled and unlabeled samples. With respect to the graphs, we firstly generate node vectors through embedding method, following which the similarity between vectors of humans and bots can be further calculated; Then, we use an unsupervised method to diffuse labels and thus the performance can be improved again. To valid the performance of the proposed method, we conduct extensive experiments on the dataset provided by an artificial intelligence contest which is composed of over eight million records of users. Results show that our approach reaches a F1-score of 0.8850, which is much better compared to the state of the art.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s11280-022-01114-2</doi><tpages>17</tpages><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 1386-145X
ispartof	World wide web (Bussum), 2023-07, Vol.26 (4), p.1793-1809
issn	1386-145X 1573-1413
language	eng
recordid	cdi_proquest_journals_2842280421
source	SpringerLink Journals
subjects	Algorithms Artificial intelligence Computer Science Database Management Datasets Deep learning Embedding Engineering Information Systems Applications (incl.Internet) Machine learning Operating Systems Social networks Software agents User behavior World Wide Web
title	BotFinder: a novel framework for social bots detection in online social networks based on graph embedding and community detection
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-27T01%3A29%3A27IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=BotFinder:%20a%20novel%20framework%20for%20social%20bots%20detection%20in%20online%20social%20networks%20based%20on%20graph%20embedding%20and%20community%20detection&rft.jtitle=World%20wide%20web%20(Bussum)&rft.au=Li,%20Shudong&rft.date=2023-07-01&rft.volume=26&rft.issue=4&rft.spage=1793&rft.epage=1809&rft.pages=1793-1809&rft.issn=1386-145X&rft.eissn=1573-1413&rft_id=info:doi/10.1007/s11280-022-01114-2&rft_dat=%3Cproquest_cross%3E2842280421%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2842280421&rft_id=info:pmid/&rfr_iscdi=true