BotFinder: a novel framework for social bots detection in online social networks based on graph embedding and community detection
With the widespread popularity of online social networks (OSNs), the number of users has increased exponentially in recent years. At the same time, social bots, i.e. accounts controlled by programs, are also on the rise. Service providers of OSNs often use them to keep social networks activ...
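The full abstract, given in the description field of this record, mentions two graph-side steps: computing the similarity between node vectors of humans and bots, and diffusing labels with an unsupervised method. The record does not say which embedding, similarity measure, or diffusion algorithm the paper uses, so the following Python sketch is only an illustration under stated assumptions: cosine similarity stands in for the similarity measure, a neighbor majority vote stands in for label diffusion, and all function names are hypothetical.

```python
import math
from collections import Counter


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors (assumed measure)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0  # similarity is undefined for a zero vector; treat as 0
    return dot / (norm_u * norm_v)


def diffuse_labels(edges, seed_labels, rounds=10):
    """Spread known labels to unlabeled nodes by neighbor majority vote.

    This is a simplified stand-in for label diffusion, not the paper's method.
    """
    neighbors = {}
    for a, b in edges:
        neighbors.setdefault(a, set()).add(b)
        neighbors.setdefault(b, set()).add(a)

    labels = dict(seed_labels)
    for _ in range(rounds):
        updates = {}
        for node in sorted(neighbors):
            if node in labels:
                continue  # a label that is already known is never overwritten
            votes = Counter(labels[n] for n in neighbors[node] if n in labels)
            if votes:
                # On a tie, the alphabetically first label wins (deterministic).
                updates[node] = max(sorted(votes), key=lambda lab: votes[lab])
        if not updates:
            break  # nothing changed, so further rounds would not help
        labels.update(updates)
    return labels


# Hypothetical toy graph: two seed accounts, three unlabeled ones.
toy_edges = [("a", "b"), ("b", "c"), ("d", "e")]
toy_seeds = {"a": "bot", "e": "human"}
toy_result = diffuse_labels(toy_edges, toy_seeds)
```

In this toy graph the label of `a` reaches `c` through `b` over two rounds, while `d` takes its label from `e`; nodes with no labeled neighbor in any round would simply stay unlabeled.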
Published in: | World wide web (Bussum) 2023-07, Vol.26 (4), p.1793-1809 |
---|---|
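Main authors: | Li, Shudong ; Zhao, Chuanyu ; Li, Qing ; Huang, Jiuming ; Zhao, Dawei ; Zhu, Peican |
Format: | Article |
Language: | eng |
Subjects: | Algorithms ; Artificial intelligence ; Computer Science ; Database Management ; Datasets ; Deep learning ; Embedding ; Engineering ; Information Systems Applications (incl.Internet) ; Machine learning ; Operating Systems ; Social networks ; Software agents ; User behavior ; World Wide Web |
Online access: | Full text |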
container_end_page | 1809 |
---|---|
container_issue | 4 |
container_start_page | 1793 |
container_title | World wide web (Bussum) |
container_volume | 26 |
creator | Li, Shudong ; Zhao, Chuanyu ; Li, Qing ; Huang, Jiuming ; Zhao, Dawei ; Zhu, Peican |
description | With the widespread popularity of online social networks (OSNs), the number of users has increased exponentially in recent years. At the same time, social bots, i.e. accounts controlled by programs, are also on the rise. Service providers of OSNs often use them to keep social networks active. Meanwhile, some social bots are registered for malicious purposes. It is necessary to detect these malicious social bots to present a genuine public opinion environment. We propose BotFinder, a framework to detect malicious social bots in OSNs. Specifically, it combines machine learning and graph methods so that the potential features of social bots can be effectively extracted. Regarding feature engineering, we generate second-order features and use encoding methods to encode high-cardinality variables. These features make full use of both labelled and unlabelled samples. With respect to the graphs, we first generate node vectors through an embedding method, after which the similarity between the vectors of humans and bots can be calculated; then, we use an unsupervised method to diffuse labels, which further improves performance. To validate the performance of the proposed method, we conduct extensive experiments on a dataset provided by an artificial intelligence contest, which comprises over eight million user records. Results show that our approach reaches an F1-score of 0.8850, which is much better than the state of the art. |
doi_str_mv | 10.1007/s11280-022-01114-2 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2842280421</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2842280421</sourcerecordid><originalsourceid>FETCH-LOGICAL-c363t-1c89ca97eb1979e1dfd727a809a9b8934a3480e51229612ba799e790073b715d3</originalsourceid><addsrcrecordid>eNp9kE9LwzAYxosoOKdfwFPAczVv0jaNNxWnwsCLgreQNm9nZ5vMpFN29JubWWU3T-8Dz58XfklyCvQcKBUXAYCVNKWMpRQAspTtJRPIBU8hA74fNS-LqPOXw-QohCWltOASJsnXtRtmrTXoL4km1n1gRxqve_x0_o00zpPg6lZ3pHJDIAYHrIfWWdJa4mzXWvzzLQ7bSiCVDmiiSRZer14J9hUa09oF0daQ2vX92rbDZjd1nBw0ugt48nunyfPs9unmPp0_3j3cXM3Tmhd8SKEuZa2lwAqkkAimMYIJXVKpZVVKnmmelRRzYEwWwCotpEQhIxxeCcgNnyZn4-7Ku_c1hkEt3drb-FKxMmMRX8YgptiYqr0LwWOjVr7ttd8ooGqLWo2oVUStflArFkt8LIUYtgv0u-l_Wt9QVYJh</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2842280421</pqid></control><display><type>article</type><title>BotFinder: a novel framework for social bots detection in online social networks based on graph embedding and community detection</title><source>SpringerLink Journals</source><creator>Li, Shudong ; Zhao, Chuanyu ; Li, Qing ; Huang, Jiuming ; Zhao, Dawei ; Zhu, Peican</creator><creatorcontrib>Li, Shudong ; Zhao, Chuanyu ; Li, Qing ; Huang, Jiuming ; Zhao, Dawei ; Zhu, Peican</creatorcontrib><description>With the widespread popularity of online social networks (OSNs), the number of users has also increased exponentially in recent years. At the same time, Social bots, i.e. accounts that controlled by program, are also on the rise. Service providers of OSNs often use them to keep social networks active. Meanwhile, some social bots are also registered for malicious purposes. It is necessary to detect these malicious social bots to present a real public opinion environment. We propose BotFinder, a framework to detect malicious social bots in OSNs. 
Specifically, it combines machine learning and graph methods so that the potential features of social bots can be effectively extracted. Regarding the feature engineering, we generate second order features and use coding methods to encode variables that have high cardinality. These features make full use of both labelled and unlabeled samples. With respect to the graphs, we firstly generate node vectors through embedding method, following which the similarity between vectors of humans and bots can be further calculated; Then, we use an unsupervised method to diffuse labels and thus the performance can be improved again. To valid the performance of the proposed method, we conduct extensive experiments on the dataset provided by an artificial intelligence contest which is composed of over eight million records of users. Results show that our approach reaches a F1-score of 0.8850, which is much better compared to the state of the art.</description><identifier>ISSN: 1386-145X</identifier><identifier>EISSN: 1573-1413</identifier><identifier>DOI: 10.1007/s11280-022-01114-2</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Algorithms ; Artificial intelligence ; Computer Science ; Database Management ; Datasets ; Deep learning ; Embedding ; Engineering ; Information Systems Applications (incl.Internet) ; Machine learning ; Operating Systems ; Social networks ; Software agents ; User behavior ; World Wide Web</subject><ispartof>World wide web (Bussum), 2023-07, Vol.26 (4), p.1793-1809</ispartof><rights>The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022. Springer Nature or its licensor (e.g. 
a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c363t-1c89ca97eb1979e1dfd727a809a9b8934a3480e51229612ba799e790073b715d3</citedby><cites>FETCH-LOGICAL-c363t-1c89ca97eb1979e1dfd727a809a9b8934a3480e51229612ba799e790073b715d3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s11280-022-01114-2$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s11280-022-01114-2$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,780,784,27924,27925,41488,42557,51319</link.rule.ids></links><search><creatorcontrib>Li, Shudong</creatorcontrib><creatorcontrib>Zhao, Chuanyu</creatorcontrib><creatorcontrib>Li, Qing</creatorcontrib><creatorcontrib>Huang, Jiuming</creatorcontrib><creatorcontrib>Zhao, Dawei</creatorcontrib><creatorcontrib>Zhu, Peican</creatorcontrib><title>BotFinder: a novel framework for social bots detection in online social networks based on graph embedding and community detection</title><title>World wide web (Bussum)</title><addtitle>World Wide Web</addtitle><description>With the widespread popularity of online social networks (OSNs), the number of users has also increased exponentially in recent years. At the same time, Social bots, i.e. accounts that controlled by program, are also on the rise. Service providers of OSNs often use them to keep social networks active. Meanwhile, some social bots are also registered for malicious purposes. 
It is necessary to detect these malicious social bots to present a real public opinion environment. We propose BotFinder, a framework to detect malicious social bots in OSNs. Specifically, it combines machine learning and graph methods so that the potential features of social bots can be effectively extracted. Regarding the feature engineering, we generate second order features and use coding methods to encode variables that have high cardinality. These features make full use of both labelled and unlabeled samples. With respect to the graphs, we firstly generate node vectors through embedding method, following which the similarity between vectors of humans and bots can be further calculated; Then, we use an unsupervised method to diffuse labels and thus the performance can be improved again. To valid the performance of the proposed method, we conduct extensive experiments on the dataset provided by an artificial intelligence contest which is composed of over eight million records of users. 
Results show that our approach reaches a F1-score of 0.8850, which is much better compared to the state of the art.</description><subject>Algorithms</subject><subject>Artificial intelligence</subject><subject>Computer Science</subject><subject>Database Management</subject><subject>Datasets</subject><subject>Deep learning</subject><subject>Embedding</subject><subject>Engineering</subject><subject>Information Systems Applications (incl.Internet)</subject><subject>Machine learning</subject><subject>Operating Systems</subject><subject>Social networks</subject><subject>Software agents</subject><subject>User behavior</subject><subject>World Wide Web</subject><issn>1386-145X</issn><issn>1573-1413</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><recordid>eNp9kE9LwzAYxosoOKdfwFPAczVv0jaNNxWnwsCLgreQNm9nZ5vMpFN29JubWWU3T-8Dz58XfklyCvQcKBUXAYCVNKWMpRQAspTtJRPIBU8hA74fNS-LqPOXw-QohCWltOASJsnXtRtmrTXoL4km1n1gRxqve_x0_o00zpPg6lZ3pHJDIAYHrIfWWdJa4mzXWvzzLQ7bSiCVDmiiSRZer14J9hUa09oF0daQ2vX92rbDZjd1nBw0ugt48nunyfPs9unmPp0_3j3cXM3Tmhd8SKEuZa2lwAqkkAimMYIJXVKpZVVKnmmelRRzYEwWwCotpEQhIxxeCcgNnyZn4-7Ku_c1hkEt3drb-FKxMmMRX8YgptiYqr0LwWOjVr7ttd8ooGqLWo2oVUStflArFkt8LIUYtgv0u-l_Wt9QVYJh</recordid><startdate>20230701</startdate><enddate>20230701</enddate><creator>Li, Shudong</creator><creator>Zhao, Chuanyu</creator><creator>Li, Qing</creator><creator>Huang, Jiuming</creator><creator>Zhao, Dawei</creator><creator>Zhu, Peican</creator><general>Springer US</general><general>Springer Nature 
B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7SC</scope><scope>7XB</scope><scope>8AL</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>M0N</scope><scope>P5Z</scope><scope>P62</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>Q9U</scope></search><sort><creationdate>20230701</creationdate><title>BotFinder: a novel framework for social bots detection in online social networks based on graph embedding and community detection</title><author>Li, Shudong ; Zhao, Chuanyu ; Li, Qing ; Huang, Jiuming ; Zhao, Dawei ; Zhu, Peican</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c363t-1c89ca97eb1979e1dfd727a809a9b8934a3480e51229612ba799e790073b715d3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Algorithms</topic><topic>Artificial intelligence</topic><topic>Computer Science</topic><topic>Database Management</topic><topic>Datasets</topic><topic>Deep learning</topic><topic>Embedding</topic><topic>Engineering</topic><topic>Information Systems Applications (incl.Internet)</topic><topic>Machine learning</topic><topic>Operating Systems</topic><topic>Social networks</topic><topic>Software agents</topic><topic>User behavior</topic><topic>World Wide Web</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Li, Shudong</creatorcontrib><creatorcontrib>Zhao, Chuanyu</creatorcontrib><creatorcontrib>Li, Qing</creatorcontrib><creatorcontrib>Huang, Jiuming</creatorcontrib><creatorcontrib>Zhao, Dawei</creatorcontrib><creatorcontrib>Zhu, 
Peican</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Computer and Information Systems Abstracts</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Computing Database (Alumni Edition)</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Computing Database</collection><collection>Advanced Technologies & Aerospace Database</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central Basic</collection><jtitle>World wide web (Bussum)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Li, Shudong</au><au>Zhao, 
Chuanyu</au><au>Li, Qing</au><au>Huang, Jiuming</au><au>Zhao, Dawei</au><au>Zhu, Peican</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>BotFinder: a novel framework for social bots detection in online social networks based on graph embedding and community detection</atitle><jtitle>World wide web (Bussum)</jtitle><stitle>World Wide Web</stitle><date>2023-07-01</date><risdate>2023</risdate><volume>26</volume><issue>4</issue><spage>1793</spage><epage>1809</epage><pages>1793-1809</pages><issn>1386-145X</issn><eissn>1573-1413</eissn><abstract>With the widespread popularity of online social networks (OSNs), the number of users has also increased exponentially in recent years. At the same time, Social bots, i.e. accounts that controlled by program, are also on the rise. Service providers of OSNs often use them to keep social networks active. Meanwhile, some social bots are also registered for malicious purposes. It is necessary to detect these malicious social bots to present a real public opinion environment. We propose BotFinder, a framework to detect malicious social bots in OSNs. Specifically, it combines machine learning and graph methods so that the potential features of social bots can be effectively extracted. Regarding the feature engineering, we generate second order features and use coding methods to encode variables that have high cardinality. These features make full use of both labelled and unlabeled samples. With respect to the graphs, we firstly generate node vectors through embedding method, following which the similarity between vectors of humans and bots can be further calculated; Then, we use an unsupervised method to diffuse labels and thus the performance can be improved again. To valid the performance of the proposed method, we conduct extensive experiments on the dataset provided by an artificial intelligence contest which is composed of over eight million records of users. 
Results show that our approach reaches a F1-score of 0.8850, which is much better compared to the state of the art.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s11280-022-01114-2</doi><tpages>17</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1386-145X |
ispartof | World wide web (Bussum), 2023-07, Vol.26 (4), p.1793-1809 |
issn | 1386-145X ; 1573-1413 |
language | eng |
recordid | cdi_proquest_journals_2842280421 |
source | SpringerLink Journals |
subjects | Algorithms ; Artificial intelligence ; Computer Science ; Database Management ; Datasets ; Deep learning ; Embedding ; Engineering ; Information Systems Applications (incl.Internet) ; Machine learning ; Operating Systems ; Social networks ; Software agents ; User behavior ; World Wide Web |
title | BotFinder: a novel framework for social bots detection in online social networks based on graph embedding and community detection |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-27T01%3A29%3A27IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=BotFinder:%20a%20novel%20framework%20for%20social%20bots%20detection%20in%20online%20social%20networks%20based%20on%20graph%20embedding%20and%20community%20detection&rft.jtitle=World%20wide%20web%20(Bussum)&rft.au=Li,%20Shudong&rft.date=2023-07-01&rft.volume=26&rft.issue=4&rft.spage=1793&rft.epage=1809&rft.pages=1793-1809&rft.issn=1386-145X&rft.eissn=1573-1413&rft_id=info:doi/10.1007/s11280-022-01114-2&rft_dat=%3Cproquest_cross%3E2842280421%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2842280421&rft_id=info:pmid/&rfr_iscdi=true |
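The url field above is an OpenURL query string (its `ctx_ver` parameter names Z39.88-2004), in which the bibliographic data travel as percent-encoded `rft.*` key/value pairs and the DOI as an `rft_id` value. As a minimal sketch of how such a link can be decoded, the following Python uses only the standard library; the function name `parse_openurl` is hypothetical, and the sample URL is a shortened version of the one in this record, not the full link.

```python
from urllib.parse import parse_qs, urlsplit


def parse_openurl(url):
    """Collect the rft.* referent fields and the DOI from an OpenURL link."""
    params = parse_qs(urlsplit(url).query)  # percent-decoding happens here

    # Keep the first value of every "rft.<name>" key, dropping the prefix.
    record = {
        key[len("rft."):]: values[0]
        for key, values in params.items()
        if key.startswith("rft.")
    }

    # rft_id may repeat; only the "info:doi/" identifier is kept.
    for identifier in params.get("rft_id", []):
        if identifier.startswith("info:doi/"):
            record["doi"] = identifier[len("info:doi/"):]
    return record


# Shortened sample built from parameters that appear in the record's url field.
sample_url = (
    "https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004"
    "&rft.genre=article&rft.jtitle=World%20wide%20web%20(Bussum)"
    "&rft.volume=26&rft.issue=4&rft.spage=1793&rft.epage=1809"
    "&rft.issn=1386-145X"
    "&rft_id=info:doi/10.1007/s11280-022-01114-2"
)
openurl_fields = parse_openurl(sample_url)
```

Parameters with empty values, such as `rft_id=info:pmid/` in the full link, carry no identifier and are ignored by the DOI check above.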