A spam worker detection approach based on heterogeneous network embedding in crowdsourcing platforms

•We model three collusion patterns of spam worker in crowdsourcing platform, and analyze the corresponding characteristics of spam workers.•We convert spam worker detection problem as a node classification problem in a crowdsourcing heterogeneous network, using network embedding and one-class SVM to...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Computer networks (Amsterdam, Netherlands : 1999) Netherlands : 1999), 2020-12, Vol.183, p.107587, Article 107587
Hauptverfasser:	Kuang, Li, Zhang, Huan, Shi, Ruyi, Liao, Zhifang, Yang, Xiaoxian
Format:	Artikel
Sprache:	eng
Schlagworte:	Algorithms Crowdsourcing Embedding Network embedding Nodes Platforms Random walk Spam worker detection Workers
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page	107587
container_title	Computer networks (Amsterdam, Netherlands : 1999)
container_volume	183
creator	Kuang, Li Zhang, Huan Shi, Ruyi Liao, Zhifang Yang, Xiaoxian
description	•We model three collusion patterns of spam worker in crowdsourcing platform, and analyze the corresponding characteristics of spam workers.•We convert spam worker detection problem as a node classification problem in a crowdsourcing heterogeneous network, using network embedding and one-class SVM to distinguish spam worker.•We design a variable-length random walk algorithm based on node centrality to improve the efficiency of network embedding. Due to the popularity of crowdsourcing, more crowds are participating in crowdsourcing tasks. However, the proportion of spam workers is continuously increasing due to the openness of crowdsourcing platforms and their incentive mechanisms. To defend against threats from spam workers, researchers have proposed reputation-based and verification-based detection methods, but they either cannot address various collusion patterns or are costly when facing a large number of spam workers with "good" reputations due to collusion. Therefore, we propose a spam worker detection approach based on heterogeneous network embedding. We first model three collusion patterns and analyze the characteristics of spam workers to provide a theoretical basis for detecting spam workers. We then transform the problem of spam worker detection into a node classification problem in a crowdsourcing heterogeneous network in which the vectors of worker nodes are learned using network embedding. To improve the efficiency of network embedding, we propose an improved variable-length random walk algorithm based on node centrality. Finally, based on the obtained vectors of worker nodes, a one-class SVM is used to detect spam workers. The experiments demonstrate that our proposed approach can effectively detect spam workers in different collusion patterns and that the proposed random walk algorithm can reduce the time spent on model training while improving the efficiency of network embedding.
doi_str_mv	10.1016/j.comnet.2020.107587
format	Article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2509630546</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S1389128620312251</els_id><sourcerecordid>2509630546</sourcerecordid><originalsourceid>FETCH-LOGICAL-c400t-bea722e0d6df2dc8bda6405759a14b30ba54918703777bd066ec4ad0682cdd813</originalsourceid><addsrcrecordid>eNp9kM1OwzAQhCMEEqXwBhwscU7ZOI6dXJCqij-pEhc4W469aR2aONgpFW-Po3DmtNZ4Z3f2S5LbDFYZZPy-XWnX9TiuKNBJEkUpzpJFVgqaCuDVeXznZZVmtOSXyVUILQAwRstFYtYkDKojJ-c_0RODI-rRup6oYfBO6T2pVUBDorKPf97tsEd3DCSumzwEuxqNsf2O2J5o704muKPXkzAc1Ng434Xr5KJRh4A3f3WZfDw9vm9e0u3b8-tmvU01AxjTGpWgFMFw01Cjy9oozqAQRaUyVudQq4JV8SjIhRC1Ac5RMxVrSbUxZZYvk7t5boz-dcQwyjZm6eNKSQuoeA4F47GLzV0xbQgeGzl42yn_IzOQE0_ZypmnnHjKmWe0Pcw2jBd8W_QyaIu9RmN9ZCaNs_8P-AXNcIIU</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2509630546</pqid></control><display><type>article</type><title>A spam worker detection approach based on heterogeneous network embedding in crowdsourcing platforms</title><source>Elsevier ScienceDirect Journals</source><creator>Kuang, Li ; Zhang, Huan ; Shi, Ruyi ; Liao, Zhifang ; Yang, Xiaoxian</creator><creatorcontrib>Kuang, Li ; Zhang, Huan ; Shi, Ruyi ; Liao, Zhifang ; Yang, Xiaoxian</creatorcontrib><description>•We model three collusion patterns of spam worker in crowdsourcing platform, and analyze the corresponding characteristics of spam workers.•We convert spam worker detection problem as a node classification problem in a crowdsourcing heterogeneous network, using network embedding and one-class SVM to distinguish spam worker.•We design a variable-length random walk algorithm based on node centrality to improve the efficiency of network embedding. Due to the popularity of crowdsourcing, more crowds are participating in crowdsourcing tasks. However, the proportion of spam workers is continuously increasing due to the openness of crowdsourcing platforms and their incentive mechanisms. To defend against threats from spam workers, researchers have proposed reputation-based and verification-based detection methods, but they either cannot address various collusion patterns or are costly when facing a large number of spam workers with "good" reputations due to collusion. Therefore, we propose a spam worker detection approach based on heterogeneous network embedding. We first model three collusion patterns and analyze the characteristics of spam workers to provide a theoretical basis for detecting spam workers. We then transform the problem of spam worker detection into a node classification problem in a crowdsourcing heterogeneous network in which the vectors of worker nodes are learned using network embedding. To improve the efficiency of network embedding, we propose an improved variable-length random walk algorithm based on node centrality. Finally, based on the obtained vectors of worker nodes, a one-class SVM is used to detect spam workers. The experiments demonstrate that our proposed approach can effectively detect spam workers in different collusion patterns and that the proposed random walk algorithm can reduce the time spent on model training while improving the efficiency of network embedding.</description><identifier>ISSN: 1389-1286</identifier><identifier>EISSN: 1872-7069</identifier><identifier>DOI: 10.1016/j.comnet.2020.107587</identifier><language>eng</language><publisher>Amsterdam: Elsevier B.V</publisher><subject>Algorithms ; Crowdsourcing ; Embedding ; Network embedding ; Nodes ; Platforms ; Random walk ; Spam worker detection ; Workers</subject><ispartof>Computer networks (Amsterdam, Netherlands : 1999), 2020-12, Vol.183, p.107587, Article 107587</ispartof><rights>2020 Elsevier Ltd</rights><rights>Copyright Elsevier Sequoia S.A. Dec 24, 2020</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c400t-bea722e0d6df2dc8bda6405759a14b30ba54918703777bd066ec4ad0682cdd813</citedby><cites>FETCH-LOGICAL-c400t-bea722e0d6df2dc8bda6405759a14b30ba54918703777bd066ec4ad0682cdd813</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.comnet.2020.107587$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,777,781,3537,27905,27906,45976</link.rule.ids></links><search><creatorcontrib>Kuang, Li</creatorcontrib><creatorcontrib>Zhang, Huan</creatorcontrib><creatorcontrib>Shi, Ruyi</creatorcontrib><creatorcontrib>Liao, Zhifang</creatorcontrib><creatorcontrib>Yang, Xiaoxian</creatorcontrib><title>A spam worker detection approach based on heterogeneous network embedding in crowdsourcing platforms</title><title>Computer networks (Amsterdam, Netherlands : 1999)</title><description>•We model three collusion patterns of spam worker in crowdsourcing platform, and analyze the corresponding characteristics of spam workers.•We convert spam worker detection problem as a node classification problem in a crowdsourcing heterogeneous network, using network embedding and one-class SVM to distinguish spam worker.•We design a variable-length random walk algorithm based on node centrality to improve the efficiency of network embedding. Due to the popularity of crowdsourcing, more crowds are participating in crowdsourcing tasks. However, the proportion of spam workers is continuously increasing due to the openness of crowdsourcing platforms and their incentive mechanisms. To defend against threats from spam workers, researchers have proposed reputation-based and verification-based detection methods, but they either cannot address various collusion patterns or are costly when facing a large number of spam workers with "good" reputations due to collusion. Therefore, we propose a spam worker detection approach based on heterogeneous network embedding. We first model three collusion patterns and analyze the characteristics of spam workers to provide a theoretical basis for detecting spam workers. We then transform the problem of spam worker detection into a node classification problem in a crowdsourcing heterogeneous network in which the vectors of worker nodes are learned using network embedding. To improve the efficiency of network embedding, we propose an improved variable-length random walk algorithm based on node centrality. Finally, based on the obtained vectors of worker nodes, a one-class SVM is used to detect spam workers. The experiments demonstrate that our proposed approach can effectively detect spam workers in different collusion patterns and that the proposed random walk algorithm can reduce the time spent on model training while improving the efficiency of network embedding.</description><subject>Algorithms</subject><subject>Crowdsourcing</subject><subject>Embedding</subject><subject>Network embedding</subject><subject>Nodes</subject><subject>Platforms</subject><subject>Random walk</subject><subject>Spam worker detection</subject><subject>Workers</subject><issn>1389-1286</issn><issn>1872-7069</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><recordid>eNp9kM1OwzAQhCMEEqXwBhwscU7ZOI6dXJCqij-pEhc4W469aR2aONgpFW-Po3DmtNZ4Z3f2S5LbDFYZZPy-XWnX9TiuKNBJEkUpzpJFVgqaCuDVeXznZZVmtOSXyVUILQAwRstFYtYkDKojJ-c_0RODI-rRup6oYfBO6T2pVUBDorKPf97tsEd3DCSumzwEuxqNsf2O2J5o704muKPXkzAc1Ng434Xr5KJRh4A3f3WZfDw9vm9e0u3b8-tmvU01AxjTGpWgFMFw01Cjy9oozqAQRaUyVudQq4JV8SjIhRC1Ac5RMxVrSbUxZZYvk7t5boz-dcQwyjZm6eNKSQuoeA4F47GLzV0xbQgeGzl42yn_IzOQE0_ZypmnnHjKmWe0Pcw2jBd8W_QyaIu9RmN9ZCaNs_8P-AXNcIIU</recordid><startdate>20201224</startdate><enddate>20201224</enddate><creator>Kuang, Li</creator><creator>Zhang, Huan</creator><creator>Shi, Ruyi</creator><creator>Liao, Zhifang</creator><creator>Yang, Xiaoxian</creator><general>Elsevier B.V</general><general>Elsevier Sequoia S.A</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>E3H</scope><scope>F2A</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20201224</creationdate><title>A spam worker detection approach based on heterogeneous network embedding in crowdsourcing platforms</title><author>Kuang, Li ; Zhang, Huan ; Shi, Ruyi ; Liao, Zhifang ; Yang, Xiaoxian</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c400t-bea722e0d6df2dc8bda6405759a14b30ba54918703777bd066ec4ad0682cdd813</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Algorithms</topic><topic>Crowdsourcing</topic><topic>Embedding</topic><topic>Network embedding</topic><topic>Nodes</topic><topic>Platforms</topic><topic>Random walk</topic><topic>Spam worker detection</topic><topic>Workers</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Kuang, Li</creatorcontrib><creatorcontrib>Zhang, Huan</creatorcontrib><creatorcontrib>Shi, Ruyi</creatorcontrib><creatorcontrib>Liao, Zhifang</creatorcontrib><creatorcontrib>Yang, Xiaoxian</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>Library & Information Sciences Abstracts (LISA)</collection><collection>Library & Information Science Abstracts (LISA)</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Computer networks (Amsterdam, Netherlands : 1999)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Kuang, Li</au><au>Zhang, Huan</au><au>Shi, Ruyi</au><au>Liao, Zhifang</au><au>Yang, Xiaoxian</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A spam worker detection approach based on heterogeneous network embedding in crowdsourcing platforms</atitle><jtitle>Computer networks (Amsterdam, Netherlands : 1999)</jtitle><date>2020-12-24</date><risdate>2020</risdate><volume>183</volume><spage>107587</spage><pages>107587-</pages><artnum>107587</artnum><issn>1389-1286</issn><eissn>1872-7069</eissn><abstract>•We model three collusion patterns of spam worker in crowdsourcing platform, and analyze the corresponding characteristics of spam workers.•We convert spam worker detection problem as a node classification problem in a crowdsourcing heterogeneous network, using network embedding and one-class SVM to distinguish spam worker.•We design a variable-length random walk algorithm based on node centrality to improve the efficiency of network embedding. Due to the popularity of crowdsourcing, more crowds are participating in crowdsourcing tasks. However, the proportion of spam workers is continuously increasing due to the openness of crowdsourcing platforms and their incentive mechanisms. To defend against threats from spam workers, researchers have proposed reputation-based and verification-based detection methods, but they either cannot address various collusion patterns or are costly when facing a large number of spam workers with "good" reputations due to collusion. Therefore, we propose a spam worker detection approach based on heterogeneous network embedding. We first model three collusion patterns and analyze the characteristics of spam workers to provide a theoretical basis for detecting spam workers. We then transform the problem of spam worker detection into a node classification problem in a crowdsourcing heterogeneous network in which the vectors of worker nodes are learned using network embedding. To improve the efficiency of network embedding, we propose an improved variable-length random walk algorithm based on node centrality. Finally, based on the obtained vectors of worker nodes, a one-class SVM is used to detect spam workers. The experiments demonstrate that our proposed approach can effectively detect spam workers in different collusion patterns and that the proposed random walk algorithm can reduce the time spent on model training while improving the efficiency of network embedding.</abstract><cop>Amsterdam</cop><pub>Elsevier B.V</pub><doi>10.1016/j.comnet.2020.107587</doi></addata></record>
fulltext	fulltext
identifier	ISSN: 1389-1286
ispartof	Computer networks (Amsterdam, Netherlands : 1999), 2020-12, Vol.183, p.107587, Article 107587
issn	1389-1286 1872-7069
language	eng
recordid	cdi_proquest_journals_2509630546
source	Elsevier ScienceDirect Journals
subjects	Algorithms Crowdsourcing Embedding Network embedding Nodes Platforms Random walk Spam worker detection Workers
title	A spam worker detection approach based on heterogeneous network embedding in crowdsourcing platforms
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-20T06%3A35%3A36IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20spam%20worker%20detection%20approach%20based%20on%20heterogeneous%20network%20embedding%20in%20crowdsourcing%20platforms&rft.jtitle=Computer%20networks%20(Amsterdam,%20Netherlands%20:%201999)&rft.au=Kuang,%20Li&rft.date=2020-12-24&rft.volume=183&rft.spage=107587&rft.pages=107587-&rft.artnum=107587&rft.issn=1389-1286&rft.eissn=1872-7069&rft_id=info:doi/10.1016/j.comnet.2020.107587&rft_dat=%3Cproquest_cross%3E2509630546%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2509630546&rft_id=info:pmid/&rft_els_id=S1389128620312251&rfr_iscdi=true