Application research of credit fraud detection based on distributed rotation deep forest

Bibliographic details
Published in: Intelligent data analysis 2024-01, Vol.28 (4), p.1067-1091
Main authors: Chen, Hongwei; Shi, Dewei; Zhou, Xun; Zhang, Man; Liu, Luanxuan
Format: Article
Language: eng
Online access: Full text
container_end_page 1091
container_issue 4
container_start_page 1067
container_title Intelligent data analysis
container_volume 28
creator Chen, Hongwei
Shi, Dewei
Zhou, Xun
Zhang, Man
Liu, Luanxuan
description Credit fraud is a common financial crime that causes significant economic losses to financial institutions. To address this issue, researchers have proposed various fraud detection methods. Recently, research on deep forests has opened up a new path for exploring deep models beyond neural networks. It combines the features of neural networks and ensemble learning, and has achieved good results in various fields. This paper mainly studies the application of deep forests to the field of fraud detection and proposes a distributed dense rotation deep forest algorithm (DRDF-spark) based on the improved RotBoost. The model has three main characteristics: firstly, it solves the problem of multi-granularity scanning due to the lack of spatial correlation in the data by introducing RotBoost. Secondly, Spark is used for parallel construction to improve the processing speed and efficiency of data. Thirdly, a pre-aggregation mechanism is added to the distributed algorithm to locally aggregate the statistical results of sub-forests in the same node in advance to improve communication efficiency. The experiments show that DRDF-spark performs better than deep forests and some mainstream ensemble learning algorithms on the fraud dataset in this paper, and the training speed is up to 3.53 times faster. Furthermore, if the number of nodes is further increased, the speedup ratio will continue to increase.
doi_str_mv 10.3233/IDA-230193
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_3082612101</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sage_id>10.3233_IDA-230193</sage_id><sourcerecordid>3082612101</sourcerecordid><originalsourceid>FETCH-LOGICAL-c180t-c6ad2dafa6a2db2421657f4f06c38792dc3ffc440367c7c72ad12aa4564f2b293</originalsourceid><addsrcrecordid>eNptkE1LAzEQhoMoWKsXf0HAgyCsJpM0uz2W-lUoeFHoLczmQ7fUZk2yB_-90RW8yBzmHXh4Bl5Czjm7FiDEzep2UYFgfC4OyITPal5JDs1hyaxpKqnqzTE5SWnLGJPA5IRsFn2_6wzmLuxpdMlhNG80eGqis12mPuJgqXXZmR-kxeQsLcF2KceuHXI5Y8ijwDrXUx-KJ5-SI4-75M5-95S83N89Lx-r9dPDarlYV4Y3LFdGoQWLHhWCbUECV7PaS8-UEU09B2uE90ZKJlRtygBaDohypqSHFuZiSi5Gbx_Dx1Ae620Y4r681II1oDhwxgt1NVImhpSi87qP3TvGT82Z_m5Ol-b02FyBL0c44av70_1DfgHhWW0_</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3082612101</pqid></control><display><type>article</type><title>Application research of credit fraud detection based on distributed rotation deep forest</title><source>Business Source Complete</source><creator>Chen, Hongwei ; Shi, Dewei ; Zhou, Xun ; Zhang, Man ; Liu, Luanxuan</creator><creatorcontrib>Chen, Hongwei ; Shi, Dewei ; Zhou, Xun ; Zhang, Man ; Liu, Luanxuan</creatorcontrib><description>Credit fraud is a common financial crime that causes significant economic losses to financial institutions. To address this issue, researchers have proposed various fraud detection methods. Recently, research on deep forests has opened up a new path for exploring deep models beyond neural networks. It combines the features of neural networks and ensemble learning, and has achieved good results in various fields. This paper mainly studies the application of deep forests to the field of fraud detection and proposes a distributed dense rotation deep forest algorithm (DRDF-spark) based on the improved RotBoost. 
The model has three main characteristics: firstly, it solves the problem of multi-granularity scanning due to the lack of spatial correlation in the data by introducing RotBoost. Secondly, Spark is used for parallel construction to improve the processing speed and efficiency of data. Thirdly, a pre-aggregation mechanism is added to the distributed algorithm to locally aggregate the statistical results of sub-forests in the same node in advance to improve communication efficiency. The experiments show that DRDF-spark performs better than deep forests and some mainstream ensemble learning algorithms on the fraud dataset in this paper, and the training speed is up to 3.53 times faster. Furthermore, if the number of nodes is further increased, the speedup ratio will continue to increase.</description><identifier>ISSN: 1088-467X</identifier><identifier>EISSN: 1571-4128</identifier><identifier>DOI: 10.3233/IDA-230193</identifier><language>eng</language><publisher>London, England: SAGE Publications</publisher><subject>Algorithms ; Economic impact ; Ensemble learning ; Forests ; Fraud ; Fraud prevention ; Machine learning ; Neural networks ; Rotation ; Spatial data</subject><ispartof>Intelligent data analysis, 2024-01, Vol.28 (4), p.1067-1091</ispartof><rights>2024 – IOS Press. 
All rights reserved.</rights><rights>Copyright IOS Press BV 2024</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c180t-c6ad2dafa6a2db2421657f4f06c38792dc3ffc440367c7c72ad12aa4564f2b293</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27901,27902</link.rule.ids></links><search><creatorcontrib>Chen, Hongwei</creatorcontrib><creatorcontrib>Shi, Dewei</creatorcontrib><creatorcontrib>Zhou, Xun</creatorcontrib><creatorcontrib>Zhang, Man</creatorcontrib><creatorcontrib>Liu, Luanxuan</creatorcontrib><title>Application research of credit fraud detection based on distributed rotation deep forest</title><title>Intelligent data analysis</title><description>Credit fraud is a common financial crime that causes significant economic losses to financial institutions. To address this issue, researchers have proposed various fraud detection methods. Recently, research on deep forests has opened up a new path for exploring deep models beyond neural networks. It combines the features of neural networks and ensemble learning, and has achieved good results in various fields. This paper mainly studies the application of deep forests to the field of fraud detection and proposes a distributed dense rotation deep forest algorithm (DRDF-spark) based on the improved RotBoost. The model has three main characteristics: firstly, it solves the problem of multi-granularity scanning due to the lack of spatial correlation in the data by introducing RotBoost. Secondly, Spark is used for parallel construction to improve the processing speed and efficiency of data. Thirdly, a pre-aggregation mechanism is added to the distributed algorithm to locally aggregate the statistical results of sub-forests in the same node in advance to improve communication efficiency. 
The experiments show that DRDF-spark performs better than deep forests and some mainstream ensemble learning algorithms on the fraud dataset in this paper, and the training speed is up to 3.53 times faster. Furthermore, if the number of nodes is further increased, the speedup ratio will continue to increase.</description><subject>Algorithms</subject><subject>Economic impact</subject><subject>Ensemble learning</subject><subject>Forests</subject><subject>Fraud</subject><subject>Fraud prevention</subject><subject>Machine learning</subject><subject>Neural networks</subject><subject>Rotation</subject><subject>Spatial data</subject><issn>1088-467X</issn><issn>1571-4128</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNptkE1LAzEQhoMoWKsXf0HAgyCsJpM0uz2W-lUoeFHoLczmQ7fUZk2yB_-90RW8yBzmHXh4Bl5Czjm7FiDEzep2UYFgfC4OyITPal5JDs1hyaxpKqnqzTE5SWnLGJPA5IRsFn2_6wzmLuxpdMlhNG80eGqis12mPuJgqXXZmR-kxeQsLcF2KceuHXI5Y8ijwDrXUx-KJ5-SI4-75M5-95S83N89Lx-r9dPDarlYV4Y3LFdGoQWLHhWCbUECV7PaS8-UEU09B2uE90ZKJlRtygBaDohypqSHFuZiSi5Gbx_Dx1Ae620Y4r681II1oDhwxgt1NVImhpSi87qP3TvGT82Z_m5Ol-b02FyBL0c44av70_1DfgHhWW0_</recordid><startdate>20240101</startdate><enddate>20240101</enddate><creator>Chen, Hongwei</creator><creator>Shi, Dewei</creator><creator>Zhou, Xun</creator><creator>Zhang, Man</creator><creator>Liu, Luanxuan</creator><general>SAGE Publications</general><general>IOS Press BV</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20240101</creationdate><title>Application research of credit fraud detection based on distributed rotation deep forest</title><author>Chen, Hongwei ; Shi, Dewei ; Zhou, Xun ; Zhang, Man ; Liu, 
Luanxuan</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c180t-c6ad2dafa6a2db2421657f4f06c38792dc3ffc440367c7c72ad12aa4564f2b293</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Algorithms</topic><topic>Economic impact</topic><topic>Ensemble learning</topic><topic>Forests</topic><topic>Fraud</topic><topic>Fraud prevention</topic><topic>Machine learning</topic><topic>Neural networks</topic><topic>Rotation</topic><topic>Spatial data</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Chen, Hongwei</creatorcontrib><creatorcontrib>Shi, Dewei</creatorcontrib><creatorcontrib>Zhou, Xun</creatorcontrib><creatorcontrib>Zhang, Man</creatorcontrib><creatorcontrib>Liu, Luanxuan</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Intelligent data analysis</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Chen, Hongwei</au><au>Shi, Dewei</au><au>Zhou, Xun</au><au>Zhang, Man</au><au>Liu, Luanxuan</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Application research of credit fraud detection based on distributed rotation deep forest</atitle><jtitle>Intelligent data analysis</jtitle><date>2024-01-01</date><risdate>2024</risdate><volume>28</volume><issue>4</issue><spage>1067</spage><epage>1091</epage><pages>1067-1091</pages><issn>1088-467X</issn><eissn>1571-4128</eissn><abstract>Credit fraud is a common financial 
crime that causes significant economic losses to financial institutions. To address this issue, researchers have proposed various fraud detection methods. Recently, research on deep forests has opened up a new path for exploring deep models beyond neural networks. It combines the features of neural networks and ensemble learning, and has achieved good results in various fields. This paper mainly studies the application of deep forests to the field of fraud detection and proposes a distributed dense rotation deep forest algorithm (DRDF-spark) based on the improved RotBoost. The model has three main characteristics: firstly, it solves the problem of multi-granularity scanning due to the lack of spatial correlation in the data by introducing RotBoost. Secondly, Spark is used for parallel construction to improve the processing speed and efficiency of data. Thirdly, a pre-aggregation mechanism is added to the distributed algorithm to locally aggregate the statistical results of sub-forests in the same node in advance to improve communication efficiency. The experiments show that DRDF-spark performs better than deep forests and some mainstream ensemble learning algorithms on the fraud dataset in this paper, and the training speed is up to 3.53 times faster. Furthermore, if the number of nodes is further increased, the speedup ratio will continue to increase.</abstract><cop>London, England</cop><pub>SAGE Publications</pub><doi>10.3233/IDA-230193</doi><tpages>25</tpages></addata></record>
fulltext fulltext
identifier ISSN: 1088-467X
ispartof Intelligent data analysis, 2024-01, Vol.28 (4), p.1067-1091
issn 1088-467X
1571-4128
language eng
recordid cdi_proquest_journals_3082612101
source Business Source Complete
subjects Algorithms
Economic impact
Ensemble learning
Forests
Fraud
Fraud prevention
Machine learning
Neural networks
Rotation
Spatial data
title Application research of credit fraud detection based on distributed rotation deep forest
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-31T12%3A05%3A55IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Application%20research%20of%20credit%20fraud%20detection%20based%20on%20distributed%20rotation%20deep%20forest&rft.jtitle=Intelligent%20data%20analysis&rft.au=Chen,%20Hongwei&rft.date=2024-01-01&rft.volume=28&rft.issue=4&rft.spage=1067&rft.epage=1091&rft.pages=1067-1091&rft.issn=1088-467X&rft.eissn=1571-4128&rft_id=info:doi/10.3233/IDA-230193&rft_dat=%3Cproquest_cross%3E3082612101%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3082612101&rft_id=info:pmid/&rft_sage_id=10.3233_IDA-230193&rfr_iscdi=true