Application research of credit fraud detection based on distributed rotation deep forest
Credit fraud is a common financial crime that causes significant economic losses to financial institutions. To address this issue, researchers have proposed various fraud detection methods. Recently, research on deep forests has opened up a new path for exploring deep models beyond neural networks....
Gespeichert in:
Veröffentlicht in: | Intelligent data analysis 2024-01, Vol.28 (4), p.1067-1091 |
---|---|
Hauptverfasser: | , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 1091 |
---|---|
container_issue | 4 |
container_start_page | 1067 |
container_title | Intelligent data analysis |
container_volume | 28 |
creator | Chen, Hongwei Shi, Dewei Zhou, Xun Zhang, Man Liu, Luanxuan |
description | Credit fraud is a common financial crime that causes significant economic losses to financial institutions. To address this issue, researchers have proposed various fraud detection methods. Recently, research on deep forests has opened up a new path for exploring deep models beyond neural networks. It combines the features of neural networks and ensemble learning, and has achieved good results in various fields. This paper mainly studies the application of deep forests to the field of fraud detection and proposes a distributed dense rotation deep forest algorithm (DRDF-spark) based on the improved RotBoost. The model has three main characteristics: firstly, it solves the problem of multi-granularity scanning due to the lack of spatial correlation in the data by introducing RotBoost. Secondly, Spark is used for parallel construction to improve the processing speed and efficiency of data. Thirdly, a pre-aggregation mechanism is added to the distributed algorithm to locally aggregate the statistical results of sub-forests in the same node in advance to improve communication efficiency. The experiments show that DRDF-spark performs better than deep forests and some mainstream ensemble learning algorithms on the fraud dataset in this paper, and the training speed is up to 3.53 times faster. Furthermore, if the number of nodes is further increased, the speedup ratio will continue to increase. |
doi_str_mv | 10.3233/IDA-230193 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_3082612101</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sage_id>10.3233_IDA-230193</sage_id><sourcerecordid>3082612101</sourcerecordid><originalsourceid>FETCH-LOGICAL-c180t-c6ad2dafa6a2db2421657f4f06c38792dc3ffc440367c7c72ad12aa4564f2b293</originalsourceid><addsrcrecordid>eNptkE1LAzEQhoMoWKsXf0HAgyCsJpM0uz2W-lUoeFHoLczmQ7fUZk2yB_-90RW8yBzmHXh4Bl5Czjm7FiDEzep2UYFgfC4OyITPal5JDs1hyaxpKqnqzTE5SWnLGJPA5IRsFn2_6wzmLuxpdMlhNG80eGqis12mPuJgqXXZmR-kxeQsLcF2KceuHXI5Y8ijwDrXUx-KJ5-SI4-75M5-95S83N89Lx-r9dPDarlYV4Y3LFdGoQWLHhWCbUECV7PaS8-UEU09B2uE90ZKJlRtygBaDohypqSHFuZiSi5Gbx_Dx1Ae620Y4r681II1oDhwxgt1NVImhpSi87qP3TvGT82Z_m5Ol-b02FyBL0c44av70_1DfgHhWW0_</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3082612101</pqid></control><display><type>article</type><title>Application research of credit fraud detection based on distributed rotation deep forest</title><source>Business Source Complete</source><creator>Chen, Hongwei ; Shi, Dewei ; Zhou, Xun ; Zhang, Man ; Liu, Luanxuan</creator><creatorcontrib>Chen, Hongwei ; Shi, Dewei ; Zhou, Xun ; Zhang, Man ; Liu, Luanxuan</creatorcontrib><description>Credit fraud is a common financial crime that causes significant economic losses to financial institutions. To address this issue, researchers have proposed various fraud detection methods. Recently, research on deep forests has opened up a new path for exploring deep models beyond neural networks. It combines the features of neural networks and ensemble learning, and has achieved good results in various fields. This paper mainly studies the application of deep forests to the field of fraud detection and proposes a distributed dense rotation deep forest algorithm (DRDF-spark) based on the improved RotBoost. The model has three main characteristics: firstly, it solves the problem of multi-granularity scanning due to the lack of spatial correlation in the data by introducing RotBoost. Secondly, Spark is used for parallel construction to improve the processing speed and efficiency of data. Thirdly, a pre-aggregation mechanism is added to the distributed algorithm to locally aggregate the statistical results of sub-forests in the same node in advance to improve communication efficiency. The experiments show that DRDF-spark performs better than deep forests and some mainstream ensemble learning algorithms on the fraud dataset in this paper, and the training speed is up to 3.53 times faster. Furthermore, if the number of nodes is further increased, the speedup ratio will continue to increase.</description><identifier>ISSN: 1088-467X</identifier><identifier>EISSN: 1571-4128</identifier><identifier>DOI: 10.3233/IDA-230193</identifier><language>eng</language><publisher>London, England: SAGE Publications</publisher><subject>Algorithms ; Economic impact ; Ensemble learning ; Forests ; Fraud ; Fraud prevention ; Machine learning ; Neural networks ; Rotation ; Spatial data</subject><ispartof>Intelligent data analysis, 2024-01, Vol.28 (4), p.1067-1091</ispartof><rights>2024 – IOS Press. All rights reserved.</rights><rights>Copyright IOS Press BV 2024</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c180t-c6ad2dafa6a2db2421657f4f06c38792dc3ffc440367c7c72ad12aa4564f2b293</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27901,27902</link.rule.ids></links><search><creatorcontrib>Chen, Hongwei</creatorcontrib><creatorcontrib>Shi, Dewei</creatorcontrib><creatorcontrib>Zhou, Xun</creatorcontrib><creatorcontrib>Zhang, Man</creatorcontrib><creatorcontrib>Liu, Luanxuan</creatorcontrib><title>Application research of credit fraud detection based on distributed rotation deep forest</title><title>Intelligent data analysis</title><description>Credit fraud is a common financial crime that causes significant economic losses to financial institutions. To address this issue, researchers have proposed various fraud detection methods. Recently, research on deep forests has opened up a new path for exploring deep models beyond neural networks. It combines the features of neural networks and ensemble learning, and has achieved good results in various fields. This paper mainly studies the application of deep forests to the field of fraud detection and proposes a distributed dense rotation deep forest algorithm (DRDF-spark) based on the improved RotBoost. The model has three main characteristics: firstly, it solves the problem of multi-granularity scanning due to the lack of spatial correlation in the data by introducing RotBoost. Secondly, Spark is used for parallel construction to improve the processing speed and efficiency of data. Thirdly, a pre-aggregation mechanism is added to the distributed algorithm to locally aggregate the statistical results of sub-forests in the same node in advance to improve communication efficiency. The experiments show that DRDF-spark performs better than deep forests and some mainstream ensemble learning algorithms on the fraud dataset in this paper, and the training speed is up to 3.53 times faster. Furthermore, if the number of nodes is further increased, the speedup ratio will continue to increase.</description><subject>Algorithms</subject><subject>Economic impact</subject><subject>Ensemble learning</subject><subject>Forests</subject><subject>Fraud</subject><subject>Fraud prevention</subject><subject>Machine learning</subject><subject>Neural networks</subject><subject>Rotation</subject><subject>Spatial data</subject><issn>1088-467X</issn><issn>1571-4128</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNptkE1LAzEQhoMoWKsXf0HAgyCsJpM0uz2W-lUoeFHoLczmQ7fUZk2yB_-90RW8yBzmHXh4Bl5Czjm7FiDEzep2UYFgfC4OyITPal5JDs1hyaxpKqnqzTE5SWnLGJPA5IRsFn2_6wzmLuxpdMlhNG80eGqis12mPuJgqXXZmR-kxeQsLcF2KceuHXI5Y8ijwDrXUx-KJ5-SI4-75M5-95S83N89Lx-r9dPDarlYV4Y3LFdGoQWLHhWCbUECV7PaS8-UEU09B2uE90ZKJlRtygBaDohypqSHFuZiSi5Gbx_Dx1Ae620Y4r681II1oDhwxgt1NVImhpSi87qP3TvGT82Z_m5Ol-b02FyBL0c44av70_1DfgHhWW0_</recordid><startdate>20240101</startdate><enddate>20240101</enddate><creator>Chen, Hongwei</creator><creator>Shi, Dewei</creator><creator>Zhou, Xun</creator><creator>Zhang, Man</creator><creator>Liu, Luanxuan</creator><general>SAGE Publications</general><general>IOS Press BV</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20240101</creationdate><title>Application research of credit fraud detection based on distributed rotation deep forest</title><author>Chen, Hongwei ; Shi, Dewei ; Zhou, Xun ; Zhang, Man ; Liu, Luanxuan</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c180t-c6ad2dafa6a2db2421657f4f06c38792dc3ffc440367c7c72ad12aa4564f2b293</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Algorithms</topic><topic>Economic impact</topic><topic>Ensemble learning</topic><topic>Forests</topic><topic>Fraud</topic><topic>Fraud prevention</topic><topic>Machine learning</topic><topic>Neural networks</topic><topic>Rotation</topic><topic>Spatial data</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Chen, Hongwei</creatorcontrib><creatorcontrib>Shi, Dewei</creatorcontrib><creatorcontrib>Zhou, Xun</creatorcontrib><creatorcontrib>Zhang, Man</creatorcontrib><creatorcontrib>Liu, Luanxuan</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Intelligent data analysis</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Chen, Hongwei</au><au>Shi, Dewei</au><au>Zhou, Xun</au><au>Zhang, Man</au><au>Liu, Luanxuan</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Application research of credit fraud detection based on distributed rotation deep forest</atitle><jtitle>Intelligent data analysis</jtitle><date>2024-01-01</date><risdate>2024</risdate><volume>28</volume><issue>4</issue><spage>1067</spage><epage>1091</epage><pages>1067-1091</pages><issn>1088-467X</issn><eissn>1571-4128</eissn><abstract>Credit fraud is a common financial crime that causes significant economic losses to financial institutions. To address this issue, researchers have proposed various fraud detection methods. Recently, research on deep forests has opened up a new path for exploring deep models beyond neural networks. It combines the features of neural networks and ensemble learning, and has achieved good results in various fields. This paper mainly studies the application of deep forests to the field of fraud detection and proposes a distributed dense rotation deep forest algorithm (DRDF-spark) based on the improved RotBoost. The model has three main characteristics: firstly, it solves the problem of multi-granularity scanning due to the lack of spatial correlation in the data by introducing RotBoost. Secondly, Spark is used for parallel construction to improve the processing speed and efficiency of data. Thirdly, a pre-aggregation mechanism is added to the distributed algorithm to locally aggregate the statistical results of sub-forests in the same node in advance to improve communication efficiency. The experiments show that DRDF-spark performs better than deep forests and some mainstream ensemble learning algorithms on the fraud dataset in this paper, and the training speed is up to 3.53 times faster. Furthermore, if the number of nodes is further increased, the speedup ratio will continue to increase.</abstract><cop>London, England</cop><pub>SAGE Publications</pub><doi>10.3233/IDA-230193</doi><tpages>25</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1088-467X |
ispartof | Intelligent data analysis, 2024-01, Vol.28 (4), p.1067-1091 |
issn | 1088-467X 1571-4128 |
language | eng |
recordid | cdi_proquest_journals_3082612101 |
source | Business Source Complete |
subjects | Algorithms Economic impact Ensemble learning Forests Fraud Fraud prevention Machine learning Neural networks Rotation Spatial data |
title | Application research of credit fraud detection based on distributed rotation deep forest |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-31T12%3A05%3A55IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Application%20research%20of%20credit%20fraud%20detection%20based%20on%20distributed%20rotation%20deep%20forest&rft.jtitle=Intelligent%20data%20analysis&rft.au=Chen,%20Hongwei&rft.date=2024-01-01&rft.volume=28&rft.issue=4&rft.spage=1067&rft.epage=1091&rft.pages=1067-1091&rft.issn=1088-467X&rft.eissn=1571-4128&rft_id=info:doi/10.3233/IDA-230193&rft_dat=%3Cproquest_cross%3E3082612101%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3082612101&rft_id=info:pmid/&rft_sage_id=10.3233_IDA-230193&rfr_iscdi=true |