EID-GAN: Generative Adversarial Nets for Extremely Imbalanced Data Augmentation

Imbalanced data cause deep neural networks to output biased results, and it becomes more serious when facing extremely imbalanced data regarding the outliers with tiny size (the ratio of the outlier size to the image size is around 0.05%). Many data argumentation models are proposed to supplement im...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE transactions on industrial informatics 2023-03, Vol.19 (3), p.3208-3218
Hauptverfasser:	Li, Wei, Chen, Jinlin, Cao, Jiannong, Ma, Chao, Wang, Jia, Cui, Xiaohui, Chen, Ping
Format:	Artikel
Sprache:	eng
Schlagworte:	Artificial neural networks Back propagation Back propagation networks Data augmentation Data models Datasets Detectors Extremely imbalanced data augmentation Fabrics generated data evaluation generative adversarial net (GAN) Generative adversarial networks Generators norm penalty function Outliers (statistics) Penalty function Pistons Prototypes Training
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	3218
container_issue	3
container_start_page	3208
container_title	IEEE transactions on industrial informatics
container_volume	19
creator	Li, Wei Chen, Jinlin Cao, Jiannong Ma, Chao Wang, Jia Cui, Xiaohui Chen, Ping
description	Imbalanced data cause deep neural networks to output biased results, and it becomes more serious when facing extremely imbalanced data regarding the outliers with tiny size (the ratio of the outlier size to the image size is around 0.05%). Many data argumentation models are proposed to supplement imbalanced data to alleviate biased results. However, the existing augmentation models cannot synthesize tiny outliers, which make the generated data unavailable. In this article, we propose a new augmentation model named extremely imbalanced data augmentation generative adversarial nets (EID-GANs) to address the extremely imbalanced data augmentation problem. First, we design a new penalty function by subtracting the outliers from the cropped region of generated instance to guide the generator to learn the features of outliers. After this, we combine the output value of the penalty function with the generator loss to jointly update the generator's parameters with backpropagation. Second, we propose a new evaluation approach that adopts two outlier detectors with k -fold cross-validation to assess the availability of generated instances. We conduct extensive experiments to demonstrate the significant performance improvement of EID-GAN on two extremely imbalanced datasets, which are the industrial Piston and the Fabric datasets, and one general imbalanced dataset, i.e., the public DAGM dataset. The experimental results show that our EID-GAN outperforms the state-of-the-art (SOTA) augmentation models on different imbalanced datasets.
doi_str_mv	10.1109/TII.2022.3182781
format	Article
fullrecord	<record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_proquest_journals_2784633378</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>9795891</ieee_id><sourcerecordid>2784633378</sourcerecordid><originalsourceid>FETCH-LOGICAL-c291t-e10ff3f269ff68028a902ec7717efc85c8182b059a100d6ca774b3252a9d16033</originalsourceid><addsrcrecordid>eNo9kE1PAjEQhhujiYjeTbw08bw4bdlt640A4iYELnhuyu7ULNkPbBci_94SiKeZw_vM5H0IeWYwYgz02ybPRxw4HwmmuFTshgyYHrMEIIXbuKcpSwQHcU8eQtgBCAlCD8h6ns-SxWT1ThfYord9dUQ6KY_og_WVrekK-0Bd5-n8t_fYYH2iebO1tW0LLOnM9pZODt8Ntn1ku_aR3DlbB3y6ziH5-phvpp_Jcr3Ip5NlUnDN-gQZOCccz7RzmQKurAaOhZRMoitUWqjYYguptgygzAor5XgreMqtLlkGQgzJ6-Xu3nc_Bwy92XUH38aXJrYfZ0IIqWIKLqnCdyF4dGbvq8b6k2FgztpM1GbO2sxVW0ReLkiFiP9xLXWqNBN_KSNm0Q</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2784633378</pqid></control><display><type>article</type><title>EID-GAN: Generative Adversarial Nets for Extremely Imbalanced Data Augmentation</title><source>IEEE Electronic Library (IEL)</source><creator>Li, Wei ; Chen, Jinlin ; Cao, Jiannong ; Ma, Chao ; Wang, Jia ; Cui, Xiaohui ; Chen, Ping</creator><creatorcontrib>Li, Wei ; Chen, Jinlin ; Cao, Jiannong ; Ma, Chao ; Wang, Jia ; Cui, Xiaohui ; Chen, Ping</creatorcontrib><description>Imbalanced data cause deep neural networks to output biased results, and it becomes more serious when facing extremely imbalanced data regarding the outliers with tiny size (the ratio of the outlier size to the image size is around 0.05%). Many data argumentation models are proposed to supplement imbalanced data to alleviate biased results. However, the existing augmentation models cannot synthesize tiny outliers, which make the generated data unavailable. In this article, we propose a new augmentation model named extremely imbalanced data augmentation generative adversarial nets (EID-GANs) to address the extremely imbalanced data augmentation problem. First, we design a new penalty function by subtracting the outliers from the cropped region of generated instance to guide the generator to learn the features of outliers. After this, we combine the output value of the penalty function with the generator loss to jointly update the generator's parameters with backpropagation. Second, we propose a new evaluation approach that adopts two outlier detectors with k -fold cross-validation to assess the availability of generated instances. We conduct extensive experiments to demonstrate the significant performance improvement of EID-GAN on two extremely imbalanced datasets, which are the industrial Piston and the Fabric datasets, and one general imbalanced dataset, i.e., the public DAGM dataset. The experimental results show that our EID-GAN outperforms the state-of-the-art (SOTA) augmentation models on different imbalanced datasets.</description><identifier>ISSN: 1551-3203</identifier><identifier>EISSN: 1941-0050</identifier><identifier>DOI: 10.1109/TII.2022.3182781</identifier><identifier>CODEN: ITIICH</identifier><language>eng</language><publisher>Piscataway: IEEE</publisher><subject>Artificial neural networks ; Back propagation ; Back propagation networks ; Data augmentation ; Data models ; Datasets ; Detectors ; Extremely imbalanced data augmentation ; Fabrics ; generated data evaluation ; generative adversarial net (GAN) ; Generative adversarial networks ; Generators ; norm penalty function ; Outliers (statistics) ; Penalty function ; Pistons ; Prototypes ; Training</subject><ispartof>IEEE transactions on industrial informatics, 2023-03, Vol.19 (3), p.3208-3218</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2023</rights><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c291t-e10ff3f269ff68028a902ec7717efc85c8182b059a100d6ca774b3252a9d16033</citedby><cites>FETCH-LOGICAL-c291t-e10ff3f269ff68028a902ec7717efc85c8182b059a100d6ca774b3252a9d16033</cites><orcidid>0000-0001-6079-009X ; 0000-0003-3789-7686 ; 0000-0002-2725-2529 ; 0000-0002-7443-6267 ; 0000-0002-3135-0447 ; 0000-0003-3923-8844</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/9795891$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,776,780,792,27903,27904,54736</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/9795891$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Li, Wei</creatorcontrib><creatorcontrib>Chen, Jinlin</creatorcontrib><creatorcontrib>Cao, Jiannong</creatorcontrib><creatorcontrib>Ma, Chao</creatorcontrib><creatorcontrib>Wang, Jia</creatorcontrib><creatorcontrib>Cui, Xiaohui</creatorcontrib><creatorcontrib>Chen, Ping</creatorcontrib><title>EID-GAN: Generative Adversarial Nets for Extremely Imbalanced Data Augmentation</title><title>IEEE transactions on industrial informatics</title><addtitle>TII</addtitle><description>Imbalanced data cause deep neural networks to output biased results, and it becomes more serious when facing extremely imbalanced data regarding the outliers with tiny size (the ratio of the outlier size to the image size is around 0.05%). Many data argumentation models are proposed to supplement imbalanced data to alleviate biased results. However, the existing augmentation models cannot synthesize tiny outliers, which make the generated data unavailable. In this article, we propose a new augmentation model named extremely imbalanced data augmentation generative adversarial nets (EID-GANs) to address the extremely imbalanced data augmentation problem. First, we design a new penalty function by subtracting the outliers from the cropped region of generated instance to guide the generator to learn the features of outliers. After this, we combine the output value of the penalty function with the generator loss to jointly update the generator's parameters with backpropagation. Second, we propose a new evaluation approach that adopts two outlier detectors with k -fold cross-validation to assess the availability of generated instances. We conduct extensive experiments to demonstrate the significant performance improvement of EID-GAN on two extremely imbalanced datasets, which are the industrial Piston and the Fabric datasets, and one general imbalanced dataset, i.e., the public DAGM dataset. The experimental results show that our EID-GAN outperforms the state-of-the-art (SOTA) augmentation models on different imbalanced datasets.</description><subject>Artificial neural networks</subject><subject>Back propagation</subject><subject>Back propagation networks</subject><subject>Data augmentation</subject><subject>Data models</subject><subject>Datasets</subject><subject>Detectors</subject><subject>Extremely imbalanced data augmentation</subject><subject>Fabrics</subject><subject>generated data evaluation</subject><subject>generative adversarial net (GAN)</subject><subject>Generative adversarial networks</subject><subject>Generators</subject><subject>norm penalty function</subject><subject>Outliers (statistics)</subject><subject>Penalty function</subject><subject>Pistons</subject><subject>Prototypes</subject><subject>Training</subject><issn>1551-3203</issn><issn>1941-0050</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNo9kE1PAjEQhhujiYjeTbw08bw4bdlt640A4iYELnhuyu7ULNkPbBci_94SiKeZw_vM5H0IeWYwYgz02ybPRxw4HwmmuFTshgyYHrMEIIXbuKcpSwQHcU8eQtgBCAlCD8h6ns-SxWT1ThfYord9dUQ6KY_og_WVrekK-0Bd5-n8t_fYYH2iebO1tW0LLOnM9pZODt8Ntn1ku_aR3DlbB3y6ziH5-phvpp_Jcr3Ip5NlUnDN-gQZOCccz7RzmQKurAaOhZRMoitUWqjYYguptgygzAor5XgreMqtLlkGQgzJ6-Xu3nc_Bwy92XUH38aXJrYfZ0IIqWIKLqnCdyF4dGbvq8b6k2FgztpM1GbO2sxVW0ReLkiFiP9xLXWqNBN_KSNm0Q</recordid><startdate>20230301</startdate><enddate>20230301</enddate><creator>Li, Wei</creator><creator>Chen, Jinlin</creator><creator>Cao, Jiannong</creator><creator>Ma, Chao</creator><creator>Wang, Jia</creator><creator>Cui, Xiaohui</creator><creator>Chen, Ping</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0001-6079-009X</orcidid><orcidid>https://orcid.org/0000-0003-3789-7686</orcidid><orcidid>https://orcid.org/0000-0002-2725-2529</orcidid><orcidid>https://orcid.org/0000-0002-7443-6267</orcidid><orcidid>https://orcid.org/0000-0002-3135-0447</orcidid><orcidid>https://orcid.org/0000-0003-3923-8844</orcidid></search><sort><creationdate>20230301</creationdate><title>EID-GAN: Generative Adversarial Nets for Extremely Imbalanced Data Augmentation</title><author>Li, Wei ; Chen, Jinlin ; Cao, Jiannong ; Ma, Chao ; Wang, Jia ; Cui, Xiaohui ; Chen, Ping</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c291t-e10ff3f269ff68028a902ec7717efc85c8182b059a100d6ca774b3252a9d16033</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Artificial neural networks</topic><topic>Back propagation</topic><topic>Back propagation networks</topic><topic>Data augmentation</topic><topic>Data models</topic><topic>Datasets</topic><topic>Detectors</topic><topic>Extremely imbalanced data augmentation</topic><topic>Fabrics</topic><topic>generated data evaluation</topic><topic>generative adversarial net (GAN)</topic><topic>Generative adversarial networks</topic><topic>Generators</topic><topic>norm penalty function</topic><topic>Outliers (statistics)</topic><topic>Penalty function</topic><topic>Pistons</topic><topic>Prototypes</topic><topic>Training</topic><toplevel>online_resources</toplevel><creatorcontrib>Li, Wei</creatorcontrib><creatorcontrib>Chen, Jinlin</creatorcontrib><creatorcontrib>Cao, Jiannong</creatorcontrib><creatorcontrib>Ma, Chao</creatorcontrib><creatorcontrib>Wang, Jia</creatorcontrib><creatorcontrib>Cui, Xiaohui</creatorcontrib><creatorcontrib>Chen, Ping</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>IEEE transactions on industrial informatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Li, Wei</au><au>Chen, Jinlin</au><au>Cao, Jiannong</au><au>Ma, Chao</au><au>Wang, Jia</au><au>Cui, Xiaohui</au><au>Chen, Ping</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>EID-GAN: Generative Adversarial Nets for Extremely Imbalanced Data Augmentation</atitle><jtitle>IEEE transactions on industrial informatics</jtitle><stitle>TII</stitle><date>2023-03-01</date><risdate>2023</risdate><volume>19</volume><issue>3</issue><spage>3208</spage><epage>3218</epage><pages>3208-3218</pages><issn>1551-3203</issn><eissn>1941-0050</eissn><coden>ITIICH</coden><abstract>Imbalanced data cause deep neural networks to output biased results, and it becomes more serious when facing extremely imbalanced data regarding the outliers with tiny size (the ratio of the outlier size to the image size is around 0.05%). Many data argumentation models are proposed to supplement imbalanced data to alleviate biased results. However, the existing augmentation models cannot synthesize tiny outliers, which make the generated data unavailable. In this article, we propose a new augmentation model named extremely imbalanced data augmentation generative adversarial nets (EID-GANs) to address the extremely imbalanced data augmentation problem. First, we design a new penalty function by subtracting the outliers from the cropped region of generated instance to guide the generator to learn the features of outliers. After this, we combine the output value of the penalty function with the generator loss to jointly update the generator's parameters with backpropagation. Second, we propose a new evaluation approach that adopts two outlier detectors with k -fold cross-validation to assess the availability of generated instances. We conduct extensive experiments to demonstrate the significant performance improvement of EID-GAN on two extremely imbalanced datasets, which are the industrial Piston and the Fabric datasets, and one general imbalanced dataset, i.e., the public DAGM dataset. The experimental results show that our EID-GAN outperforms the state-of-the-art (SOTA) augmentation models on different imbalanced datasets.</abstract><cop>Piscataway</cop><pub>IEEE</pub><doi>10.1109/TII.2022.3182781</doi><tpages>11</tpages><orcidid>https://orcid.org/0000-0001-6079-009X</orcidid><orcidid>https://orcid.org/0000-0003-3789-7686</orcidid><orcidid>https://orcid.org/0000-0002-2725-2529</orcidid><orcidid>https://orcid.org/0000-0002-7443-6267</orcidid><orcidid>https://orcid.org/0000-0002-3135-0447</orcidid><orcidid>https://orcid.org/0000-0003-3923-8844</orcidid></addata></record>
fulltext	fulltext_linktorsrc
identifier	ISSN: 1551-3203
ispartof	IEEE transactions on industrial informatics, 2023-03, Vol.19 (3), p.3208-3218
issn	1551-3203 1941-0050
language	eng
recordid	cdi_proquest_journals_2784633378
source	IEEE Electronic Library (IEL)
subjects	Artificial neural networks Back propagation Back propagation networks Data augmentation Data models Datasets Detectors Extremely imbalanced data augmentation Fabrics generated data evaluation generative adversarial net (GAN) Generative adversarial networks Generators norm penalty function Outliers (statistics) Penalty function Pistons Prototypes Training
title	EID-GAN: Generative Adversarial Nets for Extremely Imbalanced Data Augmentation
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-27T17%3A01%3A21IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=EID-GAN:%20Generative%20Adversarial%20Nets%20for%20Extremely%20Imbalanced%20Data%20Augmentation&rft.jtitle=IEEE%20transactions%20on%20industrial%20informatics&rft.au=Li,%20Wei&rft.date=2023-03-01&rft.volume=19&rft.issue=3&rft.spage=3208&rft.epage=3218&rft.pages=3208-3218&rft.issn=1551-3203&rft.eissn=1941-0050&rft.coden=ITIICH&rft_id=info:doi/10.1109/TII.2022.3182781&rft_dat=%3Cproquest_RIE%3E2784633378%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2784633378&rft_id=info:pmid/&rft_ieee_id=9795891&rfr_iscdi=true