A novel twin-support vector machine for binary classification to imbalanced data

PurposeBinary classification on imbalanced data is a challenge; due to the imbalance of the classes, the minority class is easily masked by the majority class. However, most existing classifiers are better at identifying the majority class, thereby ignoring the minority class, which leads to classif...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Data technologies and applications 2023-06, Vol.57 (3), p.385-396
Hauptverfasser:	Li, Jingyi, Chao, Shiwei
Format:	Artikel
Sprache:	eng
Schlagworte:	Accuracy Artificial Intelligence Boundaries Classification Classifiers Complexity Datasets Lagrange multiplier Machine learning Methods Neural networks Programming Sampling Support vector machines
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	396
container_issue	3
container_start_page	385
container_title	Data technologies and applications
container_volume	57
creator	Li, Jingyi Chao, Shiwei
description	PurposeBinary classification on imbalanced data is a challenge; due to the imbalance of the classes, the minority class is easily masked by the majority class. However, most existing classifiers are better at identifying the majority class, thereby ignoring the minority class, which leads to classifier degradation. To address this, this paper proposes a twin-support vector machines for binary classification on imbalanced data.Design/methodology/approachIn the proposed method, the authors construct two support vector machines to focus on majority classes and minority classes, respectively. In order to promote the learning ability of the two support vector machines, a new kernel is derived for them.Findings(1) A novel twin-support vector machine is proposed for binary classification on imbalanced data, and new kernels are derived. (2) For imbalanced data, the complexity of data distribution has negative effects on classification results; however, advanced classification results can be gained and desired boundaries are learned by using optimizing kernels. (3) Classifiers based on twin architectures have more advantages than those based on single architecture for binary classification on imbalanced data.Originality/valueFor imbalanced data, the complexity of data distribution has negative effects on classification results; however, advanced classification results can be gained and desired boundaries are learned through using optimizing kernels.
doi_str_mv	10.1108/DTA-08-2022-0302
format	Article
fullrecord	<record><control><sourceid>proquest_emera</sourceid><recordid>TN_cdi_emerald_primary_10_1108_DTA-08-2022-0302</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2920179993</sourcerecordid><originalsourceid>FETCH-LOGICAL-c264t-e05160c2f538191f72b7161d2343dbf10586b332104ce02a531ca651b2d127b03</originalsourceid><addsrcrecordid>eNptkEtPwzAQhC0EElXpnaMlzqa76zhxjlV5SpXgUM6W4zgiVV7YaRH_nlSFAxKnmcPMrPZj7BrhFhH08m67EqAFAZEACXTGZqQwEblEff7rSetLtohxBwAEKpNazdjrinf9wTd8_Kw7EffD0IeRH7wb-8Bb697rzvNq8kXd2fDFXWNjrKva2bHuOz72vG4L29jO-ZKXdrRX7KKyTfSLH52zt4f77fpJbF4en9erjXCUJqPwoDAFR5WSGnOsMioyTLEkmciyqBCUTgspCSFxHsgqic6mCgsqkbIC5JzdnHaH0H_sfRzNrt-HbjppKCfALM9zOaXglHKhjzH4ygyhbqdHDII5ojMTOjPJEZ05opsqy1PFtz7Ypvyv8Qe2_Ab4Am2H</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2920179993</pqid></control><display><type>article</type><title>A novel twin-support vector machine for binary classification to imbalanced data</title><source>Standard: Emerald eJournal Premier Collection</source><creator>Li, Jingyi ; Chao, Shiwei</creator><creatorcontrib>Li, Jingyi ; Chao, Shiwei</creatorcontrib><description>PurposeBinary classification on imbalanced data is a challenge; due to the imbalance of the classes, the minority class is easily masked by the majority class. However, most existing classifiers are better at identifying the majority class, thereby ignoring the minority class, which leads to classifier degradation. To address this, this paper proposes a twin-support vector machines for binary classification on imbalanced data.Design/methodology/approachIn the proposed method, the authors construct two support vector machines to focus on majority classes and minority classes, respectively. In order to promote the learning ability of the two support vector machines, a new kernel is derived for them.Findings(1) A novel twin-support vector machine is proposed for binary classification on imbalanced data, and new kernels are derived. (2) For imbalanced data, the complexity of data distribution has negative effects on classification results; however, advanced classification results can be gained and desired boundaries are learned by using optimizing kernels. (3) Classifiers based on twin architectures have more advantages than those based on single architecture for binary classification on imbalanced data.Originality/valueFor imbalanced data, the complexity of data distribution has negative effects on classification results; however, advanced classification results can be gained and desired boundaries are learned through using optimizing kernels.</description><identifier>ISSN: 2514-9288</identifier><identifier>EISSN: 2514-9318</identifier><identifier>EISSN: 2514-9288</identifier><identifier>DOI: 10.1108/DTA-08-2022-0302</identifier><language>eng</language><publisher>Bingley: Emerald Publishing Limited</publisher><subject>Accuracy ; Artificial Intelligence ; Boundaries ; Classification ; Classifiers ; Complexity ; Datasets ; Lagrange multiplier ; Machine learning ; Methods ; Neural networks ; Programming ; Sampling ; Support vector machines</subject><ispartof>Data technologies and applications, 2023-06, Vol.57 (3), p.385-396</ispartof><rights>Emerald Publishing Limited</rights><rights>Emerald Publishing Limited.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c264t-e05160c2f538191f72b7161d2343dbf10586b332104ce02a531ca651b2d127b03</cites><orcidid>0000-0001-5641-6396</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.emerald.com/insight/content/doi/10.1108/DTA-08-2022-0302/full/html$$EHTML$$P50$$Gemerald$$H</linktohtml><link.rule.ids>314,776,780,21676,27903,27904,53222</link.rule.ids></links><search><creatorcontrib>Li, Jingyi</creatorcontrib><creatorcontrib>Chao, Shiwei</creatorcontrib><title>A novel twin-support vector machine for binary classification to imbalanced data</title><title>Data technologies and applications</title><description>PurposeBinary classification on imbalanced data is a challenge; due to the imbalance of the classes, the minority class is easily masked by the majority class. However, most existing classifiers are better at identifying the majority class, thereby ignoring the minority class, which leads to classifier degradation. To address this, this paper proposes a twin-support vector machines for binary classification on imbalanced data.Design/methodology/approachIn the proposed method, the authors construct two support vector machines to focus on majority classes and minority classes, respectively. In order to promote the learning ability of the two support vector machines, a new kernel is derived for them.Findings(1) A novel twin-support vector machine is proposed for binary classification on imbalanced data, and new kernels are derived. (2) For imbalanced data, the complexity of data distribution has negative effects on classification results; however, advanced classification results can be gained and desired boundaries are learned by using optimizing kernels. (3) Classifiers based on twin architectures have more advantages than those based on single architecture for binary classification on imbalanced data.Originality/valueFor imbalanced data, the complexity of data distribution has negative effects on classification results; however, advanced classification results can be gained and desired boundaries are learned through using optimizing kernels.</description><subject>Accuracy</subject><subject>Artificial Intelligence</subject><subject>Boundaries</subject><subject>Classification</subject><subject>Classifiers</subject><subject>Complexity</subject><subject>Datasets</subject><subject>Lagrange multiplier</subject><subject>Machine learning</subject><subject>Methods</subject><subject>Neural networks</subject><subject>Programming</subject><subject>Sampling</subject><subject>Support vector machines</subject><issn>2514-9288</issn><issn>2514-9318</issn><issn>2514-9288</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>BENPR</sourceid><recordid>eNptkEtPwzAQhC0EElXpnaMlzqa76zhxjlV5SpXgUM6W4zgiVV7YaRH_nlSFAxKnmcPMrPZj7BrhFhH08m67EqAFAZEACXTGZqQwEblEff7rSetLtohxBwAEKpNazdjrinf9wTd8_Kw7EffD0IeRH7wb-8Bb697rzvNq8kXd2fDFXWNjrKva2bHuOz72vG4L29jO-ZKXdrRX7KKyTfSLH52zt4f77fpJbF4en9erjXCUJqPwoDAFR5WSGnOsMioyTLEkmciyqBCUTgspCSFxHsgqic6mCgsqkbIC5JzdnHaH0H_sfRzNrt-HbjppKCfALM9zOaXglHKhjzH4ygyhbqdHDII5ojMTOjPJEZ05opsqy1PFtz7Ypvyv8Qe2_Ab4Am2H</recordid><startdate>20230614</startdate><enddate>20230614</enddate><creator>Li, Jingyi</creator><creator>Chao, Shiwei</creator><general>Emerald Publishing Limited</general><general>Emerald Group Publishing Limited</general><scope>AAYXX</scope><scope>CITATION</scope><scope>0-V</scope><scope>7SC</scope><scope>7WY</scope><scope>7WZ</scope><scope>7XB</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ALSLI</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BEZIV</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>CJNVE</scope><scope>CNYFK</scope><scope>DWQXO</scope><scope>E3H</scope><scope>F2A</scope><scope>F~G</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K6~</scope><scope>K7-</scope><scope>L.-</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>M0C</scope><scope>M0N</scope><scope>M0P</scope><scope>M1O</scope><scope>P5Z</scope><scope>P62</scope><scope>PQBIZ</scope><scope>PQEDU</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PYYUZ</scope><scope>Q9U</scope><orcidid>https://orcid.org/0000-0001-5641-6396</orcidid></search><sort><creationdate>20230614</creationdate><title>A novel twin-support vector machine for binary classification to imbalanced data</title><author>Li, Jingyi ; Chao, Shiwei</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c264t-e05160c2f538191f72b7161d2343dbf10586b332104ce02a531ca651b2d127b03</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Accuracy</topic><topic>Artificial Intelligence</topic><topic>Boundaries</topic><topic>Classification</topic><topic>Classifiers</topic><topic>Complexity</topic><topic>Datasets</topic><topic>Lagrange multiplier</topic><topic>Machine learning</topic><topic>Methods</topic><topic>Neural networks</topic><topic>Programming</topic><topic>Sampling</topic><topic>Support vector machines</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Li, Jingyi</creatorcontrib><creatorcontrib>Chao, Shiwei</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Social Sciences Premium Collection</collection><collection>Computer and Information Systems Abstracts</collection><collection>ABI/INFORM Collection</collection><collection>ABI/INFORM Global (PDF only)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Social Science Premium Collection</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Business Premium Collection</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>Education Collection</collection><collection>Library & Information Science Collection</collection><collection>ProQuest Central Korea</collection><collection>Library & Information Sciences Abstracts (LISA)</collection><collection>Library & Information Science Abstracts (LISA)</collection><collection>ABI/INFORM Global (Corporate)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>ProQuest Business Collection</collection><collection>Computer Science Database</collection><collection>ABI/INFORM Professional Advanced</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>ABI/INFORM Global</collection><collection>Computing Database</collection><collection>Education Database</collection><collection>Library Science Database</collection><collection>Advanced Technologies & Aerospace Database</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>ProQuest One Business</collection><collection>ProQuest One Education</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ABI/INFORM Collection China</collection><collection>ProQuest Central Basic</collection><jtitle>Data technologies and applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Li, Jingyi</au><au>Chao, Shiwei</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A novel twin-support vector machine for binary classification to imbalanced data</atitle><jtitle>Data technologies and applications</jtitle><date>2023-06-14</date><risdate>2023</risdate><volume>57</volume><issue>3</issue><spage>385</spage><epage>396</epage><pages>385-396</pages><issn>2514-9288</issn><eissn>2514-9318</eissn><eissn>2514-9288</eissn><abstract>PurposeBinary classification on imbalanced data is a challenge; due to the imbalance of the classes, the minority class is easily masked by the majority class. However, most existing classifiers are better at identifying the majority class, thereby ignoring the minority class, which leads to classifier degradation. To address this, this paper proposes a twin-support vector machines for binary classification on imbalanced data.Design/methodology/approachIn the proposed method, the authors construct two support vector machines to focus on majority classes and minority classes, respectively. In order to promote the learning ability of the two support vector machines, a new kernel is derived for them.Findings(1) A novel twin-support vector machine is proposed for binary classification on imbalanced data, and new kernels are derived. (2) For imbalanced data, the complexity of data distribution has negative effects on classification results; however, advanced classification results can be gained and desired boundaries are learned by using optimizing kernels. (3) Classifiers based on twin architectures have more advantages than those based on single architecture for binary classification on imbalanced data.Originality/valueFor imbalanced data, the complexity of data distribution has negative effects on classification results; however, advanced classification results can be gained and desired boundaries are learned through using optimizing kernels.</abstract><cop>Bingley</cop><pub>Emerald Publishing Limited</pub><doi>10.1108/DTA-08-2022-0302</doi><tpages>12</tpages><orcidid>https://orcid.org/0000-0001-5641-6396</orcidid></addata></record>
fulltext	fulltext
identifier	ISSN: 2514-9288
ispartof	Data technologies and applications, 2023-06, Vol.57 (3), p.385-396
issn	2514-9288 2514-9318 2514-9288
language	eng
recordid	cdi_emerald_primary_10_1108_DTA-08-2022-0302
source	Standard: Emerald eJournal Premier Collection
subjects	Accuracy Artificial Intelligence Boundaries Classification Classifiers Complexity Datasets Lagrange multiplier Machine learning Methods Neural networks Programming Sampling Support vector machines
title	A novel twin-support vector machine for binary classification to imbalanced data
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-27T16%3A42%3A41IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_emera&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20novel%20twin-support%20vector%20machine%20for%20binary%20classification%20to%20imbalanced%20data&rft.jtitle=Data%20technologies%20and%20applications&rft.au=Li,%20Jingyi&rft.date=2023-06-14&rft.volume=57&rft.issue=3&rft.spage=385&rft.epage=396&rft.pages=385-396&rft.issn=2514-9288&rft.eissn=2514-9318&rft_id=info:doi/10.1108/DTA-08-2022-0302&rft_dat=%3Cproquest_emera%3E2920179993%3C/proquest_emera%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2920179993&rft_id=info:pmid/&rfr_iscdi=true