A novel twin-support vector machine for binary classification to imbalanced data

PurposeBinary classification on imbalanced data is a challenge; due to the imbalance of the classes, the minority class is easily masked by the majority class. However, most existing classifiers are better at identifying the majority class, thereby ignoring the minority class, which leads to classif...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Data technologies and applications 2023-06, Vol.57 (3), p.385-396
Hauptverfasser: Li, Jingyi, Chao, Shiwei
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 396
container_issue 3
container_start_page 385
container_title Data technologies and applications
container_volume 57
creator Li, Jingyi
Chao, Shiwei
description PurposeBinary classification on imbalanced data is a challenge; due to the imbalance of the classes, the minority class is easily masked by the majority class. However, most existing classifiers are better at identifying the majority class, thereby ignoring the minority class, which leads to classifier degradation. To address this, this paper proposes a twin-support vector machines for binary classification on imbalanced data.Design/methodology/approachIn the proposed method, the authors construct two support vector machines to focus on majority classes and minority classes, respectively. In order to promote the learning ability of the two support vector machines, a new kernel is derived for them.Findings(1) A novel twin-support vector machine is proposed for binary classification on imbalanced data, and new kernels are derived. (2) For imbalanced data, the complexity of data distribution has negative effects on classification results; however, advanced classification results can be gained and desired boundaries are learned by using optimizing kernels. (3) Classifiers based on twin architectures have more advantages than those based on single architecture for binary classification on imbalanced data.Originality/valueFor imbalanced data, the complexity of data distribution has negative effects on classification results; however, advanced classification results can be gained and desired boundaries are learned through using optimizing kernels.
doi_str_mv 10.1108/DTA-08-2022-0302
format Article
fullrecord <record><control><sourceid>proquest_emera</sourceid><recordid>TN_cdi_emerald_primary_10_1108_DTA-08-2022-0302</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2920179993</sourcerecordid><originalsourceid>FETCH-LOGICAL-c264t-e05160c2f538191f72b7161d2343dbf10586b332104ce02a531ca651b2d127b03</originalsourceid><addsrcrecordid>eNptkEtPwzAQhC0EElXpnaMlzqa76zhxjlV5SpXgUM6W4zgiVV7YaRH_nlSFAxKnmcPMrPZj7BrhFhH08m67EqAFAZEACXTGZqQwEblEff7rSetLtohxBwAEKpNazdjrinf9wTd8_Kw7EffD0IeRH7wb-8Bb697rzvNq8kXd2fDFXWNjrKva2bHuOz72vG4L29jO-ZKXdrRX7KKyTfSLH52zt4f77fpJbF4en9erjXCUJqPwoDAFR5WSGnOsMioyTLEkmciyqBCUTgspCSFxHsgqic6mCgsqkbIC5JzdnHaH0H_sfRzNrt-HbjppKCfALM9zOaXglHKhjzH4ygyhbqdHDII5ojMTOjPJEZ05opsqy1PFtz7Ypvyv8Qe2_Ab4Am2H</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2920179993</pqid></control><display><type>article</type><title>A novel twin-support vector machine for binary classification to imbalanced data</title><source>Standard: Emerald eJournal Premier Collection</source><creator>Li, Jingyi ; Chao, Shiwei</creator><creatorcontrib>Li, Jingyi ; Chao, Shiwei</creatorcontrib><description>PurposeBinary classification on imbalanced data is a challenge; due to the imbalance of the classes, the minority class is easily masked by the majority class. However, most existing classifiers are better at identifying the majority class, thereby ignoring the minority class, which leads to classifier degradation. To address this, this paper proposes a twin-support vector machines for binary classification on imbalanced data.Design/methodology/approachIn the proposed method, the authors construct two support vector machines to focus on majority classes and minority classes, respectively. In order to promote the learning ability of the two support vector machines, a new kernel is derived for them.Findings(1) A novel twin-support vector machine is proposed for binary classification on imbalanced data, and new kernels are derived. (2) For imbalanced data, the complexity of data distribution has negative effects on classification results; however, advanced classification results can be gained and desired boundaries are learned by using optimizing kernels. (3) Classifiers based on twin architectures have more advantages than those based on single architecture for binary classification on imbalanced data.Originality/valueFor imbalanced data, the complexity of data distribution has negative effects on classification results; however, advanced classification results can be gained and desired boundaries are learned through using optimizing kernels.</description><identifier>ISSN: 2514-9288</identifier><identifier>EISSN: 2514-9318</identifier><identifier>EISSN: 2514-9288</identifier><identifier>DOI: 10.1108/DTA-08-2022-0302</identifier><language>eng</language><publisher>Bingley: Emerald Publishing Limited</publisher><subject>Accuracy ; Artificial Intelligence ; Boundaries ; Classification ; Classifiers ; Complexity ; Datasets ; Lagrange multiplier ; Machine learning ; Methods ; Neural networks ; Programming ; Sampling ; Support vector machines</subject><ispartof>Data technologies and applications, 2023-06, Vol.57 (3), p.385-396</ispartof><rights>Emerald Publishing Limited</rights><rights>Emerald Publishing Limited.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c264t-e05160c2f538191f72b7161d2343dbf10586b332104ce02a531ca651b2d127b03</cites><orcidid>0000-0001-5641-6396</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.emerald.com/insight/content/doi/10.1108/DTA-08-2022-0302/full/html$$EHTML$$P50$$Gemerald$$H</linktohtml><link.rule.ids>314,776,780,21676,27903,27904,53222</link.rule.ids></links><search><creatorcontrib>Li, Jingyi</creatorcontrib><creatorcontrib>Chao, Shiwei</creatorcontrib><title>A novel twin-support vector machine for binary classification to imbalanced data</title><title>Data technologies and applications</title><description>PurposeBinary classification on imbalanced data is a challenge; due to the imbalance of the classes, the minority class is easily masked by the majority class. However, most existing classifiers are better at identifying the majority class, thereby ignoring the minority class, which leads to classifier degradation. To address this, this paper proposes a twin-support vector machines for binary classification on imbalanced data.Design/methodology/approachIn the proposed method, the authors construct two support vector machines to focus on majority classes and minority classes, respectively. In order to promote the learning ability of the two support vector machines, a new kernel is derived for them.Findings(1) A novel twin-support vector machine is proposed for binary classification on imbalanced data, and new kernels are derived. (2) For imbalanced data, the complexity of data distribution has negative effects on classification results; however, advanced classification results can be gained and desired boundaries are learned by using optimizing kernels. (3) Classifiers based on twin architectures have more advantages than those based on single architecture for binary classification on imbalanced data.Originality/valueFor imbalanced data, the complexity of data distribution has negative effects on classification results; however, advanced classification results can be gained and desired boundaries are learned through using optimizing kernels.</description><subject>Accuracy</subject><subject>Artificial Intelligence</subject><subject>Boundaries</subject><subject>Classification</subject><subject>Classifiers</subject><subject>Complexity</subject><subject>Datasets</subject><subject>Lagrange multiplier</subject><subject>Machine learning</subject><subject>Methods</subject><subject>Neural networks</subject><subject>Programming</subject><subject>Sampling</subject><subject>Support vector machines</subject><issn>2514-9288</issn><issn>2514-9318</issn><issn>2514-9288</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>BENPR</sourceid><recordid>eNptkEtPwzAQhC0EElXpnaMlzqa76zhxjlV5SpXgUM6W4zgiVV7YaRH_nlSFAxKnmcPMrPZj7BrhFhH08m67EqAFAZEACXTGZqQwEblEff7rSetLtohxBwAEKpNazdjrinf9wTd8_Kw7EffD0IeRH7wb-8Bb697rzvNq8kXd2fDFXWNjrKva2bHuOz72vG4L29jO-ZKXdrRX7KKyTfSLH52zt4f77fpJbF4en9erjXCUJqPwoDAFR5WSGnOsMioyTLEkmciyqBCUTgspCSFxHsgqic6mCgsqkbIC5JzdnHaH0H_sfRzNrt-HbjppKCfALM9zOaXglHKhjzH4ygyhbqdHDII5ojMTOjPJEZ05opsqy1PFtz7Ypvyv8Qe2_Ab4Am2H</recordid><startdate>20230614</startdate><enddate>20230614</enddate><creator>Li, Jingyi</creator><creator>Chao, Shiwei</creator><general>Emerald Publishing Limited</general><general>Emerald Group Publishing Limited</general><scope>AAYXX</scope><scope>CITATION</scope><scope>0-V</scope><scope>7SC</scope><scope>7WY</scope><scope>7WZ</scope><scope>7XB</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ALSLI</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BEZIV</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>CJNVE</scope><scope>CNYFK</scope><scope>DWQXO</scope><scope>E3H</scope><scope>F2A</scope><scope>F~G</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K6~</scope><scope>K7-</scope><scope>L.-</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>M0C</scope><scope>M0N</scope><scope>M0P</scope><scope>M1O</scope><scope>P5Z</scope><scope>P62</scope><scope>PQBIZ</scope><scope>PQEDU</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PYYUZ</scope><scope>Q9U</scope><orcidid>https://orcid.org/0000-0001-5641-6396</orcidid></search><sort><creationdate>20230614</creationdate><title>A novel twin-support vector machine for binary classification to imbalanced data</title><author>Li, Jingyi ; Chao, Shiwei</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c264t-e05160c2f538191f72b7161d2343dbf10586b332104ce02a531ca651b2d127b03</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Accuracy</topic><topic>Artificial Intelligence</topic><topic>Boundaries</topic><topic>Classification</topic><topic>Classifiers</topic><topic>Complexity</topic><topic>Datasets</topic><topic>Lagrange multiplier</topic><topic>Machine learning</topic><topic>Methods</topic><topic>Neural networks</topic><topic>Programming</topic><topic>Sampling</topic><topic>Support vector machines</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Li, Jingyi</creatorcontrib><creatorcontrib>Chao, Shiwei</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Social Sciences Premium Collection</collection><collection>Computer and Information Systems Abstracts</collection><collection>ABI/INFORM Collection</collection><collection>ABI/INFORM Global (PDF only)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Social Science Premium Collection</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Business Premium Collection</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>Education Collection</collection><collection>Library &amp; Information Science Collection</collection><collection>ProQuest Central Korea</collection><collection>Library &amp; Information Sciences Abstracts (LISA)</collection><collection>Library &amp; Information Science Abstracts (LISA)</collection><collection>ABI/INFORM Global (Corporate)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>ProQuest Business Collection</collection><collection>Computer Science Database</collection><collection>ABI/INFORM Professional Advanced</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>ABI/INFORM Global</collection><collection>Computing Database</collection><collection>Education Database</collection><collection>Library Science Database</collection><collection>Advanced Technologies &amp; Aerospace Database</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest One Business</collection><collection>ProQuest One Education</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ABI/INFORM Collection China</collection><collection>ProQuest Central Basic</collection><jtitle>Data technologies and applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Li, Jingyi</au><au>Chao, Shiwei</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A novel twin-support vector machine for binary classification to imbalanced data</atitle><jtitle>Data technologies and applications</jtitle><date>2023-06-14</date><risdate>2023</risdate><volume>57</volume><issue>3</issue><spage>385</spage><epage>396</epage><pages>385-396</pages><issn>2514-9288</issn><eissn>2514-9318</eissn><eissn>2514-9288</eissn><abstract>PurposeBinary classification on imbalanced data is a challenge; due to the imbalance of the classes, the minority class is easily masked by the majority class. However, most existing classifiers are better at identifying the majority class, thereby ignoring the minority class, which leads to classifier degradation. To address this, this paper proposes a twin-support vector machines for binary classification on imbalanced data.Design/methodology/approachIn the proposed method, the authors construct two support vector machines to focus on majority classes and minority classes, respectively. In order to promote the learning ability of the two support vector machines, a new kernel is derived for them.Findings(1) A novel twin-support vector machine is proposed for binary classification on imbalanced data, and new kernels are derived. (2) For imbalanced data, the complexity of data distribution has negative effects on classification results; however, advanced classification results can be gained and desired boundaries are learned by using optimizing kernels. (3) Classifiers based on twin architectures have more advantages than those based on single architecture for binary classification on imbalanced data.Originality/valueFor imbalanced data, the complexity of data distribution has negative effects on classification results; however, advanced classification results can be gained and desired boundaries are learned through using optimizing kernels.</abstract><cop>Bingley</cop><pub>Emerald Publishing Limited</pub><doi>10.1108/DTA-08-2022-0302</doi><tpages>12</tpages><orcidid>https://orcid.org/0000-0001-5641-6396</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 2514-9288
ispartof Data technologies and applications, 2023-06, Vol.57 (3), p.385-396
issn 2514-9288
2514-9318
2514-9288
language eng
recordid cdi_emerald_primary_10_1108_DTA-08-2022-0302
source Standard: Emerald eJournal Premier Collection
subjects Accuracy
Artificial Intelligence
Boundaries
Classification
Classifiers
Complexity
Datasets
Lagrange multiplier
Machine learning
Methods
Neural networks
Programming
Sampling
Support vector machines
title A novel twin-support vector machine for binary classification to imbalanced data
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-27T16%3A42%3A41IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_emera&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20novel%20twin-support%20vector%20machine%20for%20binary%20classification%20to%20imbalanced%20data&rft.jtitle=Data%20technologies%20and%20applications&rft.au=Li,%20Jingyi&rft.date=2023-06-14&rft.volume=57&rft.issue=3&rft.spage=385&rft.epage=396&rft.pages=385-396&rft.issn=2514-9288&rft.eissn=2514-9318&rft_id=info:doi/10.1108/DTA-08-2022-0302&rft_dat=%3Cproquest_emera%3E2920179993%3C/proquest_emera%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2920179993&rft_id=info:pmid/&rfr_iscdi=true