A novel twin-support vector machine for binary classification to imbalanced data
PurposeBinary classification on imbalanced data is a challenge; due to the imbalance of the classes, the minority class is easily masked by the majority class. However, most existing classifiers are better at identifying the majority class, thereby ignoring the minority class, which leads to classif...
Gespeichert in:
Veröffentlicht in: | Data technologies and applications 2023-06, Vol.57 (3), p.385-396 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 396 |
---|---|
container_issue | 3 |
container_start_page | 385 |
container_title | Data technologies and applications |
container_volume | 57 |
creator | Li, Jingyi Chao, Shiwei |
description | PurposeBinary classification on imbalanced data is a challenge; due to the imbalance of the classes, the minority class is easily masked by the majority class. However, most existing classifiers are better at identifying the majority class, thereby ignoring the minority class, which leads to classifier degradation. To address this, this paper proposes a twin-support vector machines for binary classification on imbalanced data.Design/methodology/approachIn the proposed method, the authors construct two support vector machines to focus on majority classes and minority classes, respectively. In order to promote the learning ability of the two support vector machines, a new kernel is derived for them.Findings(1) A novel twin-support vector machine is proposed for binary classification on imbalanced data, and new kernels are derived. (2) For imbalanced data, the complexity of data distribution has negative effects on classification results; however, advanced classification results can be gained and desired boundaries are learned by using optimizing kernels. (3) Classifiers based on twin architectures have more advantages than those based on single architecture for binary classification on imbalanced data.Originality/valueFor imbalanced data, the complexity of data distribution has negative effects on classification results; however, advanced classification results can be gained and desired boundaries are learned through using optimizing kernels. |
doi_str_mv | 10.1108/DTA-08-2022-0302 |
format | Article |
fullrecord | <record><control><sourceid>proquest_emera</sourceid><recordid>TN_cdi_emerald_primary_10_1108_DTA-08-2022-0302</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2920179993</sourcerecordid><originalsourceid>FETCH-LOGICAL-c264t-e05160c2f538191f72b7161d2343dbf10586b332104ce02a531ca651b2d127b03</originalsourceid><addsrcrecordid>eNptkEtPwzAQhC0EElXpnaMlzqa76zhxjlV5SpXgUM6W4zgiVV7YaRH_nlSFAxKnmcPMrPZj7BrhFhH08m67EqAFAZEACXTGZqQwEblEff7rSetLtohxBwAEKpNazdjrinf9wTd8_Kw7EffD0IeRH7wb-8Bb697rzvNq8kXd2fDFXWNjrKva2bHuOz72vG4L29jO-ZKXdrRX7KKyTfSLH52zt4f77fpJbF4en9erjXCUJqPwoDAFR5WSGnOsMioyTLEkmciyqBCUTgspCSFxHsgqic6mCgsqkbIC5JzdnHaH0H_sfRzNrt-HbjppKCfALM9zOaXglHKhjzH4ygyhbqdHDII5ojMTOjPJEZ05opsqy1PFtz7Ypvyv8Qe2_Ab4Am2H</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2920179993</pqid></control><display><type>article</type><title>A novel twin-support vector machine for binary classification to imbalanced data</title><source>Standard: Emerald eJournal Premier Collection</source><creator>Li, Jingyi ; Chao, Shiwei</creator><creatorcontrib>Li, Jingyi ; Chao, Shiwei</creatorcontrib><description>PurposeBinary classification on imbalanced data is a challenge; due to the imbalance of the classes, the minority class is easily masked by the majority class. However, most existing classifiers are better at identifying the majority class, thereby ignoring the minority class, which leads to classifier degradation. To address this, this paper proposes a twin-support vector machines for binary classification on imbalanced data.Design/methodology/approachIn the proposed method, the authors construct two support vector machines to focus on majority classes and minority classes, respectively. In order to promote the learning ability of the two support vector machines, a new kernel is derived for them.Findings(1) A novel twin-support vector machine is proposed for binary classification on imbalanced data, and new kernels are derived. (2) For imbalanced data, the complexity of data distribution has negative effects on classification results; however, advanced classification results can be gained and desired boundaries are learned by using optimizing kernels. (3) Classifiers based on twin architectures have more advantages than those based on single architecture for binary classification on imbalanced data.Originality/valueFor imbalanced data, the complexity of data distribution has negative effects on classification results; however, advanced classification results can be gained and desired boundaries are learned through using optimizing kernels.</description><identifier>ISSN: 2514-9288</identifier><identifier>EISSN: 2514-9318</identifier><identifier>EISSN: 2514-9288</identifier><identifier>DOI: 10.1108/DTA-08-2022-0302</identifier><language>eng</language><publisher>Bingley: Emerald Publishing Limited</publisher><subject>Accuracy ; Artificial Intelligence ; Boundaries ; Classification ; Classifiers ; Complexity ; Datasets ; Lagrange multiplier ; Machine learning ; Methods ; Neural networks ; Programming ; Sampling ; Support vector machines</subject><ispartof>Data technologies and applications, 2023-06, Vol.57 (3), p.385-396</ispartof><rights>Emerald Publishing Limited</rights><rights>Emerald Publishing Limited.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c264t-e05160c2f538191f72b7161d2343dbf10586b332104ce02a531ca651b2d127b03</cites><orcidid>0000-0001-5641-6396</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.emerald.com/insight/content/doi/10.1108/DTA-08-2022-0302/full/html$$EHTML$$P50$$Gemerald$$H</linktohtml><link.rule.ids>314,776,780,21676,27903,27904,53222</link.rule.ids></links><search><creatorcontrib>Li, Jingyi</creatorcontrib><creatorcontrib>Chao, Shiwei</creatorcontrib><title>A novel twin-support vector machine for binary classification to imbalanced data</title><title>Data technologies and applications</title><description>PurposeBinary classification on imbalanced data is a challenge; due to the imbalance of the classes, the minority class is easily masked by the majority class. However, most existing classifiers are better at identifying the majority class, thereby ignoring the minority class, which leads to classifier degradation. To address this, this paper proposes a twin-support vector machines for binary classification on imbalanced data.Design/methodology/approachIn the proposed method, the authors construct two support vector machines to focus on majority classes and minority classes, respectively. In order to promote the learning ability of the two support vector machines, a new kernel is derived for them.Findings(1) A novel twin-support vector machine is proposed for binary classification on imbalanced data, and new kernels are derived. (2) For imbalanced data, the complexity of data distribution has negative effects on classification results; however, advanced classification results can be gained and desired boundaries are learned by using optimizing kernels. (3) Classifiers based on twin architectures have more advantages than those based on single architecture for binary classification on imbalanced data.Originality/valueFor imbalanced data, the complexity of data distribution has negative effects on classification results; however, advanced classification results can be gained and desired boundaries are learned through using optimizing kernels.</description><subject>Accuracy</subject><subject>Artificial Intelligence</subject><subject>Boundaries</subject><subject>Classification</subject><subject>Classifiers</subject><subject>Complexity</subject><subject>Datasets</subject><subject>Lagrange multiplier</subject><subject>Machine learning</subject><subject>Methods</subject><subject>Neural networks</subject><subject>Programming</subject><subject>Sampling</subject><subject>Support vector machines</subject><issn>2514-9288</issn><issn>2514-9318</issn><issn>2514-9288</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>BENPR</sourceid><recordid>eNptkEtPwzAQhC0EElXpnaMlzqa76zhxjlV5SpXgUM6W4zgiVV7YaRH_nlSFAxKnmcPMrPZj7BrhFhH08m67EqAFAZEACXTGZqQwEblEff7rSetLtohxBwAEKpNazdjrinf9wTd8_Kw7EffD0IeRH7wb-8Bb697rzvNq8kXd2fDFXWNjrKva2bHuOz72vG4L29jO-ZKXdrRX7KKyTfSLH52zt4f77fpJbF4en9erjXCUJqPwoDAFR5WSGnOsMioyTLEkmciyqBCUTgspCSFxHsgqic6mCgsqkbIC5JzdnHaH0H_sfRzNrt-HbjppKCfALM9zOaXglHKhjzH4ygyhbqdHDII5ojMTOjPJEZ05opsqy1PFtz7Ypvyv8Qe2_Ab4Am2H</recordid><startdate>20230614</startdate><enddate>20230614</enddate><creator>Li, Jingyi</creator><creator>Chao, Shiwei</creator><general>Emerald Publishing Limited</general><general>Emerald Group Publishing Limited</general><scope>AAYXX</scope><scope>CITATION</scope><scope>0-V</scope><scope>7SC</scope><scope>7WY</scope><scope>7WZ</scope><scope>7XB</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ALSLI</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BEZIV</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>CJNVE</scope><scope>CNYFK</scope><scope>DWQXO</scope><scope>E3H</scope><scope>F2A</scope><scope>F~G</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K6~</scope><scope>K7-</scope><scope>L.-</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>M0C</scope><scope>M0N</scope><scope>M0P</scope><scope>M1O</scope><scope>P5Z</scope><scope>P62</scope><scope>PQBIZ</scope><scope>PQEDU</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PYYUZ</scope><scope>Q9U</scope><orcidid>https://orcid.org/0000-0001-5641-6396</orcidid></search><sort><creationdate>20230614</creationdate><title>A novel twin-support vector machine for binary classification to imbalanced data</title><author>Li, Jingyi ; Chao, Shiwei</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c264t-e05160c2f538191f72b7161d2343dbf10586b332104ce02a531ca651b2d127b03</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Accuracy</topic><topic>Artificial Intelligence</topic><topic>Boundaries</topic><topic>Classification</topic><topic>Classifiers</topic><topic>Complexity</topic><topic>Datasets</topic><topic>Lagrange multiplier</topic><topic>Machine learning</topic><topic>Methods</topic><topic>Neural networks</topic><topic>Programming</topic><topic>Sampling</topic><topic>Support vector machines</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Li, Jingyi</creatorcontrib><creatorcontrib>Chao, Shiwei</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Social Sciences Premium Collection</collection><collection>Computer and Information Systems Abstracts</collection><collection>ABI/INFORM Collection</collection><collection>ABI/INFORM Global (PDF only)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Social Science Premium Collection</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Business Premium Collection</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>Education Collection</collection><collection>Library & Information Science Collection</collection><collection>ProQuest Central Korea</collection><collection>Library & Information Sciences Abstracts (LISA)</collection><collection>Library & Information Science Abstracts (LISA)</collection><collection>ABI/INFORM Global (Corporate)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>ProQuest Business Collection</collection><collection>Computer Science Database</collection><collection>ABI/INFORM Professional Advanced</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>ABI/INFORM Global</collection><collection>Computing Database</collection><collection>Education Database</collection><collection>Library Science Database</collection><collection>Advanced Technologies & Aerospace Database</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>ProQuest One Business</collection><collection>ProQuest One Education</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ABI/INFORM Collection China</collection><collection>ProQuest Central Basic</collection><jtitle>Data technologies and applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Li, Jingyi</au><au>Chao, Shiwei</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A novel twin-support vector machine for binary classification to imbalanced data</atitle><jtitle>Data technologies and applications</jtitle><date>2023-06-14</date><risdate>2023</risdate><volume>57</volume><issue>3</issue><spage>385</spage><epage>396</epage><pages>385-396</pages><issn>2514-9288</issn><eissn>2514-9318</eissn><eissn>2514-9288</eissn><abstract>PurposeBinary classification on imbalanced data is a challenge; due to the imbalance of the classes, the minority class is easily masked by the majority class. However, most existing classifiers are better at identifying the majority class, thereby ignoring the minority class, which leads to classifier degradation. To address this, this paper proposes a twin-support vector machines for binary classification on imbalanced data.Design/methodology/approachIn the proposed method, the authors construct two support vector machines to focus on majority classes and minority classes, respectively. In order to promote the learning ability of the two support vector machines, a new kernel is derived for them.Findings(1) A novel twin-support vector machine is proposed for binary classification on imbalanced data, and new kernels are derived. (2) For imbalanced data, the complexity of data distribution has negative effects on classification results; however, advanced classification results can be gained and desired boundaries are learned by using optimizing kernels. (3) Classifiers based on twin architectures have more advantages than those based on single architecture for binary classification on imbalanced data.Originality/valueFor imbalanced data, the complexity of data distribution has negative effects on classification results; however, advanced classification results can be gained and desired boundaries are learned through using optimizing kernels.</abstract><cop>Bingley</cop><pub>Emerald Publishing Limited</pub><doi>10.1108/DTA-08-2022-0302</doi><tpages>12</tpages><orcidid>https://orcid.org/0000-0001-5641-6396</orcidid></addata></record> |
fulltext | fulltext |
identifier | ISSN: 2514-9288 |
ispartof | Data technologies and applications, 2023-06, Vol.57 (3), p.385-396 |
issn | 2514-9288 2514-9318 2514-9288 |
language | eng |
recordid | cdi_emerald_primary_10_1108_DTA-08-2022-0302 |
source | Standard: Emerald eJournal Premier Collection |
subjects | Accuracy Artificial Intelligence Boundaries Classification Classifiers Complexity Datasets Lagrange multiplier Machine learning Methods Neural networks Programming Sampling Support vector machines |
title | A novel twin-support vector machine for binary classification to imbalanced data |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-27T16%3A42%3A41IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_emera&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20novel%20twin-support%20vector%20machine%20for%20binary%20classification%20to%20imbalanced%20data&rft.jtitle=Data%20technologies%20and%20applications&rft.au=Li,%20Jingyi&rft.date=2023-06-14&rft.volume=57&rft.issue=3&rft.spage=385&rft.epage=396&rft.pages=385-396&rft.issn=2514-9288&rft.eissn=2514-9318&rft_id=info:doi/10.1108/DTA-08-2022-0302&rft_dat=%3Cproquest_emera%3E2920179993%3C/proquest_emera%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2920179993&rft_id=info:pmid/&rfr_iscdi=true |