An ensemble of the distance-based and Naive Bayes classifiers for the online classification with data reduction

The paper proposes two variants of the ensemble distance-based and Naive-Bayes online classifiers with data reduction. In the first variant the reduced dataset is obtained through applying bias-correction fuzzy clustering. In the second we used the kernel-based fuzzy clustering as the data reduction...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of intelligent & fuzzy systems 2017-01, Vol.32 (2), p.1289-1296
Hauptverfasser: Jędrzejowicz, Joanna, Jędrzejowicz, Piotr
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 1296
container_issue 2
container_start_page 1289
container_title Journal of intelligent & fuzzy systems
container_volume 32
creator Jędrzejowicz, Joanna
Jędrzejowicz, Piotr
description The paper proposes two variants of the ensemble distance-based and Naive-Bayes online classifiers with data reduction. In the first variant the reduced dataset is obtained through applying bias-correction fuzzy clustering. In the second we used the kernel-based fuzzy clustering as the data reduction tool. It is assumed that vectors of data with unknown class label arrive one by one, and that there is available an initial chunk of data with known class labels serving as the initial training set. Classification is carried-out in rounds. Each round involves a number of the classification decisions equal to the chunk size. For each round a set of base classifiers is constructed using different distance metrics. Set of base classifiers is extended with the Naive-Bayes classifier. The unknown label of each incoming vector is determined through weighted majority voting. After each round has been completed the training set is replaced by the fresh one and the classification process is continued. The approach is validated through computational experiment involving a number of datasets often used for testing data streams mining algorithms.
doi_str_mv 10.3233/JIFS-169127
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_1993977530</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1993977530</sourcerecordid><originalsourceid>FETCH-LOGICAL-c219t-898d7ac7dff0f58ff86a1e97130e3ab831181529854e5b31f82fc03d49445a23</originalsourceid><addsrcrecordid>eNo9kE1LAzEQhoMoWKsn_0DAo6zmY7NJjrVYPyh6sPclm0xoSpvUZKv037u14mmGmfedd3gQuqbkjjPO719fZh8VbTRl8gSNqJKiUrqRp0NPmrqirG7O0UUpK0KoFIyMUJpEDLHAplsDTh73S8AulN5EC1VnCjhsosNvJnwBfjB7KNiuTSnBB8gF-5R_LSmuQ4T_lTV9SBF_h36JnekNzuB29jC7RGferAtc_dUxWsweF9Pnav7-9DKdzCvLqO6Hr5WTxkrnPfFCea8aQ0FLyglw0ylOqaKCaSVqEB2nXjFvCXe1rmthGB-jm-PZbU6fOyh9u0q7HIfElmrNtZSCk0F1e1TZnErJ4NttDhuT9y0l7QFoewDaHoHyH8vhaM8</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1993977530</pqid></control><display><type>article</type><title>An ensemble of the distance-based and Naive Bayes classifiers for the online classification with data reduction</title><source>Business Source Complete</source><creator>Jędrzejowicz, Joanna ; Jędrzejowicz, Piotr</creator><creatorcontrib>Jędrzejowicz, Joanna ; Jędrzejowicz, Piotr</creatorcontrib><description>The paper proposes two variants of the ensemble distance-based and Naive-Bayes online classifiers with data reduction. In the first variant the reduced dataset is obtained through applying bias-correction fuzzy clustering. In the second we used the kernel-based fuzzy clustering as the data reduction tool. It is assumed that vectors of data with unknown class label arrive one by one, and that there is available an initial chunk of data with known class labels serving as the initial training set. Classification is carried-out in rounds. Each round involves a number of the classification decisions equal to the chunk size. For each round a set of base classifiers is constructed using different distance metrics. Set of base classifiers is extended with the Naive-Bayes classifier. The unknown label of each incoming vector is determined through weighted majority voting. After each round has been completed the training set is replaced by the fresh one and the classification process is continued. The approach is validated through computational experiment involving a number of datasets often used for testing data streams mining algorithms.</description><identifier>ISSN: 1064-1246</identifier><identifier>EISSN: 1875-8967</identifier><identifier>DOI: 10.3233/JIFS-169127</identifier><language>eng</language><publisher>Amsterdam: IOS Press BV</publisher><subject>Bayesian analysis ; Classification ; Classifiers ; Clustering ; Data mining ; Data reduction ; Data transmission ; Training</subject><ispartof>Journal of intelligent &amp; fuzzy systems, 2017-01, Vol.32 (2), p.1289-1296</ispartof><rights>Copyright IOS Press BV 2017</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c219t-898d7ac7dff0f58ff86a1e97130e3ab831181529854e5b31f82fc03d49445a23</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27901,27902</link.rule.ids></links><search><creatorcontrib>Jędrzejowicz, Joanna</creatorcontrib><creatorcontrib>Jędrzejowicz, Piotr</creatorcontrib><title>An ensemble of the distance-based and Naive Bayes classifiers for the online classification with data reduction</title><title>Journal of intelligent &amp; fuzzy systems</title><description>The paper proposes two variants of the ensemble distance-based and Naive-Bayes online classifiers with data reduction. In the first variant the reduced dataset is obtained through applying bias-correction fuzzy clustering. In the second we used the kernel-based fuzzy clustering as the data reduction tool. It is assumed that vectors of data with unknown class label arrive one by one, and that there is available an initial chunk of data with known class labels serving as the initial training set. Classification is carried-out in rounds. Each round involves a number of the classification decisions equal to the chunk size. For each round a set of base classifiers is constructed using different distance metrics. Set of base classifiers is extended with the Naive-Bayes classifier. The unknown label of each incoming vector is determined through weighted majority voting. After each round has been completed the training set is replaced by the fresh one and the classification process is continued. The approach is validated through computational experiment involving a number of datasets often used for testing data streams mining algorithms.</description><subject>Bayesian analysis</subject><subject>Classification</subject><subject>Classifiers</subject><subject>Clustering</subject><subject>Data mining</subject><subject>Data reduction</subject><subject>Data transmission</subject><subject>Training</subject><issn>1064-1246</issn><issn>1875-8967</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2017</creationdate><recordtype>article</recordtype><recordid>eNo9kE1LAzEQhoMoWKsn_0DAo6zmY7NJjrVYPyh6sPclm0xoSpvUZKv037u14mmGmfedd3gQuqbkjjPO719fZh8VbTRl8gSNqJKiUrqRp0NPmrqirG7O0UUpK0KoFIyMUJpEDLHAplsDTh73S8AulN5EC1VnCjhsosNvJnwBfjB7KNiuTSnBB8gF-5R_LSmuQ4T_lTV9SBF_h36JnekNzuB29jC7RGferAtc_dUxWsweF9Pnav7-9DKdzCvLqO6Hr5WTxkrnPfFCea8aQ0FLyglw0ylOqaKCaSVqEB2nXjFvCXe1rmthGB-jm-PZbU6fOyh9u0q7HIfElmrNtZSCk0F1e1TZnErJ4NttDhuT9y0l7QFoewDaHoHyH8vhaM8</recordid><startdate>20170101</startdate><enddate>20170101</enddate><creator>Jędrzejowicz, Joanna</creator><creator>Jędrzejowicz, Piotr</creator><general>IOS Press BV</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20170101</creationdate><title>An ensemble of the distance-based and Naive Bayes classifiers for the online classification with data reduction</title><author>Jędrzejowicz, Joanna ; Jędrzejowicz, Piotr</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c219t-898d7ac7dff0f58ff86a1e97130e3ab831181529854e5b31f82fc03d49445a23</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2017</creationdate><topic>Bayesian analysis</topic><topic>Classification</topic><topic>Classifiers</topic><topic>Clustering</topic><topic>Data mining</topic><topic>Data reduction</topic><topic>Data transmission</topic><topic>Training</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Jędrzejowicz, Joanna</creatorcontrib><creatorcontrib>Jędrzejowicz, Piotr</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Journal of intelligent &amp; fuzzy systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Jędrzejowicz, Joanna</au><au>Jędrzejowicz, Piotr</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>An ensemble of the distance-based and Naive Bayes classifiers for the online classification with data reduction</atitle><jtitle>Journal of intelligent &amp; fuzzy systems</jtitle><date>2017-01-01</date><risdate>2017</risdate><volume>32</volume><issue>2</issue><spage>1289</spage><epage>1296</epage><pages>1289-1296</pages><issn>1064-1246</issn><eissn>1875-8967</eissn><abstract>The paper proposes two variants of the ensemble distance-based and Naive-Bayes online classifiers with data reduction. In the first variant the reduced dataset is obtained through applying bias-correction fuzzy clustering. In the second we used the kernel-based fuzzy clustering as the data reduction tool. It is assumed that vectors of data with unknown class label arrive one by one, and that there is available an initial chunk of data with known class labels serving as the initial training set. Classification is carried-out in rounds. Each round involves a number of the classification decisions equal to the chunk size. For each round a set of base classifiers is constructed using different distance metrics. Set of base classifiers is extended with the Naive-Bayes classifier. The unknown label of each incoming vector is determined through weighted majority voting. After each round has been completed the training set is replaced by the fresh one and the classification process is continued. The approach is validated through computational experiment involving a number of datasets often used for testing data streams mining algorithms.</abstract><cop>Amsterdam</cop><pub>IOS Press BV</pub><doi>10.3233/JIFS-169127</doi><tpages>8</tpages></addata></record>
fulltext fulltext
identifier ISSN: 1064-1246
ispartof Journal of intelligent & fuzzy systems, 2017-01, Vol.32 (2), p.1289-1296
issn 1064-1246
1875-8967
language eng
recordid cdi_proquest_journals_1993977530
source Business Source Complete
subjects Bayesian analysis
Classification
Classifiers
Clustering
Data mining
Data reduction
Data transmission
Training
title An ensemble of the distance-based and Naive Bayes classifiers for the online classification with data reduction
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-01T08%3A01%3A39IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=An%20ensemble%20of%20the%20distance-based%20and%20Naive%20Bayes%20classifiers%20for%20the%20online%20classification%20with%20data%20reduction&rft.jtitle=Journal%20of%20intelligent%20&%20fuzzy%20systems&rft.au=J%C4%99drzejowicz,%20Joanna&rft.date=2017-01-01&rft.volume=32&rft.issue=2&rft.spage=1289&rft.epage=1296&rft.pages=1289-1296&rft.issn=1064-1246&rft.eissn=1875-8967&rft_id=info:doi/10.3233/JIFS-169127&rft_dat=%3Cproquest_cross%3E1993977530%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1993977530&rft_id=info:pmid/&rfr_iscdi=true