Incremental feature selection for efficient classification of dynamic graph bags

Summary Learning and analyzing graph data is one of the most fundamental research areas in machine learning and data mining. Among numerous graph‐based data structures, this paper focuses on a graph bag (simply, bag), which corresponds to a training object containing one or more graphs, and a label...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Concurrency and computation 2020-09, Vol.32 (18), p.n/a
Hauptverfasser: Chae, Dong‐Kyu, Kim, Bo‐Kyum, Kim, Seung‐Ho, Kim, Sang‐Wook
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page n/a
container_issue 18
container_start_page
container_title Concurrency and computation
container_volume 32
creator Chae, Dong‐Kyu
Kim, Bo‐Kyum
Kim, Seung‐Ho
Kim, Sang‐Wook
description Summary Learning and analyzing graph data is one of the most fundamental research areas in machine learning and data mining. Among numerous graph‐based data structures, this paper focuses on a graph bag (simply, bag), which corresponds to a training object containing one or more graphs, and a label is available only for a bag. This type of a bag can represent various real‐world objects such as drugs, web pages, XML documents, and images, among many others, and there have been many researches on models for learning this type of bag data. Within this research context, we define a novel problem of dynamic graph bag classification, and propose an algorithm to solve this problem. Dynamic bag classification aims to build a classification model for bags, which are presented in a streaming fashion, ie, frequent emerging of new bags or graphs over time. Given such changes made to the bag dataset, our proposed algorithm aims to update incrementally the top‐m most discriminative features instead of searching for them from scratch. Incremental gSpan and incremental gScore are proposed as core parts of our algorithm to deal with a stream of bags efficiently. We evaluate our algorithm on two real‐world datasets in terms of both feature selection time and classification accuracy. The experimental results demonstrate that our algorithm derives an informative feature set much faster than the existing one originally designed for targeting static bag data, with little accuracy loss.
doi_str_mv 10.1002/cpe.5502
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2436639100</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2436639100</sourcerecordid><originalsourceid>FETCH-LOGICAL-c2932-ff4df6752d46ba6c9e595b5134ebdf76fa1c2a191b2824eb416753c24bd1c6803</originalsourceid><addsrcrecordid>eNp1kEtLAzEQgIMoWKvgTwh48bI1k1e7RylVC4I96Dlks5Oast1dky3Sf2_aijdP8_qYGT5CboFNgDH-4HqcKMX4GRmBErxgWsjzv5zrS3KV0oYxACZgRFbL1kXcYjvYhnq0wy4iTdigG0LXUt9Fit4HFzJBXWNTCrmyx2Hnab1v7TY4uo62_6SVXadrcuFtk_DmN47Jx9Piff5SvL49L-ePr4XjZX7Fe1l7PVW8lrqy2pWoSlUpEBKr2k-1t-C4hRIqPuO5JyHDwnFZ1eD0jIkxuTvt7WP3tcM0mE23i20-abgUWosy68jU_YlysUspojd9DFsb9waYOfgy2Zc5-MpocUK_Q4P7fzkzXy2O_A8v1WvN</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2436639100</pqid></control><display><type>article</type><title>Incremental feature selection for efficient classification of dynamic graph bags</title><source>Wiley Online Library Journals Frontfile Complete</source><creator>Chae, Dong‐Kyu ; Kim, Bo‐Kyum ; Kim, Seung‐Ho ; Kim, Sang‐Wook</creator><creatorcontrib>Chae, Dong‐Kyu ; Kim, Bo‐Kyum ; Kim, Seung‐Ho ; Kim, Sang‐Wook</creatorcontrib><description>Summary Learning and analyzing graph data is one of the most fundamental research areas in machine learning and data mining. Among numerous graph‐based data structures, this paper focuses on a graph bag (simply, bag), which corresponds to a training object containing one or more graphs, and a label is available only for a bag. This type of a bag can represent various real‐world objects such as drugs, web pages, XML documents, and images, among many others, and there have been many researches on models for learning this type of bag data. Within this research context, we define a novel problem of dynamic graph bag classification, and propose an algorithm to solve this problem. Dynamic bag classification aims to build a classification model for bags, which are presented in a streaming fashion, ie, frequent emerging of new bags or graphs over time. Given such changes made to the bag dataset, our proposed algorithm aims to update incrementally the top‐m most discriminative features instead of searching for them from scratch. Incremental gSpan and incremental gScore are proposed as core parts of our algorithm to deal with a stream of bags efficiently. We evaluate our algorithm on two real‐world datasets in terms of both feature selection time and classification accuracy. The experimental results demonstrate that our algorithm derives an informative feature set much faster than the existing one originally designed for targeting static bag data, with little accuracy loss.</description><identifier>ISSN: 1532-0626</identifier><identifier>EISSN: 1532-0634</identifier><identifier>DOI: 10.1002/cpe.5502</identifier><language>eng</language><publisher>Hoboken: Wiley Subscription Services, Inc</publisher><subject>Algorithms ; Classification ; Data mining ; Data structures ; Datasets ; dynamic classification ; Feature selection ; Graph bag classification ; Graphs ; gSpan ; Machine learning ; Websites</subject><ispartof>Concurrency and computation, 2020-09, Vol.32 (18), p.n/a</ispartof><rights>2019 John Wiley &amp; Sons, Ltd.</rights><rights>2020 John Wiley &amp; Sons, Ltd.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c2932-ff4df6752d46ba6c9e595b5134ebdf76fa1c2a191b2824eb416753c24bd1c6803</citedby><cites>FETCH-LOGICAL-c2932-ff4df6752d46ba6c9e595b5134ebdf76fa1c2a191b2824eb416753c24bd1c6803</cites><orcidid>0000-0002-6345-9084</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://onlinelibrary.wiley.com/doi/pdf/10.1002%2Fcpe.5502$$EPDF$$P50$$Gwiley$$H</linktopdf><linktohtml>$$Uhttps://onlinelibrary.wiley.com/doi/full/10.1002%2Fcpe.5502$$EHTML$$P50$$Gwiley$$H</linktohtml><link.rule.ids>314,776,780,1411,27901,27902,45550,45551</link.rule.ids></links><search><creatorcontrib>Chae, Dong‐Kyu</creatorcontrib><creatorcontrib>Kim, Bo‐Kyum</creatorcontrib><creatorcontrib>Kim, Seung‐Ho</creatorcontrib><creatorcontrib>Kim, Sang‐Wook</creatorcontrib><title>Incremental feature selection for efficient classification of dynamic graph bags</title><title>Concurrency and computation</title><description>Summary Learning and analyzing graph data is one of the most fundamental research areas in machine learning and data mining. Among numerous graph‐based data structures, this paper focuses on a graph bag (simply, bag), which corresponds to a training object containing one or more graphs, and a label is available only for a bag. This type of a bag can represent various real‐world objects such as drugs, web pages, XML documents, and images, among many others, and there have been many researches on models for learning this type of bag data. Within this research context, we define a novel problem of dynamic graph bag classification, and propose an algorithm to solve this problem. Dynamic bag classification aims to build a classification model for bags, which are presented in a streaming fashion, ie, frequent emerging of new bags or graphs over time. Given such changes made to the bag dataset, our proposed algorithm aims to update incrementally the top‐m most discriminative features instead of searching for them from scratch. Incremental gSpan and incremental gScore are proposed as core parts of our algorithm to deal with a stream of bags efficiently. We evaluate our algorithm on two real‐world datasets in terms of both feature selection time and classification accuracy. The experimental results demonstrate that our algorithm derives an informative feature set much faster than the existing one originally designed for targeting static bag data, with little accuracy loss.</description><subject>Algorithms</subject><subject>Classification</subject><subject>Data mining</subject><subject>Data structures</subject><subject>Datasets</subject><subject>dynamic classification</subject><subject>Feature selection</subject><subject>Graph bag classification</subject><subject>Graphs</subject><subject>gSpan</subject><subject>Machine learning</subject><subject>Websites</subject><issn>1532-0626</issn><issn>1532-0634</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><recordid>eNp1kEtLAzEQgIMoWKvgTwh48bI1k1e7RylVC4I96Dlks5Oast1dky3Sf2_aijdP8_qYGT5CboFNgDH-4HqcKMX4GRmBErxgWsjzv5zrS3KV0oYxACZgRFbL1kXcYjvYhnq0wy4iTdigG0LXUt9Fit4HFzJBXWNTCrmyx2Hnab1v7TY4uo62_6SVXadrcuFtk_DmN47Jx9Piff5SvL49L-ePr4XjZX7Fe1l7PVW8lrqy2pWoSlUpEBKr2k-1t-C4hRIqPuO5JyHDwnFZ1eD0jIkxuTvt7WP3tcM0mE23i20-abgUWosy68jU_YlysUspojd9DFsb9waYOfgy2Zc5-MpocUK_Q4P7fzkzXy2O_A8v1WvN</recordid><startdate>20200925</startdate><enddate>20200925</enddate><creator>Chae, Dong‐Kyu</creator><creator>Kim, Bo‐Kyum</creator><creator>Kim, Seung‐Ho</creator><creator>Kim, Sang‐Wook</creator><general>Wiley Subscription Services, Inc</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0002-6345-9084</orcidid></search><sort><creationdate>20200925</creationdate><title>Incremental feature selection for efficient classification of dynamic graph bags</title><author>Chae, Dong‐Kyu ; Kim, Bo‐Kyum ; Kim, Seung‐Ho ; Kim, Sang‐Wook</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c2932-ff4df6752d46ba6c9e595b5134ebdf76fa1c2a191b2824eb416753c24bd1c6803</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Algorithms</topic><topic>Classification</topic><topic>Data mining</topic><topic>Data structures</topic><topic>Datasets</topic><topic>dynamic classification</topic><topic>Feature selection</topic><topic>Graph bag classification</topic><topic>Graphs</topic><topic>gSpan</topic><topic>Machine learning</topic><topic>Websites</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Chae, Dong‐Kyu</creatorcontrib><creatorcontrib>Kim, Bo‐Kyum</creatorcontrib><creatorcontrib>Kim, Seung‐Ho</creatorcontrib><creatorcontrib>Kim, Sang‐Wook</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Concurrency and computation</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Chae, Dong‐Kyu</au><au>Kim, Bo‐Kyum</au><au>Kim, Seung‐Ho</au><au>Kim, Sang‐Wook</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Incremental feature selection for efficient classification of dynamic graph bags</atitle><jtitle>Concurrency and computation</jtitle><date>2020-09-25</date><risdate>2020</risdate><volume>32</volume><issue>18</issue><epage>n/a</epage><issn>1532-0626</issn><eissn>1532-0634</eissn><abstract>Summary Learning and analyzing graph data is one of the most fundamental research areas in machine learning and data mining. Among numerous graph‐based data structures, this paper focuses on a graph bag (simply, bag), which corresponds to a training object containing one or more graphs, and a label is available only for a bag. This type of a bag can represent various real‐world objects such as drugs, web pages, XML documents, and images, among many others, and there have been many researches on models for learning this type of bag data. Within this research context, we define a novel problem of dynamic graph bag classification, and propose an algorithm to solve this problem. Dynamic bag classification aims to build a classification model for bags, which are presented in a streaming fashion, ie, frequent emerging of new bags or graphs over time. Given such changes made to the bag dataset, our proposed algorithm aims to update incrementally the top‐m most discriminative features instead of searching for them from scratch. Incremental gSpan and incremental gScore are proposed as core parts of our algorithm to deal with a stream of bags efficiently. We evaluate our algorithm on two real‐world datasets in terms of both feature selection time and classification accuracy. The experimental results demonstrate that our algorithm derives an informative feature set much faster than the existing one originally designed for targeting static bag data, with little accuracy loss.</abstract><cop>Hoboken</cop><pub>Wiley Subscription Services, Inc</pub><doi>10.1002/cpe.5502</doi><tpages>13</tpages><orcidid>https://orcid.org/0000-0002-6345-9084</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 1532-0626
ispartof Concurrency and computation, 2020-09, Vol.32 (18), p.n/a
issn 1532-0626
1532-0634
language eng
recordid cdi_proquest_journals_2436639100
source Wiley Online Library Journals Frontfile Complete
subjects Algorithms
Classification
Data mining
Data structures
Datasets
dynamic classification
Feature selection
Graph bag classification
Graphs
gSpan
Machine learning
Websites
title Incremental feature selection for efficient classification of dynamic graph bags
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-31T15%3A09%3A23IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Incremental%20feature%20selection%20for%20efficient%20classification%20of%20dynamic%20graph%20bags&rft.jtitle=Concurrency%20and%20computation&rft.au=Chae,%20Dong%E2%80%90Kyu&rft.date=2020-09-25&rft.volume=32&rft.issue=18&rft.epage=n/a&rft.issn=1532-0626&rft.eissn=1532-0634&rft_id=info:doi/10.1002/cpe.5502&rft_dat=%3Cproquest_cross%3E2436639100%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2436639100&rft_id=info:pmid/&rfr_iscdi=true