Shapelet Based Two-Step Time Series Positive and Unlabeled Learning
In the last decade, there has been significant progress in time series classification. However, in real-world industrial settings, it is expensive and difficult to obtain high-quality labeled data. Therefore, the positive and unlabeled learning (PU-learning) problem has become more and more popular...
Gespeichert in:
Veröffentlicht in: | Journal of computer science and technology 2023-12, Vol.38 (6), p.1387-1402 |
---|---|
Hauptverfasser: | , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 1402 |
---|---|
container_issue | 6 |
container_start_page | 1387 |
container_title | Journal of computer science and technology |
container_volume | 38 |
creator | Zhang, Han-Bo Wang, Peng Zhang, Ming-Ming Wang, Wei |
description | In the last decade, there has been significant progress in time series classification. However, in real-world industrial settings, it is expensive and difficult to obtain high-quality labeled data. Therefore, the positive and unlabeled learning (PU-learning) problem has become more and more popular recently. The current PU-learning approaches of the time series data suffer from low accuracy due to the lack of negative labeled time series. In this paper, we propose a novel shapelet based two-step (2STEP) PU-learning approach. In the first step, we generate shapelet features based on the positive time series, which are used to select a set of negative examples. In the second step, based on both positive and negative time series, we select the final features and build the classification model. The experimental results show that our 2STEP approach can improve the average
F
1 score on 15 datasets by 9.1% compared with the baselines, and achieves the highest
F
1 score on 10 out of 15 time series datasets. |
doi_str_mv | 10.1007/s11390-022-1320-9 |
format | Article |
fullrecord | <record><control><sourceid>wanfang_jour_proqu</sourceid><recordid>TN_cdi_wanfang_journals_jsjkxjsxb_e202306012</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A785799226</galeid><wanfj_id>jsjkxjsxb_e202306012</wanfj_id><sourcerecordid>jsjkxjsxb_e202306012</sourcerecordid><originalsourceid>FETCH-LOGICAL-c344t-e9c515cd0faf4b3a9d2e10175f7f0684f1727cfdf2373a275825b17f2720ad053</originalsourceid><addsrcrecordid>eNp1kU1LAzEQhhdRsFZ_gLcFr26dTHab5liLX1BQaHsO6e6kZm2zNVlt_femrNCTDGTC8DyTwJsk1wwGDEDcBca4hAwQM8YRMnmS9NhoCFkucnka7wBxGI_z5CKEGoALyPNeMpm96y2tqU3vdaAqne-abNbSNp3bDaUz8pZC-tYE29pvSrWr0oVb62U0qnRK2jvrVpfJmdHrQFd_vZ8sHh_mk-ds-vr0MhlPs5LneZuRLAtWlBUYbfIl17JCYsBEYYSB4Sg3TKAoTWWQC65RFCMslkwYFAi6goL3k9tu7047o91K1c2Xd_FFVYf6Y1-H_VIRAnIYAsOI33T41jefXxTaI48SGZNcFAdq0FErvSZlnWlar8tYFW1s2TgyNs7HYlQIKRGHUWCdUPomBE9Gbb3daP-jGKhDFqrLQsUs1CELJaODnRMi61bkj1_5X_oFPHSJig</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2921193752</pqid></control><display><type>article</type><title>Shapelet Based Two-Step Time Series Positive and Unlabeled Learning</title><source>Springer Nature - Complete Springer Journals</source><creator>Zhang, Han-Bo ; Wang, Peng ; Zhang, Ming-Ming ; Wang, Wei</creator><creatorcontrib>Zhang, Han-Bo ; Wang, Peng ; Zhang, Ming-Ming ; Wang, Wei</creatorcontrib><description>In the last decade, there has been significant progress in time series classification. However, in real-world industrial settings, it is expensive and difficult to obtain high-quality labeled data. Therefore, the positive and unlabeled learning (PU-learning) problem has become more and more popular recently. The current PU-learning approaches of the time series data suffer from low accuracy due to the lack of negative labeled time series. In this paper, we propose a novel shapelet based two-step (2STEP) PU-learning approach. In the first step, we generate shapelet features based on the positive time series, which are used to select a set of negative examples. In the second step, based on both positive and negative time series, we select the final features and build the classification model. The experimental results show that our 2STEP approach can improve the average
F
1 score on 15 datasets by 9.1% compared with the baselines, and achieves the highest
F
1 score on 10 out of 15 time series datasets.</description><identifier>ISSN: 1000-9000</identifier><identifier>EISSN: 1860-4749</identifier><identifier>DOI: 10.1007/s11390-022-1320-9</identifier><language>eng</language><publisher>Singapore: Springer Nature Singapore</publisher><subject>Accuracy ; Artificial Intelligence ; Classification ; Computer Science ; Data Structures and Information Theory ; Datasets ; Information Systems Applications (incl.Internet) ; Learning ; Regular Paper ; Software Engineering ; Theory of Computation ; Time series</subject><ispartof>Journal of computer science and technology, 2023-12, Vol.38 (6), p.1387-1402</ispartof><rights>Institute of Computing Technology, Chinese Academy of Sciences 2024</rights><rights>COPYRIGHT 2023 Springer</rights><rights>Institute of Computing Technology, Chinese Academy of Sciences 2024.</rights><rights>Copyright © Wanfang Data Co. Ltd. All Rights Reserved.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c344t-e9c515cd0faf4b3a9d2e10175f7f0684f1727cfdf2373a275825b17f2720ad053</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Uhttp://www.wanfangdata.com.cn/images/PeriodicalImages/jsjkxjsxb-e/jsjkxjsxb-e.jpg</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s11390-022-1320-9$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s11390-022-1320-9$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,776,780,27901,27902,41464,42533,51294</link.rule.ids></links><search><creatorcontrib>Zhang, Han-Bo</creatorcontrib><creatorcontrib>Wang, Peng</creatorcontrib><creatorcontrib>Zhang, Ming-Ming</creatorcontrib><creatorcontrib>Wang, Wei</creatorcontrib><title>Shapelet Based Two-Step Time Series Positive and Unlabeled Learning</title><title>Journal of computer science and technology</title><addtitle>J. Comput. Sci. Technol</addtitle><description>In the last decade, there has been significant progress in time series classification. However, in real-world industrial settings, it is expensive and difficult to obtain high-quality labeled data. Therefore, the positive and unlabeled learning (PU-learning) problem has become more and more popular recently. The current PU-learning approaches of the time series data suffer from low accuracy due to the lack of negative labeled time series. In this paper, we propose a novel shapelet based two-step (2STEP) PU-learning approach. In the first step, we generate shapelet features based on the positive time series, which are used to select a set of negative examples. In the second step, based on both positive and negative time series, we select the final features and build the classification model. The experimental results show that our 2STEP approach can improve the average
F
1 score on 15 datasets by 9.1% compared with the baselines, and achieves the highest
F
1 score on 10 out of 15 time series datasets.</description><subject>Accuracy</subject><subject>Artificial Intelligence</subject><subject>Classification</subject><subject>Computer Science</subject><subject>Data Structures and Information Theory</subject><subject>Datasets</subject><subject>Information Systems Applications (incl.Internet)</subject><subject>Learning</subject><subject>Regular Paper</subject><subject>Software Engineering</subject><subject>Theory of Computation</subject><subject>Time series</subject><issn>1000-9000</issn><issn>1860-4749</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>BENPR</sourceid><recordid>eNp1kU1LAzEQhhdRsFZ_gLcFr26dTHab5liLX1BQaHsO6e6kZm2zNVlt_femrNCTDGTC8DyTwJsk1wwGDEDcBca4hAwQM8YRMnmS9NhoCFkucnka7wBxGI_z5CKEGoALyPNeMpm96y2tqU3vdaAqne-abNbSNp3bDaUz8pZC-tYE29pvSrWr0oVb62U0qnRK2jvrVpfJmdHrQFd_vZ8sHh_mk-ds-vr0MhlPs5LneZuRLAtWlBUYbfIl17JCYsBEYYSB4Sg3TKAoTWWQC65RFCMslkwYFAi6goL3k9tu7047o91K1c2Xd_FFVYf6Y1-H_VIRAnIYAsOI33T41jefXxTaI48SGZNcFAdq0FErvSZlnWlar8tYFW1s2TgyNs7HYlQIKRGHUWCdUPomBE9Gbb3daP-jGKhDFqrLQsUs1CELJaODnRMi61bkj1_5X_oFPHSJig</recordid><startdate>20231201</startdate><enddate>20231201</enddate><creator>Zhang, Han-Bo</creator><creator>Wang, Peng</creator><creator>Zhang, Ming-Ming</creator><creator>Wang, Wei</creator><general>Springer Nature Singapore</general><general>Springer</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7SC</scope><scope>7WY</scope><scope>7WZ</scope><scope>7XB</scope><scope>87Z</scope><scope>8AL</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>8FL</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BEZIV</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FRNLG</scope><scope>F~G</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K60</scope><scope>K6~</scope><scope>K7-</scope><scope>L.-</scope><scope>L6V</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>M0C</scope><scope>M0N</scope><scope>M7S</scope><scope>P5Z</scope><scope>P62</scope><scope>PQBIZ</scope><scope>PQBZA</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PTHSS</scope><scope>Q9U</scope><scope>2B.</scope><scope>4A8</scope><scope>92I</scope><scope>93N</scope><scope>PSX</scope><scope>TCJ</scope></search><sort><creationdate>20231201</creationdate><title>Shapelet Based Two-Step Time Series Positive and Unlabeled Learning</title><author>Zhang, Han-Bo ; Wang, Peng ; Zhang, Ming-Ming ; Wang, Wei</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c344t-e9c515cd0faf4b3a9d2e10175f7f0684f1727cfdf2373a275825b17f2720ad053</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Accuracy</topic><topic>Artificial Intelligence</topic><topic>Classification</topic><topic>Computer Science</topic><topic>Data Structures and Information Theory</topic><topic>Datasets</topic><topic>Information Systems Applications (incl.Internet)</topic><topic>Learning</topic><topic>Regular Paper</topic><topic>Software Engineering</topic><topic>Theory of Computation</topic><topic>Time series</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Zhang, Han-Bo</creatorcontrib><creatorcontrib>Wang, Peng</creatorcontrib><creatorcontrib>Zhang, Ming-Ming</creatorcontrib><creatorcontrib>Wang, Wei</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Computer and Information Systems Abstracts</collection><collection>ABI/INFORM Collection</collection><collection>ABI/INFORM Global (PDF only)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>ABI/INFORM Global (Alumni Edition)</collection><collection>Computing Database (Alumni Edition)</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ABI/INFORM Collection (Alumni Edition)</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Business Premium Collection</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Business Premium Collection (Alumni)</collection><collection>ABI/INFORM Global (Corporate)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>ProQuest Business Collection (Alumni Edition)</collection><collection>ProQuest Business Collection</collection><collection>Computer Science Database</collection><collection>ABI/INFORM Professional Advanced</collection><collection>ProQuest Engineering Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>ABI/INFORM Global</collection><collection>Computing Database</collection><collection>Engineering Database</collection><collection>Advanced Technologies & Aerospace Database</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>One Business (ProQuest)</collection><collection>ProQuest One Business (Alumni)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>Engineering Collection</collection><collection>ProQuest Central Basic</collection><collection>Wanfang Data Journals - Hong Kong</collection><collection>WANFANG Data Centre</collection><collection>Wanfang Data Journals</collection><collection>万方数据期刊 - 香港版</collection><collection>China Online Journals (COJ)</collection><collection>China Online Journals (COJ)</collection><jtitle>Journal of computer science and technology</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Zhang, Han-Bo</au><au>Wang, Peng</au><au>Zhang, Ming-Ming</au><au>Wang, Wei</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Shapelet Based Two-Step Time Series Positive and Unlabeled Learning</atitle><jtitle>Journal of computer science and technology</jtitle><stitle>J. Comput. Sci. Technol</stitle><date>2023-12-01</date><risdate>2023</risdate><volume>38</volume><issue>6</issue><spage>1387</spage><epage>1402</epage><pages>1387-1402</pages><issn>1000-9000</issn><eissn>1860-4749</eissn><abstract>In the last decade, there has been significant progress in time series classification. However, in real-world industrial settings, it is expensive and difficult to obtain high-quality labeled data. Therefore, the positive and unlabeled learning (PU-learning) problem has become more and more popular recently. The current PU-learning approaches of the time series data suffer from low accuracy due to the lack of negative labeled time series. In this paper, we propose a novel shapelet based two-step (2STEP) PU-learning approach. In the first step, we generate shapelet features based on the positive time series, which are used to select a set of negative examples. In the second step, based on both positive and negative time series, we select the final features and build the classification model. The experimental results show that our 2STEP approach can improve the average
F
1 score on 15 datasets by 9.1% compared with the baselines, and achieves the highest
F
1 score on 10 out of 15 time series datasets.</abstract><cop>Singapore</cop><pub>Springer Nature Singapore</pub><doi>10.1007/s11390-022-1320-9</doi><tpages>16</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1000-9000 |
ispartof | Journal of computer science and technology, 2023-12, Vol.38 (6), p.1387-1402 |
issn | 1000-9000 1860-4749 |
language | eng |
recordid | cdi_wanfang_journals_jsjkxjsxb_e202306012 |
source | Springer Nature - Complete Springer Journals |
subjects | Accuracy Artificial Intelligence Classification Computer Science Data Structures and Information Theory Datasets Information Systems Applications (incl.Internet) Learning Regular Paper Software Engineering Theory of Computation Time series |
title | Shapelet Based Two-Step Time Series Positive and Unlabeled Learning |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-30T16%3A19%3A51IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-wanfang_jour_proqu&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Shapelet%20Based%20Two-Step%20Time%20Series%20Positive%20and%20Unlabeled%20Learning&rft.jtitle=Journal%20of%20computer%20science%20and%20technology&rft.au=Zhang,%20Han-Bo&rft.date=2023-12-01&rft.volume=38&rft.issue=6&rft.spage=1387&rft.epage=1402&rft.pages=1387-1402&rft.issn=1000-9000&rft.eissn=1860-4749&rft_id=info:doi/10.1007/s11390-022-1320-9&rft_dat=%3Cwanfang_jour_proqu%3Ejsjkxjsxb_e202306012%3C/wanfang_jour_proqu%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2921193752&rft_id=info:pmid/&rft_galeid=A785799226&rft_wanfj_id=jsjkxjsxb_e202306012&rfr_iscdi=true |