Shapelet Based Two-Step Time Series Positive and Unlabeled Learning

In the last decade, there has been significant progress in time series classification. However, in real-world industrial settings, it is expensive and difficult to obtain high-quality labeled data. Therefore, the positive and unlabeled learning (PU-learning) problem has become more and more popular...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of computer science and technology 2023-12, Vol.38 (6), p.1387-1402
Hauptverfasser: Zhang, Han-Bo, Wang, Peng, Zhang, Ming-Ming, Wang, Wei
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 1402
container_issue 6
container_start_page 1387
container_title Journal of computer science and technology
container_volume 38
creator Zhang, Han-Bo
Wang, Peng
Zhang, Ming-Ming
Wang, Wei
description In the last decade, there has been significant progress in time series classification. However, in real-world industrial settings, it is expensive and difficult to obtain high-quality labeled data. Therefore, the positive and unlabeled learning (PU-learning) problem has become more and more popular recently. The current PU-learning approaches of the time series data suffer from low accuracy due to the lack of negative labeled time series. In this paper, we propose a novel shapelet based two-step (2STEP) PU-learning approach. In the first step, we generate shapelet features based on the positive time series, which are used to select a set of negative examples. In the second step, based on both positive and negative time series, we select the final features and build the classification model. The experimental results show that our 2STEP approach can improve the average F 1 score on 15 datasets by 9.1% compared with the baselines, and achieves the highest F 1 score on 10 out of 15 time series datasets.
doi_str_mv 10.1007/s11390-022-1320-9
format Article
fullrecord <record><control><sourceid>wanfang_jour_proqu</sourceid><recordid>TN_cdi_wanfang_journals_jsjkxjsxb_e202306012</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A785799226</galeid><wanfj_id>jsjkxjsxb_e202306012</wanfj_id><sourcerecordid>jsjkxjsxb_e202306012</sourcerecordid><originalsourceid>FETCH-LOGICAL-c344t-e9c515cd0faf4b3a9d2e10175f7f0684f1727cfdf2373a275825b17f2720ad053</originalsourceid><addsrcrecordid>eNp1kU1LAzEQhhdRsFZ_gLcFr26dTHab5liLX1BQaHsO6e6kZm2zNVlt_femrNCTDGTC8DyTwJsk1wwGDEDcBca4hAwQM8YRMnmS9NhoCFkucnka7wBxGI_z5CKEGoALyPNeMpm96y2tqU3vdaAqne-abNbSNp3bDaUz8pZC-tYE29pvSrWr0oVb62U0qnRK2jvrVpfJmdHrQFd_vZ8sHh_mk-ds-vr0MhlPs5LneZuRLAtWlBUYbfIl17JCYsBEYYSB4Sg3TKAoTWWQC65RFCMslkwYFAi6goL3k9tu7047o91K1c2Xd_FFVYf6Y1-H_VIRAnIYAsOI33T41jefXxTaI48SGZNcFAdq0FErvSZlnWlar8tYFW1s2TgyNs7HYlQIKRGHUWCdUPomBE9Gbb3daP-jGKhDFqrLQsUs1CELJaODnRMi61bkj1_5X_oFPHSJig</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2921193752</pqid></control><display><type>article</type><title>Shapelet Based Two-Step Time Series Positive and Unlabeled Learning</title><source>Springer Nature - Complete Springer Journals</source><creator>Zhang, Han-Bo ; Wang, Peng ; Zhang, Ming-Ming ; Wang, Wei</creator><creatorcontrib>Zhang, Han-Bo ; Wang, Peng ; Zhang, Ming-Ming ; Wang, Wei</creatorcontrib><description>In the last decade, there has been significant progress in time series classification. However, in real-world industrial settings, it is expensive and difficult to obtain high-quality labeled data. Therefore, the positive and unlabeled learning (PU-learning) problem has become more and more popular recently. The current PU-learning approaches of the time series data suffer from low accuracy due to the lack of negative labeled time series. In this paper, we propose a novel shapelet based two-step (2STEP) PU-learning approach. In the first step, we generate shapelet features based on the positive time series, which are used to select a set of negative examples. In the second step, based on both positive and negative time series, we select the final features and build the classification model. The experimental results show that our 2STEP approach can improve the average F 1 score on 15 datasets by 9.1% compared with the baselines, and achieves the highest F 1 score on 10 out of 15 time series datasets.</description><identifier>ISSN: 1000-9000</identifier><identifier>EISSN: 1860-4749</identifier><identifier>DOI: 10.1007/s11390-022-1320-9</identifier><language>eng</language><publisher>Singapore: Springer Nature Singapore</publisher><subject>Accuracy ; Artificial Intelligence ; Classification ; Computer Science ; Data Structures and Information Theory ; Datasets ; Information Systems Applications (incl.Internet) ; Learning ; Regular Paper ; Software Engineering ; Theory of Computation ; Time series</subject><ispartof>Journal of computer science and technology, 2023-12, Vol.38 (6), p.1387-1402</ispartof><rights>Institute of Computing Technology, Chinese Academy of Sciences 2024</rights><rights>COPYRIGHT 2023 Springer</rights><rights>Institute of Computing Technology, Chinese Academy of Sciences 2024.</rights><rights>Copyright © Wanfang Data Co. Ltd. All Rights Reserved.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c344t-e9c515cd0faf4b3a9d2e10175f7f0684f1727cfdf2373a275825b17f2720ad053</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Uhttp://www.wanfangdata.com.cn/images/PeriodicalImages/jsjkxjsxb-e/jsjkxjsxb-e.jpg</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s11390-022-1320-9$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s11390-022-1320-9$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,776,780,27901,27902,41464,42533,51294</link.rule.ids></links><search><creatorcontrib>Zhang, Han-Bo</creatorcontrib><creatorcontrib>Wang, Peng</creatorcontrib><creatorcontrib>Zhang, Ming-Ming</creatorcontrib><creatorcontrib>Wang, Wei</creatorcontrib><title>Shapelet Based Two-Step Time Series Positive and Unlabeled Learning</title><title>Journal of computer science and technology</title><addtitle>J. Comput. Sci. Technol</addtitle><description>In the last decade, there has been significant progress in time series classification. However, in real-world industrial settings, it is expensive and difficult to obtain high-quality labeled data. Therefore, the positive and unlabeled learning (PU-learning) problem has become more and more popular recently. The current PU-learning approaches of the time series data suffer from low accuracy due to the lack of negative labeled time series. In this paper, we propose a novel shapelet based two-step (2STEP) PU-learning approach. In the first step, we generate shapelet features based on the positive time series, which are used to select a set of negative examples. In the second step, based on both positive and negative time series, we select the final features and build the classification model. The experimental results show that our 2STEP approach can improve the average F 1 score on 15 datasets by 9.1% compared with the baselines, and achieves the highest F 1 score on 10 out of 15 time series datasets.</description><subject>Accuracy</subject><subject>Artificial Intelligence</subject><subject>Classification</subject><subject>Computer Science</subject><subject>Data Structures and Information Theory</subject><subject>Datasets</subject><subject>Information Systems Applications (incl.Internet)</subject><subject>Learning</subject><subject>Regular Paper</subject><subject>Software Engineering</subject><subject>Theory of Computation</subject><subject>Time series</subject><issn>1000-9000</issn><issn>1860-4749</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>BENPR</sourceid><recordid>eNp1kU1LAzEQhhdRsFZ_gLcFr26dTHab5liLX1BQaHsO6e6kZm2zNVlt_femrNCTDGTC8DyTwJsk1wwGDEDcBca4hAwQM8YRMnmS9NhoCFkucnka7wBxGI_z5CKEGoALyPNeMpm96y2tqU3vdaAqne-abNbSNp3bDaUz8pZC-tYE29pvSrWr0oVb62U0qnRK2jvrVpfJmdHrQFd_vZ8sHh_mk-ds-vr0MhlPs5LneZuRLAtWlBUYbfIl17JCYsBEYYSB4Sg3TKAoTWWQC65RFCMslkwYFAi6goL3k9tu7047o91K1c2Xd_FFVYf6Y1-H_VIRAnIYAsOI33T41jefXxTaI48SGZNcFAdq0FErvSZlnWlar8tYFW1s2TgyNs7HYlQIKRGHUWCdUPomBE9Gbb3daP-jGKhDFqrLQsUs1CELJaODnRMi61bkj1_5X_oFPHSJig</recordid><startdate>20231201</startdate><enddate>20231201</enddate><creator>Zhang, Han-Bo</creator><creator>Wang, Peng</creator><creator>Zhang, Ming-Ming</creator><creator>Wang, Wei</creator><general>Springer Nature Singapore</general><general>Springer</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7SC</scope><scope>7WY</scope><scope>7WZ</scope><scope>7XB</scope><scope>87Z</scope><scope>8AL</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>8FL</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BEZIV</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FRNLG</scope><scope>F~G</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K60</scope><scope>K6~</scope><scope>K7-</scope><scope>L.-</scope><scope>L6V</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>M0C</scope><scope>M0N</scope><scope>M7S</scope><scope>P5Z</scope><scope>P62</scope><scope>PQBIZ</scope><scope>PQBZA</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PTHSS</scope><scope>Q9U</scope><scope>2B.</scope><scope>4A8</scope><scope>92I</scope><scope>93N</scope><scope>PSX</scope><scope>TCJ</scope></search><sort><creationdate>20231201</creationdate><title>Shapelet Based Two-Step Time Series Positive and Unlabeled Learning</title><author>Zhang, Han-Bo ; Wang, Peng ; Zhang, Ming-Ming ; Wang, Wei</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c344t-e9c515cd0faf4b3a9d2e10175f7f0684f1727cfdf2373a275825b17f2720ad053</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Accuracy</topic><topic>Artificial Intelligence</topic><topic>Classification</topic><topic>Computer Science</topic><topic>Data Structures and Information Theory</topic><topic>Datasets</topic><topic>Information Systems Applications (incl.Internet)</topic><topic>Learning</topic><topic>Regular Paper</topic><topic>Software Engineering</topic><topic>Theory of Computation</topic><topic>Time series</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Zhang, Han-Bo</creatorcontrib><creatorcontrib>Wang, Peng</creatorcontrib><creatorcontrib>Zhang, Ming-Ming</creatorcontrib><creatorcontrib>Wang, Wei</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Computer and Information Systems Abstracts</collection><collection>ABI/INFORM Collection</collection><collection>ABI/INFORM Global (PDF only)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>ABI/INFORM Global (Alumni Edition)</collection><collection>Computing Database (Alumni Edition)</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ABI/INFORM Collection (Alumni Edition)</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Business Premium Collection</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Business Premium Collection (Alumni)</collection><collection>ABI/INFORM Global (Corporate)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>ProQuest Business Collection (Alumni Edition)</collection><collection>ProQuest Business Collection</collection><collection>Computer Science Database</collection><collection>ABI/INFORM Professional Advanced</collection><collection>ProQuest Engineering Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>ABI/INFORM Global</collection><collection>Computing Database</collection><collection>Engineering Database</collection><collection>Advanced Technologies &amp; Aerospace Database</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>One Business (ProQuest)</collection><collection>ProQuest One Business (Alumni)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>Engineering Collection</collection><collection>ProQuest Central Basic</collection><collection>Wanfang Data Journals - Hong Kong</collection><collection>WANFANG Data Centre</collection><collection>Wanfang Data Journals</collection><collection>万方数据期刊 - 香港版</collection><collection>China Online Journals (COJ)</collection><collection>China Online Journals (COJ)</collection><jtitle>Journal of computer science and technology</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Zhang, Han-Bo</au><au>Wang, Peng</au><au>Zhang, Ming-Ming</au><au>Wang, Wei</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Shapelet Based Two-Step Time Series Positive and Unlabeled Learning</atitle><jtitle>Journal of computer science and technology</jtitle><stitle>J. Comput. Sci. Technol</stitle><date>2023-12-01</date><risdate>2023</risdate><volume>38</volume><issue>6</issue><spage>1387</spage><epage>1402</epage><pages>1387-1402</pages><issn>1000-9000</issn><eissn>1860-4749</eissn><abstract>In the last decade, there has been significant progress in time series classification. However, in real-world industrial settings, it is expensive and difficult to obtain high-quality labeled data. Therefore, the positive and unlabeled learning (PU-learning) problem has become more and more popular recently. The current PU-learning approaches of the time series data suffer from low accuracy due to the lack of negative labeled time series. In this paper, we propose a novel shapelet based two-step (2STEP) PU-learning approach. In the first step, we generate shapelet features based on the positive time series, which are used to select a set of negative examples. In the second step, based on both positive and negative time series, we select the final features and build the classification model. The experimental results show that our 2STEP approach can improve the average F 1 score on 15 datasets by 9.1% compared with the baselines, and achieves the highest F 1 score on 10 out of 15 time series datasets.</abstract><cop>Singapore</cop><pub>Springer Nature Singapore</pub><doi>10.1007/s11390-022-1320-9</doi><tpages>16</tpages></addata></record>
fulltext fulltext
identifier ISSN: 1000-9000
ispartof Journal of computer science and technology, 2023-12, Vol.38 (6), p.1387-1402
issn 1000-9000
1860-4749
language eng
recordid cdi_wanfang_journals_jsjkxjsxb_e202306012
source Springer Nature - Complete Springer Journals
subjects Accuracy
Artificial Intelligence
Classification
Computer Science
Data Structures and Information Theory
Datasets
Information Systems Applications (incl.Internet)
Learning
Regular Paper
Software Engineering
Theory of Computation
Time series
title Shapelet Based Two-Step Time Series Positive and Unlabeled Learning
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-30T16%3A19%3A51IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-wanfang_jour_proqu&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Shapelet%20Based%20Two-Step%20Time%20Series%20Positive%20and%20Unlabeled%20Learning&rft.jtitle=Journal%20of%20computer%20science%20and%20technology&rft.au=Zhang,%20Han-Bo&rft.date=2023-12-01&rft.volume=38&rft.issue=6&rft.spage=1387&rft.epage=1402&rft.pages=1387-1402&rft.issn=1000-9000&rft.eissn=1860-4749&rft_id=info:doi/10.1007/s11390-022-1320-9&rft_dat=%3Cwanfang_jour_proqu%3Ejsjkxjsxb_e202306012%3C/wanfang_jour_proqu%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2921193752&rft_id=info:pmid/&rft_galeid=A785799226&rft_wanfj_id=jsjkxjsxb_e202306012&rfr_iscdi=true