Clustering and classification of time series using topological data analysis with applications to finance

In this paper, we propose new methods for time series classification and clustering. These methods are based on techniques of Topological Data Analysis (TDA) such as persistent homology and time delay embedding for analysing time-series data. We present a new clustering method SOM-TDA and a new clas...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Expert systems with applications 2020-12, Vol.162, p.113868, Article 113868
Hauptverfasser: Majumdar, Sourav, Laha, Arnab Kumar
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page 113868
container_title Expert systems with applications
container_volume 162
creator Majumdar, Sourav
Laha, Arnab Kumar
description In this paper, we propose new methods for time series classification and clustering. These methods are based on techniques of Topological Data Analysis (TDA) such as persistent homology and time delay embedding for analysing time-series data. We present a new clustering method SOM-TDA and a new classification method RF-TDA based on TDA. Using SOM-TDA we examine the topological similarities and dissimilarities of some well-known time-series models used in finance. We also use the RF-TDA to examine if the topological features can be used to distinguish between time series models using simulated data. The performance of RF-TDA on the classification task is compared against three other classification methods. We also consider an application of RF-TDA to financial time series classification using real-life price data of stocks belonging to different sectors. RF-TDA is seen to perform quite well in the two experiments based on real-life stock-price data. This implies that the topological features of the time series of stock prices in the different sectors are not identical and have distinctive features that can be discerned through the use of TDA. We also briefly consider multi-class classification using RF-TDA. •New methods for time series clustering (SOM-TDA) and classification (RF-TDA).•RF-TDA outperforms other methods on the classification task.•Dependence of stock price movements on sectors in NSE is revealed using RF-TDA.
doi_str_mv 10.1016/j.eswa.2020.113868
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2465476881</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S095741742030676X</els_id><sourcerecordid>2465476881</sourcerecordid><originalsourceid>FETCH-LOGICAL-c328t-76d78583630f1c32c33c42aa327af213f59f8e6f5525ba7f2383b6dc97b310b83</originalsourceid><addsrcrecordid>eNp9kE1LAzEQhoMoWKt_wFPA89Z87CYpeJHiFxS86Dmk2aRm2W7WTGrpvzdlPXsaeHmeYeZF6JaSBSVU3HcLBwezYISVgHIl1BmaUSV5JeSSn6MZWTayqqmsL9EVQEcIlYTIGQqrfg_ZpTBssRlabHsDEHywJoc44OhxDjuHoRAO8B5OXI5j7OO2MD1uTTZFNP0RAuBDyF_YjGP_50NhsQ-DGay7Rhfe9OBu_uYcfT4_faxeq_X7y9vqcV1ZzlSupGilahQXnHhaIsu5rZkxnEnjGeW-WXrlhG8a1myM9IwrvhGtXcoNp2Sj-BzdTXvHFL_3DrLu4j6VC0GzWjS1FErRQrGJsikCJOf1mMLOpKOmRJ8q1Z0-VapPleqp0iI9TJIr9_8ElzTY4MpvbUjOZt3G8J_-C6HegKk</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2465476881</pqid></control><display><type>article</type><title>Clustering and classification of time series using topological data analysis with applications to finance</title><source>Elsevier ScienceDirect Journals</source><creator>Majumdar, Sourav ; Laha, Arnab Kumar</creator><creatorcontrib>Majumdar, Sourav ; Laha, Arnab Kumar</creatorcontrib><description>In this paper, we propose new methods for time series classification and clustering. These methods are based on techniques of Topological Data Analysis (TDA) such as persistent homology and time delay embedding for analysing time-series data. We present a new clustering method SOM-TDA and a new classification method RF-TDA based on TDA. Using SOM-TDA we examine the topological similarities and dissimilarities of some well-known time-series models used in finance. We also use the RF-TDA to examine if the topological features can be used to distinguish between time series models using simulated data. The performance of RF-TDA on the classification task is compared against three other classification methods. We also consider an application of RF-TDA to financial time series classification using real-life price data of stocks belonging to different sectors. RF-TDA is seen to perform quite well in the two experiments based on real-life stock-price data. This implies that the topological features of the time series of stock prices in the different sectors are not identical and have distinctive features that can be discerned through the use of TDA. We also briefly consider multi-class classification using RF-TDA. •New methods for time series clustering (SOM-TDA) and classification (RF-TDA).•RF-TDA outperforms other methods on the classification task.•Dependence of stock price movements on sectors in NSE is revealed using RF-TDA.</description><identifier>ISSN: 0957-4174</identifier><identifier>EISSN: 1873-6793</identifier><identifier>DOI: 10.1016/j.eswa.2020.113868</identifier><language>eng</language><publisher>New York: Elsevier Ltd</publisher><subject>Classification ; Clustering ; Data analysis ; Finance ; Homology ; Persistent homology ; Random forest ; Self organizing maps ; Takens theorem ; Time delay embedding ; Time lag ; Time series ; Topology</subject><ispartof>Expert systems with applications, 2020-12, Vol.162, p.113868, Article 113868</ispartof><rights>2020 Elsevier Ltd</rights><rights>Copyright Elsevier BV Dec 30, 2020</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c328t-76d78583630f1c32c33c42aa327af213f59f8e6f5525ba7f2383b6dc97b310b83</citedby><cites>FETCH-LOGICAL-c328t-76d78583630f1c32c33c42aa327af213f59f8e6f5525ba7f2383b6dc97b310b83</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S095741742030676X$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,776,780,3537,27901,27902,65534</link.rule.ids></links><search><creatorcontrib>Majumdar, Sourav</creatorcontrib><creatorcontrib>Laha, Arnab Kumar</creatorcontrib><title>Clustering and classification of time series using topological data analysis with applications to finance</title><title>Expert systems with applications</title><description>In this paper, we propose new methods for time series classification and clustering. These methods are based on techniques of Topological Data Analysis (TDA) such as persistent homology and time delay embedding for analysing time-series data. We present a new clustering method SOM-TDA and a new classification method RF-TDA based on TDA. Using SOM-TDA we examine the topological similarities and dissimilarities of some well-known time-series models used in finance. We also use the RF-TDA to examine if the topological features can be used to distinguish between time series models using simulated data. The performance of RF-TDA on the classification task is compared against three other classification methods. We also consider an application of RF-TDA to financial time series classification using real-life price data of stocks belonging to different sectors. RF-TDA is seen to perform quite well in the two experiments based on real-life stock-price data. This implies that the topological features of the time series of stock prices in the different sectors are not identical and have distinctive features that can be discerned through the use of TDA. We also briefly consider multi-class classification using RF-TDA. •New methods for time series clustering (SOM-TDA) and classification (RF-TDA).•RF-TDA outperforms other methods on the classification task.•Dependence of stock price movements on sectors in NSE is revealed using RF-TDA.</description><subject>Classification</subject><subject>Clustering</subject><subject>Data analysis</subject><subject>Finance</subject><subject>Homology</subject><subject>Persistent homology</subject><subject>Random forest</subject><subject>Self organizing maps</subject><subject>Takens theorem</subject><subject>Time delay embedding</subject><subject>Time lag</subject><subject>Time series</subject><subject>Topology</subject><issn>0957-4174</issn><issn>1873-6793</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><recordid>eNp9kE1LAzEQhoMoWKt_wFPA89Z87CYpeJHiFxS86Dmk2aRm2W7WTGrpvzdlPXsaeHmeYeZF6JaSBSVU3HcLBwezYISVgHIl1BmaUSV5JeSSn6MZWTayqqmsL9EVQEcIlYTIGQqrfg_ZpTBssRlabHsDEHywJoc44OhxDjuHoRAO8B5OXI5j7OO2MD1uTTZFNP0RAuBDyF_YjGP_50NhsQ-DGay7Rhfe9OBu_uYcfT4_faxeq_X7y9vqcV1ZzlSupGilahQXnHhaIsu5rZkxnEnjGeW-WXrlhG8a1myM9IwrvhGtXcoNp2Sj-BzdTXvHFL_3DrLu4j6VC0GzWjS1FErRQrGJsikCJOf1mMLOpKOmRJ8q1Z0-VapPleqp0iI9TJIr9_8ElzTY4MpvbUjOZt3G8J_-C6HegKk</recordid><startdate>20201230</startdate><enddate>20201230</enddate><creator>Majumdar, Sourav</creator><creator>Laha, Arnab Kumar</creator><general>Elsevier Ltd</general><general>Elsevier BV</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20201230</creationdate><title>Clustering and classification of time series using topological data analysis with applications to finance</title><author>Majumdar, Sourav ; Laha, Arnab Kumar</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c328t-76d78583630f1c32c33c42aa327af213f59f8e6f5525ba7f2383b6dc97b310b83</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Classification</topic><topic>Clustering</topic><topic>Data analysis</topic><topic>Finance</topic><topic>Homology</topic><topic>Persistent homology</topic><topic>Random forest</topic><topic>Self organizing maps</topic><topic>Takens theorem</topic><topic>Time delay embedding</topic><topic>Time lag</topic><topic>Time series</topic><topic>Topology</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Majumdar, Sourav</creatorcontrib><creatorcontrib>Laha, Arnab Kumar</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Expert systems with applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Majumdar, Sourav</au><au>Laha, Arnab Kumar</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Clustering and classification of time series using topological data analysis with applications to finance</atitle><jtitle>Expert systems with applications</jtitle><date>2020-12-30</date><risdate>2020</risdate><volume>162</volume><spage>113868</spage><pages>113868-</pages><artnum>113868</artnum><issn>0957-4174</issn><eissn>1873-6793</eissn><abstract>In this paper, we propose new methods for time series classification and clustering. These methods are based on techniques of Topological Data Analysis (TDA) such as persistent homology and time delay embedding for analysing time-series data. We present a new clustering method SOM-TDA and a new classification method RF-TDA based on TDA. Using SOM-TDA we examine the topological similarities and dissimilarities of some well-known time-series models used in finance. We also use the RF-TDA to examine if the topological features can be used to distinguish between time series models using simulated data. The performance of RF-TDA on the classification task is compared against three other classification methods. We also consider an application of RF-TDA to financial time series classification using real-life price data of stocks belonging to different sectors. RF-TDA is seen to perform quite well in the two experiments based on real-life stock-price data. This implies that the topological features of the time series of stock prices in the different sectors are not identical and have distinctive features that can be discerned through the use of TDA. We also briefly consider multi-class classification using RF-TDA. •New methods for time series clustering (SOM-TDA) and classification (RF-TDA).•RF-TDA outperforms other methods on the classification task.•Dependence of stock price movements on sectors in NSE is revealed using RF-TDA.</abstract><cop>New York</cop><pub>Elsevier Ltd</pub><doi>10.1016/j.eswa.2020.113868</doi></addata></record>
fulltext fulltext
identifier ISSN: 0957-4174
ispartof Expert systems with applications, 2020-12, Vol.162, p.113868, Article 113868
issn 0957-4174
1873-6793
language eng
recordid cdi_proquest_journals_2465476881
source Elsevier ScienceDirect Journals
subjects Classification
Clustering
Data analysis
Finance
Homology
Persistent homology
Random forest
Self organizing maps
Takens theorem
Time delay embedding
Time lag
Time series
Topology
title Clustering and classification of time series using topological data analysis with applications to finance
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-21T21%3A55%3A58IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Clustering%20and%20classification%20of%20time%20series%20using%20topological%20data%20analysis%20with%20applications%20to%20finance&rft.jtitle=Expert%20systems%20with%20applications&rft.au=Majumdar,%20Sourav&rft.date=2020-12-30&rft.volume=162&rft.spage=113868&rft.pages=113868-&rft.artnum=113868&rft.issn=0957-4174&rft.eissn=1873-6793&rft_id=info:doi/10.1016/j.eswa.2020.113868&rft_dat=%3Cproquest_cross%3E2465476881%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2465476881&rft_id=info:pmid/&rft_els_id=S095741742030676X&rfr_iscdi=true