Scalable Linear Causal Inference for Irregularly Sampled Time Series with Long Range Dependencies
Linear causal analysis is central to a wide range of important application spanning finance, the physical sciences, and engineering. Much of the existing literature in linear causal analysis operates in the time domain. Unfortunately, the direct application of time domain linear causal analysis to m...
Gespeichert in:
Veröffentlicht in: | arXiv.org 2016-03 |
---|---|
Hauptverfasser: | , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | arXiv.org |
container_volume | |
creator | Belletti, Francois W Sparks, Evan R Franklin, Michael J Bayen, Alexandre M Gonzalez, Joseph E |
description | Linear causal analysis is central to a wide range of important application spanning finance, the physical sciences, and engineering. Much of the existing literature in linear causal analysis operates in the time domain. Unfortunately, the direct application of time domain linear causal analysis to many real-world time series presents three critical challenges: irregular temporal sampling, long range dependencies, and scale. Moreover, real-world data is often collected at irregular time intervals across vast arrays of decentralized sensors and with long range dependencies which make naive time domain correlation estimators spurious. In this paper we present a frequency domain based estimation framework which naturally handles irregularly sampled data and long range dependencies while enabled memory and communication efficient distributed processing of time series data. By operating in the frequency domain we eliminate the need to interpolate and help mitigate the effects of long range dependencies. We implement and evaluate our new work-flow in the distributed setting using Apache Spark and demonstrate on both Monte Carlo simulations and high-frequency financial trading that we can accurately recover causal structure at scale. |
format | Article |
fullrecord | <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2078185898</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2078185898</sourcerecordid><originalsourceid>FETCH-proquest_journals_20781858983</originalsourceid><addsrcrecordid>eNqNi8EKgkAUAJcgSMp_eNBZsDVzO1tR4Cm9y0ufpqyrvVWiv89DH9BpDjOzEI4Mgp2n9lKuhGtt6_u-PEQyDANHYFqgxocmSBpDyBDjZFHDzVTEZAqCqme4MVM9aWT9gRS7QVMJWdMRpMQNWXg34xOS3tRwR1MTnGggU877LDdiWaG25P64FtvLOYuv3sD9ayI75m0_sZlVLv1I7VSojir4r_oC4xREyQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2078185898</pqid></control><display><type>article</type><title>Scalable Linear Causal Inference for Irregularly Sampled Time Series with Long Range Dependencies</title><source>Free E- Journals</source><creator>Belletti, Francois W ; Sparks, Evan R ; Franklin, Michael J ; Bayen, Alexandre M ; Gonzalez, Joseph E</creator><creatorcontrib>Belletti, Francois W ; Sparks, Evan R ; Franklin, Michael J ; Bayen, Alexandre M ; Gonzalez, Joseph E</creatorcontrib><description>Linear causal analysis is central to a wide range of important application spanning finance, the physical sciences, and engineering. Much of the existing literature in linear causal analysis operates in the time domain. Unfortunately, the direct application of time domain linear causal analysis to many real-world time series presents three critical challenges: irregular temporal sampling, long range dependencies, and scale. Moreover, real-world data is often collected at irregular time intervals across vast arrays of decentralized sensors and with long range dependencies which make naive time domain correlation estimators spurious. In this paper we present a frequency domain based estimation framework which naturally handles irregularly sampled data and long range dependencies while enabled memory and communication efficient distributed processing of time series data. By operating in the frequency domain we eliminate the need to interpolate and help mitigate the effects of long range dependencies. We implement and evaluate our new work-flow in the distributed setting using Apache Spark and demonstrate on both Monte Carlo simulations and high-frequency financial trading that we can accurately recover causal structure at scale.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Computer simulation ; Distributed memory ; Distributed processing ; Frequency domain analysis ; Physical sciences ; Sensor arrays ; Time domain analysis ; Time series ; Workflow</subject><ispartof>arXiv.org, 2016-03</ispartof><rights>2016. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>776,780</link.rule.ids></links><search><creatorcontrib>Belletti, Francois W</creatorcontrib><creatorcontrib>Sparks, Evan R</creatorcontrib><creatorcontrib>Franklin, Michael J</creatorcontrib><creatorcontrib>Bayen, Alexandre M</creatorcontrib><creatorcontrib>Gonzalez, Joseph E</creatorcontrib><title>Scalable Linear Causal Inference for Irregularly Sampled Time Series with Long Range Dependencies</title><title>arXiv.org</title><description>Linear causal analysis is central to a wide range of important application spanning finance, the physical sciences, and engineering. Much of the existing literature in linear causal analysis operates in the time domain. Unfortunately, the direct application of time domain linear causal analysis to many real-world time series presents three critical challenges: irregular temporal sampling, long range dependencies, and scale. Moreover, real-world data is often collected at irregular time intervals across vast arrays of decentralized sensors and with long range dependencies which make naive time domain correlation estimators spurious. In this paper we present a frequency domain based estimation framework which naturally handles irregularly sampled data and long range dependencies while enabled memory and communication efficient distributed processing of time series data. By operating in the frequency domain we eliminate the need to interpolate and help mitigate the effects of long range dependencies. We implement and evaluate our new work-flow in the distributed setting using Apache Spark and demonstrate on both Monte Carlo simulations and high-frequency financial trading that we can accurately recover causal structure at scale.</description><subject>Computer simulation</subject><subject>Distributed memory</subject><subject>Distributed processing</subject><subject>Frequency domain analysis</subject><subject>Physical sciences</subject><subject>Sensor arrays</subject><subject>Time domain analysis</subject><subject>Time series</subject><subject>Workflow</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2016</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><recordid>eNqNi8EKgkAUAJcgSMp_eNBZsDVzO1tR4Cm9y0ufpqyrvVWiv89DH9BpDjOzEI4Mgp2n9lKuhGtt6_u-PEQyDANHYFqgxocmSBpDyBDjZFHDzVTEZAqCqme4MVM9aWT9gRS7QVMJWdMRpMQNWXg34xOS3tRwR1MTnGggU877LDdiWaG25P64FtvLOYuv3sD9ayI75m0_sZlVLv1I7VSojir4r_oC4xREyQ</recordid><startdate>20160310</startdate><enddate>20160310</enddate><creator>Belletti, Francois W</creator><creator>Sparks, Evan R</creator><creator>Franklin, Michael J</creator><creator>Bayen, Alexandre M</creator><creator>Gonzalez, Joseph E</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20160310</creationdate><title>Scalable Linear Causal Inference for Irregularly Sampled Time Series with Long Range Dependencies</title><author>Belletti, Francois W ; Sparks, Evan R ; Franklin, Michael J ; Bayen, Alexandre M ; Gonzalez, Joseph E</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_20781858983</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2016</creationdate><topic>Computer simulation</topic><topic>Distributed memory</topic><topic>Distributed processing</topic><topic>Frequency domain analysis</topic><topic>Physical sciences</topic><topic>Sensor arrays</topic><topic>Time domain analysis</topic><topic>Time series</topic><topic>Workflow</topic><toplevel>online_resources</toplevel><creatorcontrib>Belletti, Francois W</creatorcontrib><creatorcontrib>Sparks, Evan R</creatorcontrib><creatorcontrib>Franklin, Michael J</creatorcontrib><creatorcontrib>Bayen, Alexandre M</creatorcontrib><creatorcontrib>Gonzalez, Joseph E</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection (ProQuest)</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Belletti, Francois W</au><au>Sparks, Evan R</au><au>Franklin, Michael J</au><au>Bayen, Alexandre M</au><au>Gonzalez, Joseph E</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Scalable Linear Causal Inference for Irregularly Sampled Time Series with Long Range Dependencies</atitle><jtitle>arXiv.org</jtitle><date>2016-03-10</date><risdate>2016</risdate><eissn>2331-8422</eissn><abstract>Linear causal analysis is central to a wide range of important application spanning finance, the physical sciences, and engineering. Much of the existing literature in linear causal analysis operates in the time domain. Unfortunately, the direct application of time domain linear causal analysis to many real-world time series presents three critical challenges: irregular temporal sampling, long range dependencies, and scale. Moreover, real-world data is often collected at irregular time intervals across vast arrays of decentralized sensors and with long range dependencies which make naive time domain correlation estimators spurious. In this paper we present a frequency domain based estimation framework which naturally handles irregularly sampled data and long range dependencies while enabled memory and communication efficient distributed processing of time series data. By operating in the frequency domain we eliminate the need to interpolate and help mitigate the effects of long range dependencies. We implement and evaluate our new work-flow in the distributed setting using Apache Spark and demonstrate on both Monte Carlo simulations and high-frequency financial trading that we can accurately recover causal structure at scale.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | EISSN: 2331-8422 |
ispartof | arXiv.org, 2016-03 |
issn | 2331-8422 |
language | eng |
recordid | cdi_proquest_journals_2078185898 |
source | Free E- Journals |
subjects | Computer simulation Distributed memory Distributed processing Frequency domain analysis Physical sciences Sensor arrays Time domain analysis Time series Workflow |
title | Scalable Linear Causal Inference for Irregularly Sampled Time Series with Long Range Dependencies |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-24T04%3A29%3A54IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Scalable%20Linear%20Causal%20Inference%20for%20Irregularly%20Sampled%20Time%20Series%20with%20Long%20Range%20Dependencies&rft.jtitle=arXiv.org&rft.au=Belletti,%20Francois%20W&rft.date=2016-03-10&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2078185898%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2078185898&rft_id=info:pmid/&rfr_iscdi=true |