A Scalable Real-Time Data Assimilation Framework for Predicting Turbulent Atmosphere Dynamics

The weather and climate domains are undergoing a significant transformation thanks to advances in AI-based foundation models such as FourCastNet, GraphCast, ClimaX and Pangu-Weather. While these models show considerable potential, they are not ready yet for operational use in weather forecasting or...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:arXiv.org 2024-07
Hauptverfasser: Yin, Junqi, Liang, Siming, Liu, Siyan, Bao, Feng, Chipilski, Hristo G, Lu, Dan, Zhang, Guannan
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title arXiv.org
container_volume
creator Yin, Junqi
Liang, Siming
Liu, Siyan
Bao, Feng
Chipilski, Hristo G
Lu, Dan
Zhang, Guannan
description The weather and climate domains are undergoing a significant transformation thanks to advances in AI-based foundation models such as FourCastNet, GraphCast, ClimaX and Pangu-Weather. While these models show considerable potential, they are not ready yet for operational use in weather forecasting or climate prediction. This is due to the lack of a data assimilation method as part of their workflow to enable the assimilation of incoming Earth system observations in real time. This limitation affects their effectiveness in predicting complex atmospheric phenomena such as tropical cyclones and atmospheric rivers. To overcome these obstacles, we introduce a generic real-time data assimilation framework and demonstrate its end-to-end performance on the Frontier supercomputer. This framework comprises two primary modules: an ensemble score filter (EnSF), which significantly outperforms the state-of-the-art data assimilation method, namely, the Local Ensemble Transform Kalman Filter (LETKF); and a vision transformer-based surrogate capable of real-time adaptation through the integration of observational data. The ViT surrogate can represent either physics-based models or AI-based foundation models. We demonstrate both the strong and weak scaling of our framework up to 1024 GPUs on the Exascale supercomputer, Frontier. Our results not only illustrate the framework's exceptional scalability on high-performance computing systems, but also demonstrate the importance of supercomputers in real-time data assimilation for weather and climate predictions. Even though the proposed framework is tested only on a benchmark surface quasi-geostrophic (SQG) turbulence system, it has the potential to be combined with existing AI-based foundation models, making it suitable for future operational implementations.
format Article
fullrecord <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_3082383614</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3082383614</sourcerecordid><originalsourceid>FETCH-proquest_journals_30823836143</originalsourceid><addsrcrecordid>eNqNizsKwkAUABdBUDR3eGAdWHc1pg1-sBRNK_KML7pxP7ofxNtr4QGsppiZHhsKKad5ORNiwLIQOs65KBZiPpdDdqzg0KDGsybYE-q8VoZghRGhCkEZpTEqZ2Hj0dDL-Tu0zsPO00U1Udkr1MmfkyYboYrGhceN_Pd_WzSqCWPWb1EHyn4csclmXS-3-cO7Z6IQT51L3n7VSfJSyFIW05n8r_oAuoBDgA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3082383614</pqid></control><display><type>article</type><title>A Scalable Real-Time Data Assimilation Framework for Predicting Turbulent Atmosphere Dynamics</title><source>Free E- Journals</source><creator>Yin, Junqi ; Liang, Siming ; Liu, Siyan ; Bao, Feng ; Chipilski, Hristo G ; Lu, Dan ; Zhang, Guannan</creator><creatorcontrib>Yin, Junqi ; Liang, Siming ; Liu, Siyan ; Bao, Feng ; Chipilski, Hristo G ; Lu, Dan ; Zhang, Guannan</creatorcontrib><description>The weather and climate domains are undergoing a significant transformation thanks to advances in AI-based foundation models such as FourCastNet, GraphCast, ClimaX and Pangu-Weather. While these models show considerable potential, they are not ready yet for operational use in weather forecasting or climate prediction. This is due to the lack of a data assimilation method as part of their workflow to enable the assimilation of incoming Earth system observations in real time. This limitation affects their effectiveness in predicting complex atmospheric phenomena such as tropical cyclones and atmospheric rivers. To overcome these obstacles, we introduce a generic real-time data assimilation framework and demonstrate its end-to-end performance on the Frontier supercomputer. This framework comprises two primary modules: an ensemble score filter (EnSF), which significantly outperforms the state-of-the-art data assimilation method, namely, the Local Ensemble Transform Kalman Filter (LETKF); and a vision transformer-based surrogate capable of real-time adaptation through the integration of observational data. The ViT surrogate can represent either physics-based models or AI-based foundation models. We demonstrate both the strong and weak scaling of our framework up to 1024 GPUs on the Exascale supercomputer, Frontier. Our results not only illustrate the framework's exceptional scalability on high-performance computing systems, but also demonstrate the importance of supercomputers in real-time data assimilation for weather and climate predictions. Even though the proposed framework is tested only on a benchmark surface quasi-geostrophic (SQG) turbulence system, it has the potential to be combined with existing AI-based foundation models, making it suitable for future operational implementations.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Climate models ; Cyclones ; Data assimilation ; Kalman filters ; Predictions ; Real time operation ; Supercomputers ; Turbulence ; Weather forecasting ; Workflow</subject><ispartof>arXiv.org, 2024-07</ispartof><rights>2024. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>776,780</link.rule.ids></links><search><creatorcontrib>Yin, Junqi</creatorcontrib><creatorcontrib>Liang, Siming</creatorcontrib><creatorcontrib>Liu, Siyan</creatorcontrib><creatorcontrib>Bao, Feng</creatorcontrib><creatorcontrib>Chipilski, Hristo G</creatorcontrib><creatorcontrib>Lu, Dan</creatorcontrib><creatorcontrib>Zhang, Guannan</creatorcontrib><title>A Scalable Real-Time Data Assimilation Framework for Predicting Turbulent Atmosphere Dynamics</title><title>arXiv.org</title><description>The weather and climate domains are undergoing a significant transformation thanks to advances in AI-based foundation models such as FourCastNet, GraphCast, ClimaX and Pangu-Weather. While these models show considerable potential, they are not ready yet for operational use in weather forecasting or climate prediction. This is due to the lack of a data assimilation method as part of their workflow to enable the assimilation of incoming Earth system observations in real time. This limitation affects their effectiveness in predicting complex atmospheric phenomena such as tropical cyclones and atmospheric rivers. To overcome these obstacles, we introduce a generic real-time data assimilation framework and demonstrate its end-to-end performance on the Frontier supercomputer. This framework comprises two primary modules: an ensemble score filter (EnSF), which significantly outperforms the state-of-the-art data assimilation method, namely, the Local Ensemble Transform Kalman Filter (LETKF); and a vision transformer-based surrogate capable of real-time adaptation through the integration of observational data. The ViT surrogate can represent either physics-based models or AI-based foundation models. We demonstrate both the strong and weak scaling of our framework up to 1024 GPUs on the Exascale supercomputer, Frontier. Our results not only illustrate the framework's exceptional scalability on high-performance computing systems, but also demonstrate the importance of supercomputers in real-time data assimilation for weather and climate predictions. Even though the proposed framework is tested only on a benchmark surface quasi-geostrophic (SQG) turbulence system, it has the potential to be combined with existing AI-based foundation models, making it suitable for future operational implementations.</description><subject>Climate models</subject><subject>Cyclones</subject><subject>Data assimilation</subject><subject>Kalman filters</subject><subject>Predictions</subject><subject>Real time operation</subject><subject>Supercomputers</subject><subject>Turbulence</subject><subject>Weather forecasting</subject><subject>Workflow</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><recordid>eNqNizsKwkAUABdBUDR3eGAdWHc1pg1-sBRNK_KML7pxP7ofxNtr4QGsppiZHhsKKad5ORNiwLIQOs65KBZiPpdDdqzg0KDGsybYE-q8VoZghRGhCkEZpTEqZ2Hj0dDL-Tu0zsPO00U1Udkr1MmfkyYboYrGhceN_Pd_WzSqCWPWb1EHyn4csclmXS-3-cO7Z6IQT51L3n7VSfJSyFIW05n8r_oAuoBDgA</recordid><startdate>20240716</startdate><enddate>20240716</enddate><creator>Yin, Junqi</creator><creator>Liang, Siming</creator><creator>Liu, Siyan</creator><creator>Bao, Feng</creator><creator>Chipilski, Hristo G</creator><creator>Lu, Dan</creator><creator>Zhang, Guannan</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20240716</creationdate><title>A Scalable Real-Time Data Assimilation Framework for Predicting Turbulent Atmosphere Dynamics</title><author>Yin, Junqi ; Liang, Siming ; Liu, Siyan ; Bao, Feng ; Chipilski, Hristo G ; Lu, Dan ; Zhang, Guannan</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_30823836143</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Climate models</topic><topic>Cyclones</topic><topic>Data assimilation</topic><topic>Kalman filters</topic><topic>Predictions</topic><topic>Real time operation</topic><topic>Supercomputers</topic><topic>Turbulence</topic><topic>Weather forecasting</topic><topic>Workflow</topic><toplevel>online_resources</toplevel><creatorcontrib>Yin, Junqi</creatorcontrib><creatorcontrib>Liang, Siming</creatorcontrib><creatorcontrib>Liu, Siyan</creatorcontrib><creatorcontrib>Bao, Feng</creatorcontrib><creatorcontrib>Chipilski, Hristo G</creatorcontrib><creatorcontrib>Lu, Dan</creatorcontrib><creatorcontrib>Zhang, Guannan</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Yin, Junqi</au><au>Liang, Siming</au><au>Liu, Siyan</au><au>Bao, Feng</au><au>Chipilski, Hristo G</au><au>Lu, Dan</au><au>Zhang, Guannan</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>A Scalable Real-Time Data Assimilation Framework for Predicting Turbulent Atmosphere Dynamics</atitle><jtitle>arXiv.org</jtitle><date>2024-07-16</date><risdate>2024</risdate><eissn>2331-8422</eissn><abstract>The weather and climate domains are undergoing a significant transformation thanks to advances in AI-based foundation models such as FourCastNet, GraphCast, ClimaX and Pangu-Weather. While these models show considerable potential, they are not ready yet for operational use in weather forecasting or climate prediction. This is due to the lack of a data assimilation method as part of their workflow to enable the assimilation of incoming Earth system observations in real time. This limitation affects their effectiveness in predicting complex atmospheric phenomena such as tropical cyclones and atmospheric rivers. To overcome these obstacles, we introduce a generic real-time data assimilation framework and demonstrate its end-to-end performance on the Frontier supercomputer. This framework comprises two primary modules: an ensemble score filter (EnSF), which significantly outperforms the state-of-the-art data assimilation method, namely, the Local Ensemble Transform Kalman Filter (LETKF); and a vision transformer-based surrogate capable of real-time adaptation through the integration of observational data. The ViT surrogate can represent either physics-based models or AI-based foundation models. We demonstrate both the strong and weak scaling of our framework up to 1024 GPUs on the Exascale supercomputer, Frontier. Our results not only illustrate the framework's exceptional scalability on high-performance computing systems, but also demonstrate the importance of supercomputers in real-time data assimilation for weather and climate predictions. Even though the proposed framework is tested only on a benchmark surface quasi-geostrophic (SQG) turbulence system, it has the potential to be combined with existing AI-based foundation models, making it suitable for future operational implementations.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2024-07
issn 2331-8422
language eng
recordid cdi_proquest_journals_3082383614
source Free E- Journals
subjects Climate models
Cyclones
Data assimilation
Kalman filters
Predictions
Real time operation
Supercomputers
Turbulence
Weather forecasting
Workflow
title A Scalable Real-Time Data Assimilation Framework for Predicting Turbulent Atmosphere Dynamics
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-23T12%3A36%3A42IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=A%20Scalable%20Real-Time%20Data%20Assimilation%20Framework%20for%20Predicting%20Turbulent%20Atmosphere%20Dynamics&rft.jtitle=arXiv.org&rft.au=Yin,%20Junqi&rft.date=2024-07-16&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E3082383614%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3082383614&rft_id=info:pmid/&rfr_iscdi=true