A Hybrid Supervised/Unsupervised Machine Learning Approach to Solar Flare Prediction

This paper introduces a novel method for flare forecasting, combining prediction accuracy with the ability to identify the most relevant predictive variables. This result is obtained by means of a two-step approach: first, a supervised regularization method for regression, namely, LASSO is applied,...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:The Astrophysical journal 2018-01, Vol.853 (1), p.90
Hauptverfasser: Benvenuto, Federico, Piana, Michele, Campi, Cristina, Massone, Anna Maria
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue 1
container_start_page 90
container_title The Astrophysical journal
container_volume 853
creator Benvenuto, Federico
Piana, Michele
Campi, Cristina
Massone, Anna Maria
description This paper introduces a novel method for flare forecasting, combining prediction accuracy with the ability to identify the most relevant predictive variables. This result is obtained by means of a two-step approach: first, a supervised regularization method for regression, namely, LASSO is applied, where a sparsity-enhancing penalty term allows the identification of the significance with which each data feature contributes to the prediction; then, an unsupervised fuzzy clustering technique for classification, namely, Fuzzy C-Means, is applied, where the regression outcome is partitioned through the minimization of a cost function and without focusing on the optimization of a specific skill score. This approach is therefore hybrid, since it combines supervised and unsupervised learning; realizes classification in an automatic, skill-score-independent way; and provides effective prediction performances even in the case of imbalanced data sets. Its prediction power is verified against NOAA Space Weather Prediction Center data, using as a test set, data in the range between 1996 August and 2010 December and as training set, data in the range between 1988 December and 1996 June. To validate the method, we computed several skill scores typically utilized in flare prediction and compared the values provided by the hybrid approach with the ones provided by several standard (non-hybrid) machine learning methods. The results showed that the hybrid approach performs classification better than all other supervised methods and with an effectiveness comparable to the one of clustering methods; but, in addition, it provides a reliable ranking of the weights with which the data properties contribute to the forecast.
doi_str_mv 10.3847/1538-4357/aaa23c
format Article
fullrecord <record><control><sourceid>proquest_O3W</sourceid><recordid>TN_cdi_proquest_journals_2365793186</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2365793186</sourcerecordid><originalsourceid>FETCH-LOGICAL-c416t-15fda01e6855b6adac706d6d5fbe494ff9723c67c85aedae9ef3d193f38858a03</originalsourceid><addsrcrecordid>eNp1UE1LAzEQDaJgrd49Bry6Nmk2X8dSbCtUFNqCt5DmQ1PqZk22Qv-9u6zUk5cZ5vHem5kHwC1GD0SUfIQpEUVJKB9prcfEnIHBCToHA4RQWTDC3y7BVc67bhxLOQDrCVwctylYuDrULn2H7OxoU-XTAJ-1-QiVg0unUxWqdzip6xRbEDYRruJeJzhri4OvydlgmhCra3Dh9T67m98-BJvZ43q6KJYv86fpZFmYErOmwNRbjbBjgtIt01YbjphllvqtK2XpveTtI4wbQbWz2knnicWSeCIEFRqRIbjrfduDvg4uN2oXD6lqV6oxYZRLggVrWahnmRRzTs6rOoVPnY4KI9Vlp7qgVBeU6rNrJfe9JMT6z_Nf-g_Ve3EU</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2365793186</pqid></control><display><type>article</type><title>A Hybrid Supervised/Unsupervised Machine Learning Approach to Solar Flare Prediction</title><source>IOP Publishing Free Content</source><creator>Benvenuto, Federico ; Piana, Michele ; Campi, Cristina ; Massone, Anna Maria</creator><creatorcontrib>Benvenuto, Federico ; Piana, Michele ; Campi, Cristina ; Massone, Anna Maria</creatorcontrib><description>This paper introduces a novel method for flare forecasting, combining prediction accuracy with the ability to identify the most relevant predictive variables. This result is obtained by means of a two-step approach: first, a supervised regularization method for regression, namely, LASSO is applied, where a sparsity-enhancing penalty term allows the identification of the significance with which each data feature contributes to the prediction; then, an unsupervised fuzzy clustering technique for classification, namely, Fuzzy C-Means, is applied, where the regression outcome is partitioned through the minimization of a cost function and without focusing on the optimization of a specific skill score. This approach is therefore hybrid, since it combines supervised and unsupervised learning; realizes classification in an automatic, skill-score-independent way; and provides effective prediction performances even in the case of imbalanced data sets. Its prediction power is verified against NOAA Space Weather Prediction Center data, using as a test set, data in the range between 1996 August and 2010 December and as training set, data in the range between 1988 December and 1996 June. To validate the method, we computed several skill scores typically utilized in flare prediction and compared the values provided by the hybrid approach with the ones provided by several standard (non-hybrid) machine learning methods. The results showed that the hybrid approach performs classification better than all other supervised methods and with an effectiveness comparable to the one of clustering methods; but, in addition, it provides a reliable ranking of the weights with which the data properties contribute to the forecast.</description><identifier>ISSN: 0004-637X</identifier><identifier>EISSN: 1538-4357</identifier><identifier>DOI: 10.3847/1538-4357/aaa23c</identifier><language>eng</language><publisher>Philadelphia: The American Astronomical Society</publisher><subject>Astrophysics ; Classification ; Clustering ; Cost function ; Machine learning ; methods: data analysis ; methods: statistical ; Optimization ; Predictions ; Regularization ; Regularization methods ; Solar flares ; Space weather ; Sun: flares ; sunspots ; Unsupervised learning ; Weather forecasting</subject><ispartof>The Astrophysical journal, 2018-01, Vol.853 (1), p.90</ispartof><rights>2018. The American Astronomical Society. All rights reserved.</rights><rights>Copyright IOP Publishing Jan 20, 2018</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c416t-15fda01e6855b6adac706d6d5fbe494ff9723c67c85aedae9ef3d193f38858a03</citedby><cites>FETCH-LOGICAL-c416t-15fda01e6855b6adac706d6d5fbe494ff9723c67c85aedae9ef3d193f38858a03</cites><orcidid>0000-0003-4966-8864 ; 0000-0003-1700-991X ; 0000-0002-4776-0256 ; 0000-0003-2105-8554</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://iopscience.iop.org/article/10.3847/1538-4357/aaa23c/pdf$$EPDF$$P50$$Giop$$H</linktopdf><link.rule.ids>314,776,780,27901,27902,38867,53842</link.rule.ids><linktorsrc>$$Uhttps://iopscience.iop.org/article/10.3847/1538-4357/aaa23c$$EView_record_in_IOP_Publishing$$FView_record_in_$$GIOP_Publishing</linktorsrc></links><search><creatorcontrib>Benvenuto, Federico</creatorcontrib><creatorcontrib>Piana, Michele</creatorcontrib><creatorcontrib>Campi, Cristina</creatorcontrib><creatorcontrib>Massone, Anna Maria</creatorcontrib><title>A Hybrid Supervised/Unsupervised Machine Learning Approach to Solar Flare Prediction</title><title>The Astrophysical journal</title><addtitle>APJ</addtitle><addtitle>Astrophys. J</addtitle><description>This paper introduces a novel method for flare forecasting, combining prediction accuracy with the ability to identify the most relevant predictive variables. This result is obtained by means of a two-step approach: first, a supervised regularization method for regression, namely, LASSO is applied, where a sparsity-enhancing penalty term allows the identification of the significance with which each data feature contributes to the prediction; then, an unsupervised fuzzy clustering technique for classification, namely, Fuzzy C-Means, is applied, where the regression outcome is partitioned through the minimization of a cost function and without focusing on the optimization of a specific skill score. This approach is therefore hybrid, since it combines supervised and unsupervised learning; realizes classification in an automatic, skill-score-independent way; and provides effective prediction performances even in the case of imbalanced data sets. Its prediction power is verified against NOAA Space Weather Prediction Center data, using as a test set, data in the range between 1996 August and 2010 December and as training set, data in the range between 1988 December and 1996 June. To validate the method, we computed several skill scores typically utilized in flare prediction and compared the values provided by the hybrid approach with the ones provided by several standard (non-hybrid) machine learning methods. The results showed that the hybrid approach performs classification better than all other supervised methods and with an effectiveness comparable to the one of clustering methods; but, in addition, it provides a reliable ranking of the weights with which the data properties contribute to the forecast.</description><subject>Astrophysics</subject><subject>Classification</subject><subject>Clustering</subject><subject>Cost function</subject><subject>Machine learning</subject><subject>methods: data analysis</subject><subject>methods: statistical</subject><subject>Optimization</subject><subject>Predictions</subject><subject>Regularization</subject><subject>Regularization methods</subject><subject>Solar flares</subject><subject>Space weather</subject><subject>Sun: flares</subject><subject>sunspots</subject><subject>Unsupervised learning</subject><subject>Weather forecasting</subject><issn>0004-637X</issn><issn>1538-4357</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><recordid>eNp1UE1LAzEQDaJgrd49Bry6Nmk2X8dSbCtUFNqCt5DmQ1PqZk22Qv-9u6zUk5cZ5vHem5kHwC1GD0SUfIQpEUVJKB9prcfEnIHBCToHA4RQWTDC3y7BVc67bhxLOQDrCVwctylYuDrULn2H7OxoU-XTAJ-1-QiVg0unUxWqdzip6xRbEDYRruJeJzhri4OvydlgmhCra3Dh9T67m98-BJvZ43q6KJYv86fpZFmYErOmwNRbjbBjgtIt01YbjphllvqtK2XpveTtI4wbQbWz2knnicWSeCIEFRqRIbjrfduDvg4uN2oXD6lqV6oxYZRLggVrWahnmRRzTs6rOoVPnY4KI9Vlp7qgVBeU6rNrJfe9JMT6z_Nf-g_Ve3EU</recordid><startdate>20180120</startdate><enddate>20180120</enddate><creator>Benvenuto, Federico</creator><creator>Piana, Michele</creator><creator>Campi, Cristina</creator><creator>Massone, Anna Maria</creator><general>The American Astronomical Society</general><general>IOP Publishing</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7TG</scope><scope>8FD</scope><scope>H8D</scope><scope>KL.</scope><scope>L7M</scope><orcidid>https://orcid.org/0000-0003-4966-8864</orcidid><orcidid>https://orcid.org/0000-0003-1700-991X</orcidid><orcidid>https://orcid.org/0000-0002-4776-0256</orcidid><orcidid>https://orcid.org/0000-0003-2105-8554</orcidid></search><sort><creationdate>20180120</creationdate><title>A Hybrid Supervised/Unsupervised Machine Learning Approach to Solar Flare Prediction</title><author>Benvenuto, Federico ; Piana, Michele ; Campi, Cristina ; Massone, Anna Maria</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c416t-15fda01e6855b6adac706d6d5fbe494ff9723c67c85aedae9ef3d193f38858a03</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Astrophysics</topic><topic>Classification</topic><topic>Clustering</topic><topic>Cost function</topic><topic>Machine learning</topic><topic>methods: data analysis</topic><topic>methods: statistical</topic><topic>Optimization</topic><topic>Predictions</topic><topic>Regularization</topic><topic>Regularization methods</topic><topic>Solar flares</topic><topic>Space weather</topic><topic>Sun: flares</topic><topic>sunspots</topic><topic>Unsupervised learning</topic><topic>Weather forecasting</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Benvenuto, Federico</creatorcontrib><creatorcontrib>Piana, Michele</creatorcontrib><creatorcontrib>Campi, Cristina</creatorcontrib><creatorcontrib>Massone, Anna Maria</creatorcontrib><collection>CrossRef</collection><collection>Meteorological &amp; Geoastrophysical Abstracts</collection><collection>Technology Research Database</collection><collection>Aerospace Database</collection><collection>Meteorological &amp; Geoastrophysical Abstracts - Academic</collection><collection>Advanced Technologies Database with Aerospace</collection><jtitle>The Astrophysical journal</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Benvenuto, Federico</au><au>Piana, Michele</au><au>Campi, Cristina</au><au>Massone, Anna Maria</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A Hybrid Supervised/Unsupervised Machine Learning Approach to Solar Flare Prediction</atitle><jtitle>The Astrophysical journal</jtitle><stitle>APJ</stitle><addtitle>Astrophys. J</addtitle><date>2018-01-20</date><risdate>2018</risdate><volume>853</volume><issue>1</issue><spage>90</spage><pages>90-</pages><issn>0004-637X</issn><eissn>1538-4357</eissn><abstract>This paper introduces a novel method for flare forecasting, combining prediction accuracy with the ability to identify the most relevant predictive variables. This result is obtained by means of a two-step approach: first, a supervised regularization method for regression, namely, LASSO is applied, where a sparsity-enhancing penalty term allows the identification of the significance with which each data feature contributes to the prediction; then, an unsupervised fuzzy clustering technique for classification, namely, Fuzzy C-Means, is applied, where the regression outcome is partitioned through the minimization of a cost function and without focusing on the optimization of a specific skill score. This approach is therefore hybrid, since it combines supervised and unsupervised learning; realizes classification in an automatic, skill-score-independent way; and provides effective prediction performances even in the case of imbalanced data sets. Its prediction power is verified against NOAA Space Weather Prediction Center data, using as a test set, data in the range between 1996 August and 2010 December and as training set, data in the range between 1988 December and 1996 June. To validate the method, we computed several skill scores typically utilized in flare prediction and compared the values provided by the hybrid approach with the ones provided by several standard (non-hybrid) machine learning methods. The results showed that the hybrid approach performs classification better than all other supervised methods and with an effectiveness comparable to the one of clustering methods; but, in addition, it provides a reliable ranking of the weights with which the data properties contribute to the forecast.</abstract><cop>Philadelphia</cop><pub>The American Astronomical Society</pub><doi>10.3847/1538-4357/aaa23c</doi><tpages>9</tpages><orcidid>https://orcid.org/0000-0003-4966-8864</orcidid><orcidid>https://orcid.org/0000-0003-1700-991X</orcidid><orcidid>https://orcid.org/0000-0002-4776-0256</orcidid><orcidid>https://orcid.org/0000-0003-2105-8554</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 0004-637X
ispartof The Astrophysical journal, 2018-01, Vol.853 (1), p.90
issn 0004-637X
1538-4357
language eng
recordid cdi_proquest_journals_2365793186
source IOP Publishing Free Content
subjects Astrophysics
Classification
Clustering
Cost function
Machine learning
methods: data analysis
methods: statistical
Optimization
Predictions
Regularization
Regularization methods
Solar flares
Space weather
Sun: flares
sunspots
Unsupervised learning
Weather forecasting
title A Hybrid Supervised/Unsupervised Machine Learning Approach to Solar Flare Prediction
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-06T05%3A06%3A21IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_O3W&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20Hybrid%20Supervised/Unsupervised%20Machine%20Learning%20Approach%20to%20Solar%20Flare%20Prediction&rft.jtitle=The%20Astrophysical%20journal&rft.au=Benvenuto,%20Federico&rft.date=2018-01-20&rft.volume=853&rft.issue=1&rft.spage=90&rft.pages=90-&rft.issn=0004-637X&rft.eissn=1538-4357&rft_id=info:doi/10.3847/1538-4357/aaa23c&rft_dat=%3Cproquest_O3W%3E2365793186%3C/proquest_O3W%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2365793186&rft_id=info:pmid/&rfr_iscdi=true