A Hybrid Supervised/Unsupervised Machine Learning Approach to Solar Flare Prediction
This paper introduces a novel method for flare forecasting, combining prediction accuracy with the ability to identify the most relevant predictive variables. This result is obtained by means of a two-step approach: first, a supervised regularization method for regression, namely, LASSO is applied,...
Published in: | The Astrophysical journal 2018-01, Vol.853 (1), p.90 |
---|---|
Main authors: | Benvenuto, Federico; Piana, Michele; Campi, Cristina; Massone, Anna Maria |
Format: | Article |
Language: | English |
Online access: | Order full text |
container_end_page | |
---|---|
container_issue | 1 |
container_start_page | 90 |
container_title | The Astrophysical journal |
container_volume | 853 |
creator | Benvenuto, Federico; Piana, Michele; Campi, Cristina; Massone, Anna Maria |
description | This paper introduces a novel method for flare forecasting that combines prediction accuracy with the ability to identify the most relevant predictive variables. The method is a two-step approach. First, a supervised regularization method for regression, LASSO, is applied; its sparsity-enhancing penalty term allows identification of the significance with which each data feature contributes to the prediction. Then, an unsupervised fuzzy clustering technique for classification, Fuzzy C-Means, is applied; it partitions the regression outcome by minimizing a cost function, without optimizing any specific skill score. The approach is therefore hybrid, since it combines supervised and unsupervised learning; it realizes classification in an automatic, skill-score-independent way; and it provides effective prediction performance even for imbalanced data sets. Its prediction power is verified against NOAA Space Weather Prediction Center data, using data from 1988 December to 1996 June as the training set and data from 1996 August to 2010 December as the test set. To validate the method, we computed several skill scores typically used in flare prediction and compared the values provided by the hybrid approach with those provided by several standard (non-hybrid) machine learning methods. The results showed that the hybrid approach performs classification better than all other supervised methods and with an effectiveness comparable to that of clustering methods; in addition, it provides a reliable ranking of the weights with which the data properties contribute to the forecast. |
doi_str_mv | 10.3847/1538-4357/aaa23c |
format | Article |
addtitle | APJ; Astrophys. J |
collection | CrossRef; Meteorological & Geoastrophysical Abstracts; Technology Research Database; Aerospace Database; Meteorological & Geoastrophysical Abstracts - Academic; Advanced Technologies Database with Aerospace |
date | 2018-01-20 |
eissn | 1538-4357 |
linktopdf | https://iopscience.iop.org/article/10.3847/1538-4357/aaa23c/pdf |
linktorsrc | https://iopscience.iop.org/article/10.3847/1538-4357/aaa23c |
oa | free_for_read |
orcid | https://orcid.org/0000-0003-4966-8864; https://orcid.org/0000-0003-1700-991X; https://orcid.org/0000-0002-4776-0256; https://orcid.org/0000-0003-2105-8554 |
publisher | Philadelphia: The American Astronomical Society |
rights | 2018. The American Astronomical Society. All rights reserved. Copyright IOP Publishing Jan 20, 2018 |
toplevel | peer_reviewed; online_resources |
tpages | 9 |
fulltext | fulltext_linktorsrc |
identifier | ISSN: 0004-637X |
ispartof | The Astrophysical journal, 2018-01, Vol.853 (1), p.90 |
issn | 0004-637X; 1538-4357 |
language | eng |
recordid | cdi_proquest_journals_2365793186 |
source | IOP Publishing Free Content |
subjects | Astrophysics; Classification; Clustering; Cost function; Machine learning; methods: data analysis; methods: statistical; Optimization; Predictions; Regularization; Regularization methods; Solar flares; Space weather; Sun: flares; sunspots; Unsupervised learning; Weather forecasting |
title | A Hybrid Supervised/Unsupervised Machine Learning Approach to Solar Flare Prediction |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-06T05%3A06%3A21IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_O3W&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20Hybrid%20Supervised/Unsupervised%20Machine%20Learning%20Approach%20to%20Solar%20Flare%20Prediction&rft.jtitle=The%20Astrophysical%20journal&rft.au=Benvenuto,%20Federico&rft.date=2018-01-20&rft.volume=853&rft.issue=1&rft.spage=90&rft.pages=90-&rft.issn=0004-637X&rft.eissn=1538-4357&rft_id=info:doi/10.3847/1538-4357/aaa23c&rft_dat=%3Cproquest_O3W%3E2365793186%3C/proquest_O3W%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2365793186&rft_id=info:pmid/&rfr_iscdi=true |
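
The two-step approach summarized in the description (sparse regression with LASSO, then Fuzzy C-Means clustering of the regression outcome) can be illustrated with a minimal sketch. This is not the authors' implementation: the synthetic data, the function names (`lasso_fit`, `fuzzy_c_means_1d`, `make_data`), the penalty value `lam=0.03`, the fuzzifier `m=2.0`, and the choice to cluster the test-set regression scores directly are all assumptions made for illustration. The record does not say how the paper configures either step, and no NOAA data is used here.

```python
import random

def soft_threshold(z, t):
    """Shrink z toward zero by t (the proximal map of the L1 penalty)."""
    return z - t if z > t else z + t if z < -t else 0.0

def lasso_fit(X, y, lam, n_iter=200):
    """Cyclic coordinate descent for (1/2n)*||y - b - Xw||^2 + lam*||w||_1."""
    n, p = len(X), len(X[0])
    x_mean = [sum(row[j] for row in X) / n for j in range(p)]
    y_mean = sum(y) / n
    # Centre the data so the unpenalised intercept b drops out of the updates.
    cols = [[row[j] - x_mean[j] for row in X] for j in range(p)]
    resid = [yi - y_mean for yi in y]
    w = [0.0] * p
    for _ in range(n_iter):
        for j, col in enumerate(cols):
            norm = sum(c * c for c in col) / n
            if norm == 0.0:
                continue  # a constant feature carries no information
            # Correlation of feature j with the residual that excludes feature j.
            rho = sum(c * (r + c * w[j]) for c, r in zip(col, resid)) / n
            new_wj = soft_threshold(rho, lam) / norm
            delta = new_wj - w[j]
            resid = [r - c * delta for c, r in zip(col, resid)]
            w[j] = new_wj
    intercept = y_mean - sum(wj * mj for wj, mj in zip(w, x_mean))
    return w, intercept

def predict(X, w, intercept):
    """Linear regression outcome for each row of X."""
    return [intercept + sum(wj * xj for wj, xj in zip(w, row)) for row in X]

def fcm_memberships(values, centers, m):
    """Membership u_ik proportional to d_ik^(-2/(m-1)), normalised per point."""
    out = []
    for v in values:
        inv = [max(abs(v - c), 1e-12) ** (-2.0 / (m - 1.0)) for c in centers]
        total = sum(inv)
        out.append([x / total for x in inv])
    return out

def fuzzy_c_means_1d(values, n_clusters=2, m=2.0, n_iter=100):
    """Fuzzy C-Means on scalars; assumes n_clusters >= 2."""
    lo, hi = min(values), max(values)
    # Deterministic start: spread the centroids evenly over the data range.
    centers = [lo + (hi - lo) * k / (n_clusters - 1) for k in range(n_clusters)]
    for _ in range(n_iter):
        u = fcm_memberships(values, centers, m)
        centers = [
            sum(ui[k] ** m * v for ui, v in zip(u, values))
            / sum(ui[k] ** m for ui in u)
            for k in range(n_clusters)
        ]
    return centers, fcm_memberships(values, centers, m)

def make_data(n, rng):
    """Synthetic stand-in: features 0 and 1 drive the label, 2 and 3 are noise."""
    X = [[rng.random() for _ in range(4)] for _ in range(n)]
    y = [1.0 if row[0] + row[1] > 1.0 else 0.0 for row in X]
    return X, y

rng = random.Random(0)
X_train, y_train = make_data(200, rng)
X_test, y_test = make_data(100, rng)

# Step 1 (supervised): sparse regression; |weight| ranks the features.
weights, intercept = lasso_fit(X_train, y_train, lam=0.03)
# Step 2 (unsupervised): cluster the regression outcome into two groups.
scores = predict(X_test, weights, intercept)
centers, memberships = fuzzy_c_means_1d(scores)
flare_cluster = centers.index(max(centers))  # higher centroid = "flare"
labels = [1.0 if u.index(max(u)) == flare_cluster else 0.0 for u in memberships]
accuracy = sum(a == b for a, b in zip(labels, y_test)) / len(y_test)
print("weights:", [round(wj, 3) for wj in weights])
print("accuracy:", round(accuracy, 3))
```

In this sketch the class boundary comes from the clustering cost function alone, so no threshold is tuned against a particular skill score, which mirrors the skill-score-independent classification the description mentions. The magnitudes in `weights` give the feature ranking; on the synthetic data the two noise features should receive weights at or near zero.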