Optimizing exoplanet atmosphere retrieval using unsupervised machine-learning classification


Bibliographic details
Published in: Monthly Notices of the Royal Astronomical Society, 2020-05, Vol. 494 (3), pp. 4492-4508
Main authors: Hayes, J J C, Kerins, E, Awiphan, S, McDonald, I, Morgan, J S, Chuanraksasat, P, Komonjinda, S, Sanguansak, N, Kittara, P
Format: Article
Language: English
Description
Summary: ABSTRACT One of the principal bottlenecks to atmosphere characterization in the era of all-sky surveys is the availability of fast, autonomous, and robust atmospheric retrieval methods. We present a new approach using unsupervised machine learning to generate informed priors for retrieval of exoplanetary atmosphere parameters from transmission spectra. We use principal component analysis (PCA) to efficiently compress the information content of a library of transmission spectra forward models generated using the platon package. We then apply a k-means clustering algorithm in PCA space to segregate the library into discrete classes. We show that our classifier is almost always able to instantaneously place a previously unseen spectrum into the correct class, for low-to-moderate spectral resolutions, R, in the range R = 30–300 and noise levels up to 10 per cent of the peak-to-trough spectrum amplitude. The distribution of physical parameters for all members of the class therefore provides an informed prior for standard retrieval methods such as nested sampling. We benchmark our informed-prior approach against a standard uniform-prior nested sampler, finding that our approach is up to a factor of 2 faster, with negligible reduction in accuracy. We demonstrate the application of this method to existing and near-future observatories, and show that it is suitable for real-world application. Our general approach is not specific to transmission spectroscopy and should be more widely applicable to cases that involve the repetitive fitting of trusted high-dimensional models to large data catalogues, including beyond exoplanetary science.
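The pipeline outlined in the abstract (compress a model library with PCA, cluster it with k-means in PCA space, classify an unseen noisy spectrum, and take the parameter range of its class as an informed prior) can be illustrated with a minimal sketch. This is not the authors' code. The toy library of Gaussian absorption features stands in for platon forward models, the single `width` parameter stands in for the physical atmosphere parameters, and every name, the choice of two components and three classes, and the farthest-point k-means start are assumptions made for illustration only.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy library (assumption, NOT platon output): Gaussian absorption features
# whose width stands in for a physical atmosphere parameter.
wavelength = np.linspace(0.0, 1.0, 60)
family = rng.integers(0, 3, size=300)  # hidden family of each toy model
width = np.array([0.05, 0.15, 0.30])[family] * rng.uniform(0.95, 1.05, 300)
library = np.exp(-0.5 * ((wavelength - 0.5) / width[:, None]) ** 2)

# PCA via SVD of the mean-centred library; keep the leading components.
n_components = 2
mean_spectrum = library.mean(axis=0)
_, _, vt = np.linalg.svd(library - mean_spectrum, full_matrices=False)
components = vt[:n_components]


def project(spectra):
    """Map one or more spectra into the retained PCA space."""
    return (np.atleast_2d(spectra) - mean_spectrum) @ components.T


def assign(scores, centroids):
    """Index of the nearest centroid for each row of scores."""
    dist = np.linalg.norm(scores[:, None, :] - centroids[None, :, :], axis=2)
    return dist.argmin(axis=1)


def kmeans(scores, k, n_iter=50):
    """Plain Lloyd iterations with a deterministic farthest-point start."""
    centroids = [scores[0]]
    for _ in range(k - 1):
        d = np.min([np.linalg.norm(scores - c, axis=1) for c in centroids], axis=0)
        centroids.append(scores[d.argmax()])
    centroids = np.array(centroids)
    for _ in range(n_iter):
        labels = assign(scores, centroids)
        updated = np.array([
            scores[labels == j].mean(axis=0) if np.any(labels == j) else centroids[j]
            for j in range(k)
        ])
        if np.allclose(updated, centroids):
            break
        centroids = updated
    return centroids


scores = project(library)
centroids = kmeans(scores, k=3)
labels = assign(scores, centroids)


def informed_prior(spectrum):
    """Classify a spectrum; return its class and that class's width bounds."""
    cls = int(assign(project(spectrum), centroids)[0])
    members = width[labels == cls]
    return cls, members.min(), members.max()


# Previously unseen spectrum with noise at 10 per cent of its amplitude.
true_width = 0.15
clean = np.exp(-0.5 * ((wavelength - 0.5) / true_width) ** 2)
noisy = clean + rng.normal(0.0, 0.10 * np.ptp(clean), wavelength.size)
cls, lo, hi = informed_prior(noisy)
print(f"class {cls}: width prior [{lo:.3f}, {hi:.3f}], "
      f"library range [{width.min():.3f}, {width.max():.3f}]")
```

In this sketch the informed prior is just the minimum and maximum of the class members' parameter values; the abstract speaks of the full distribution of class parameters, which a real retrieval could pass to a nested sampler in place of a uniform prior over the whole library range.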
ISSN: 0035-8711, 1365-2966
DOI: 10.1093/mnras/staa978