Exploring Dimensionality Reduction of SDSS Spectral Abundances

High-resolution stellar spectra offer valuable insights into atmospheric parameters and chemical compositions. However, their inherent complexity and high-dimensionality present challenges in fully utilizing the information they contain. In this study, we utilize data from the Apache Point Observato...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
1. Verfasser: Fan, Qianyu
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Fan, Qianyu
description High-resolution stellar spectra offer valuable insights into atmospheric parameters and chemical compositions. However, their inherent complexity and high-dimensionality present challenges in fully utilizing the information they contain. In this study, we utilize data from the Apache Point Observatory Galactic Evolution Experiment (APOGEE) within the Sloan Digital Sky Survey IV (SDSS-IV) to explore latent representations of chemical abundances by applying five dimensionality reduction techniques: PCA, t-SNE, UMAP, Autoencoder, and VAE. Through this exploration, we evaluate the preservation of information and compare reconstructed outputs with the original 19 chemical abundance data. Our findings reveal a performance ranking of PCA < UMAP < t-SNE < VAE < Autoencoder, through comparing their explained variance under optimized MSE. The performance of non-linear (Autoencoder and VAE) algorithms has approximately 10\% improvement compared to linear (PCA) algorithm. This difference can be referred to as the "non-linearity gap." Future work should focus on incorporating measurement errors into extension VAEs, thereby enhancing the reliability and interpretability of chemical abundance exploration in astronomical spectra.
doi_str_mv 10.48550/arxiv.2409.09227
format Article
fullrecord <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2409_09227</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2409_09227</sourcerecordid><originalsourceid>FETCH-arxiv_primary_2409_092273</originalsourceid><addsrcrecordid>eNpjYJA0NNAzsTA1NdBPLKrILNMzMjGw1DOwNDIy52Swc60oyMkvysxLV3DJzE3NK87Mz0vMySypVAhKTSlNLgFyFfLTFIJdgoMVggtSk0uKEnMUHJNK81IS85JTi3kYWNMSc4pTeaE0N4O8m2uIs4cu2Kb4gqLM3MSiyniQjfFgG40JqwAAjVE2DQ</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Exploring Dimensionality Reduction of SDSS Spectral Abundances</title><source>arXiv.org</source><creator>Fan, Qianyu</creator><creatorcontrib>Fan, Qianyu</creatorcontrib><description>High-resolution stellar spectra offer valuable insights into atmospheric parameters and chemical compositions. However, their inherent complexity and high-dimensionality present challenges in fully utilizing the information they contain. In this study, we utilize data from the Apache Point Observatory Galactic Evolution Experiment (APOGEE) within the Sloan Digital Sky Survey IV (SDSS-IV) to explore latent representations of chemical abundances by applying five dimensionality reduction techniques: PCA, t-SNE, UMAP, Autoencoder, and VAE. Through this exploration, we evaluate the preservation of information and compare reconstructed outputs with the original 19 chemical abundance data. Our findings reveal a performance ranking of PCA &lt; UMAP &lt; t-SNE &lt; VAE &lt; Autoencoder, through comparing their explained variance under optimized MSE. The performance of non-linear (Autoencoder and VAE) algorithms has approximately 10\% improvement compared to linear (PCA) algorithm. This difference can be referred to as the "non-linearity gap." Future work should focus on incorporating measurement errors into extension VAEs, thereby enhancing the reliability and interpretability of chemical abundance exploration in astronomical spectra.</description><identifier>DOI: 10.48550/arxiv.2409.09227</identifier><language>eng</language><subject>Physics - Instrumentation and Methods for Astrophysics ; Statistics - Applications</subject><creationdate>2024-09</creationdate><rights>http://creativecommons.org/publicdomain/zero/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,885</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2409.09227$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2409.09227$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Fan, Qianyu</creatorcontrib><title>Exploring Dimensionality Reduction of SDSS Spectral Abundances</title><description>High-resolution stellar spectra offer valuable insights into atmospheric parameters and chemical compositions. However, their inherent complexity and high-dimensionality present challenges in fully utilizing the information they contain. In this study, we utilize data from the Apache Point Observatory Galactic Evolution Experiment (APOGEE) within the Sloan Digital Sky Survey IV (SDSS-IV) to explore latent representations of chemical abundances by applying five dimensionality reduction techniques: PCA, t-SNE, UMAP, Autoencoder, and VAE. Through this exploration, we evaluate the preservation of information and compare reconstructed outputs with the original 19 chemical abundance data. Our findings reveal a performance ranking of PCA &lt; UMAP &lt; t-SNE &lt; VAE &lt; Autoencoder, through comparing their explained variance under optimized MSE. The performance of non-linear (Autoencoder and VAE) algorithms has approximately 10\% improvement compared to linear (PCA) algorithm. This difference can be referred to as the "non-linearity gap." Future work should focus on incorporating measurement errors into extension VAEs, thereby enhancing the reliability and interpretability of chemical abundance exploration in astronomical spectra.</description><subject>Physics - Instrumentation and Methods for Astrophysics</subject><subject>Statistics - Applications</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNpjYJA0NNAzsTA1NdBPLKrILNMzMjGw1DOwNDIy52Swc60oyMkvysxLV3DJzE3NK87Mz0vMySypVAhKTSlNLgFyFfLTFIJdgoMVggtSk0uKEnMUHJNK81IS85JTi3kYWNMSc4pTeaE0N4O8m2uIs4cu2Kb4gqLM3MSiyniQjfFgG40JqwAAjVE2DQ</recordid><startdate>20240913</startdate><enddate>20240913</enddate><creator>Fan, Qianyu</creator><scope>EPD</scope><scope>GOX</scope></search><sort><creationdate>20240913</creationdate><title>Exploring Dimensionality Reduction of SDSS Spectral Abundances</title><author>Fan, Qianyu</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-arxiv_primary_2409_092273</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Physics - Instrumentation and Methods for Astrophysics</topic><topic>Statistics - Applications</topic><toplevel>online_resources</toplevel><creatorcontrib>Fan, Qianyu</creatorcontrib><collection>arXiv Statistics</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Fan, Qianyu</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Exploring Dimensionality Reduction of SDSS Spectral Abundances</atitle><date>2024-09-13</date><risdate>2024</risdate><abstract>High-resolution stellar spectra offer valuable insights into atmospheric parameters and chemical compositions. However, their inherent complexity and high-dimensionality present challenges in fully utilizing the information they contain. In this study, we utilize data from the Apache Point Observatory Galactic Evolution Experiment (APOGEE) within the Sloan Digital Sky Survey IV (SDSS-IV) to explore latent representations of chemical abundances by applying five dimensionality reduction techniques: PCA, t-SNE, UMAP, Autoencoder, and VAE. Through this exploration, we evaluate the preservation of information and compare reconstructed outputs with the original 19 chemical abundance data. Our findings reveal a performance ranking of PCA &lt; UMAP &lt; t-SNE &lt; VAE &lt; Autoencoder, through comparing their explained variance under optimized MSE. The performance of non-linear (Autoencoder and VAE) algorithms has approximately 10\% improvement compared to linear (PCA) algorithm. This difference can be referred to as the "non-linearity gap." Future work should focus on incorporating measurement errors into extension VAEs, thereby enhancing the reliability and interpretability of chemical abundance exploration in astronomical spectra.</abstract><doi>10.48550/arxiv.2409.09227</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.2409.09227
ispartof
issn
language eng
recordid cdi_arxiv_primary_2409_09227
source arXiv.org
subjects Physics - Instrumentation and Methods for Astrophysics
Statistics - Applications
title Exploring Dimensionality Reduction of SDSS Spectral Abundances
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-04T15%3A58%3A25IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Exploring%20Dimensionality%20Reduction%20of%20SDSS%20Spectral%20Abundances&rft.au=Fan,%20Qianyu&rft.date=2024-09-13&rft_id=info:doi/10.48550/arxiv.2409.09227&rft_dat=%3Carxiv_GOX%3E2409_09227%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true