SeeFar: Satellite Agnostic Multi-Resolution Dataset for Geospatial Foundation Models

SeeFar is an evolving collection of multi-resolution satellite images from public and commercial satellites. We specifically curated this dataset for training geospatial foundation models, unconstrained by satellite type. In recent years, advances in technology have made satellite imagery more acces...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Lowman, James, Zheng, Kelly Liu, Fraser, Roydon, The, Jesse Van Griensven, Valipour, Mojtaba
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Lowman, James
Zheng, Kelly Liu
Fraser, Roydon
The, Jesse Van Griensven
Valipour, Mojtaba
description SeeFar is an evolving collection of multi-resolution satellite images from public and commercial satellites. We specifically curated this dataset for training geospatial foundation models, unconstrained by satellite type. In recent years, advances in technology have made satellite imagery more accessible than ever. More earth-observing satellites have been launched in the last five years than in the previous fifty. Modern commercial satellites now offer up to 100 times the spatial resolution of public access satellites. However, the high cost and limited historical availability of commercial satellite imagery is a barrier to the training of foundational models, impacting what images can be used during inference. The SeeFar dataset represents a step towards training models that are satellite-agnostic by combining multi-resolution commercial and public access pre-processed images. This will enable users to utilize historical data alongside higher-resolution, more expensive satellite imagery, offering greater flexibility during inference. To achieve this, we describe a process for standardizing data from diverse satellite sources, normalizing different data formats, and aligning spectral bands to enhance interoperability. The SeeFar dataset includes images at a resolution of 384x384 pixels, spanning four spectral bands (Blue, Green, Red, and Near-Infrared) and expanding spatial resolutions (starting with 30, 10, 1.5, and 1.0 meters), all in cloud-optimized GeoTIFF format. It also provides consistent and comprehensive metadata to enhance data transparency and reliability. By aggregating data from multiple sources, SeeFar makes processed and consistent satellite data accessible to a wider range of users - from researchers to policymakers - fostering competition and innovation in satellite imagery analysis. The dataset is available at \url{coastalcarbon.ai/seefar}.
doi_str_mv 10.48550/arxiv.2406.06776
format Article
fullrecord <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2406_06776</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2406_06776</sourcerecordid><originalsourceid>FETCH-LOGICAL-a676-e4cae7122867e25691c190bc95affddef77536d6be7db9d3902b7fe65e402c7d3</originalsourceid><addsrcrecordid>eNotz0FLwzAYxvFcPMj0A3gyX6A1TZtk2W1MO4WNgeu9vG3ejEBsRpKKfnuxenoufx74EfJQsbJZC8GeIH65z5I3TJZMKiVvSXdGbCFu6Bkyeu8y0u1lCim7kR5nn13xjin4Obsw0WfIkDBTGyLdY0hXyA48bcM8GViKYzDo0x25seAT3v_vinTtS7d7LQ6n_dtueyhAKllgMwKqivO1VMiF1NVYaTaMWoC1xqBVStTSyAGVGbSpNeODsigFNoyPytQr8vh3u7D6a3QfEL_7X16_8Oof2XFL3w</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>SeeFar: Satellite Agnostic Multi-Resolution Dataset for Geospatial Foundation Models</title><source>arXiv.org</source><creator>Lowman, James ; Zheng, Kelly Liu ; Fraser, Roydon ; The, Jesse Van Griensven ; Valipour, Mojtaba</creator><creatorcontrib>Lowman, James ; Zheng, Kelly Liu ; Fraser, Roydon ; The, Jesse Van Griensven ; Valipour, Mojtaba</creatorcontrib><description>SeeFar is an evolving collection of multi-resolution satellite images from public and commercial satellites. We specifically curated this dataset for training geospatial foundation models, unconstrained by satellite type. In recent years, advances in technology have made satellite imagery more accessible than ever. More earth-observing satellites have been launched in the last five years than in the previous fifty. Modern commercial satellites now offer up to 100 times the spatial resolution of public access satellites. However, the high cost and limited historical availability of commercial satellite imagery is a barrier to the training of foundational models, impacting what images can be used during inference. The SeeFar dataset represents a step towards training models that are satellite-agnostic by combining multi-resolution commercial and public access pre-processed images. This will enable users to utilize historical data alongside higher-resolution, more expensive satellite imagery, offering greater flexibility during inference. To achieve this, we describe a process for standardizing data from diverse satellite sources, normalizing different data formats, and aligning spectral bands to enhance interoperability. The SeeFar dataset includes images at a resolution of 384x384 pixels, spanning four spectral bands (Blue, Green, Red, and Near-Infrared) and expanding spatial resolutions (starting with 30, 10, 1.5, and 1.0 meters), all in cloud-optimized GeoTIFF format. It also provides consistent and comprehensive metadata to enhance data transparency and reliability. By aggregating data from multiple sources, SeeFar makes processed and consistent satellite data accessible to a wider range of users - from researchers to policymakers - fostering competition and innovation in satellite imagery analysis. The dataset is available at \url{coastalcarbon.ai/seefar}.</description><identifier>DOI: 10.48550/arxiv.2406.06776</identifier><language>eng</language><subject>Computer Science - Computer Vision and Pattern Recognition ; Computer Science - Learning</subject><creationdate>2024-06</creationdate><rights>http://creativecommons.org/licenses/by-nc-sa/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,776,881</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2406.06776$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2406.06776$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Lowman, James</creatorcontrib><creatorcontrib>Zheng, Kelly Liu</creatorcontrib><creatorcontrib>Fraser, Roydon</creatorcontrib><creatorcontrib>The, Jesse Van Griensven</creatorcontrib><creatorcontrib>Valipour, Mojtaba</creatorcontrib><title>SeeFar: Satellite Agnostic Multi-Resolution Dataset for Geospatial Foundation Models</title><description>SeeFar is an evolving collection of multi-resolution satellite images from public and commercial satellites. We specifically curated this dataset for training geospatial foundation models, unconstrained by satellite type. In recent years, advances in technology have made satellite imagery more accessible than ever. More earth-observing satellites have been launched in the last five years than in the previous fifty. Modern commercial satellites now offer up to 100 times the spatial resolution of public access satellites. However, the high cost and limited historical availability of commercial satellite imagery is a barrier to the training of foundational models, impacting what images can be used during inference. The SeeFar dataset represents a step towards training models that are satellite-agnostic by combining multi-resolution commercial and public access pre-processed images. This will enable users to utilize historical data alongside higher-resolution, more expensive satellite imagery, offering greater flexibility during inference. To achieve this, we describe a process for standardizing data from diverse satellite sources, normalizing different data formats, and aligning spectral bands to enhance interoperability. The SeeFar dataset includes images at a resolution of 384x384 pixels, spanning four spectral bands (Blue, Green, Red, and Near-Infrared) and expanding spatial resolutions (starting with 30, 10, 1.5, and 1.0 meters), all in cloud-optimized GeoTIFF format. It also provides consistent and comprehensive metadata to enhance data transparency and reliability. By aggregating data from multiple sources, SeeFar makes processed and consistent satellite data accessible to a wider range of users - from researchers to policymakers - fostering competition and innovation in satellite imagery analysis. The dataset is available at \url{coastalcarbon.ai/seefar}.</description><subject>Computer Science - Computer Vision and Pattern Recognition</subject><subject>Computer Science - Learning</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotz0FLwzAYxvFcPMj0A3gyX6A1TZtk2W1MO4WNgeu9vG3ejEBsRpKKfnuxenoufx74EfJQsbJZC8GeIH65z5I3TJZMKiVvSXdGbCFu6Bkyeu8y0u1lCim7kR5nn13xjin4Obsw0WfIkDBTGyLdY0hXyA48bcM8GViKYzDo0x25seAT3v_vinTtS7d7LQ6n_dtueyhAKllgMwKqivO1VMiF1NVYaTaMWoC1xqBVStTSyAGVGbSpNeODsigFNoyPytQr8vh3u7D6a3QfEL_7X16_8Oof2XFL3w</recordid><startdate>20240610</startdate><enddate>20240610</enddate><creator>Lowman, James</creator><creator>Zheng, Kelly Liu</creator><creator>Fraser, Roydon</creator><creator>The, Jesse Van Griensven</creator><creator>Valipour, Mojtaba</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20240610</creationdate><title>SeeFar: Satellite Agnostic Multi-Resolution Dataset for Geospatial Foundation Models</title><author>Lowman, James ; Zheng, Kelly Liu ; Fraser, Roydon ; The, Jesse Van Griensven ; Valipour, Mojtaba</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a676-e4cae7122867e25691c190bc95affddef77536d6be7db9d3902b7fe65e402c7d3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Computer Science - Computer Vision and Pattern Recognition</topic><topic>Computer Science - Learning</topic><toplevel>online_resources</toplevel><creatorcontrib>Lowman, James</creatorcontrib><creatorcontrib>Zheng, Kelly Liu</creatorcontrib><creatorcontrib>Fraser, Roydon</creatorcontrib><creatorcontrib>The, Jesse Van Griensven</creatorcontrib><creatorcontrib>Valipour, Mojtaba</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Lowman, James</au><au>Zheng, Kelly Liu</au><au>Fraser, Roydon</au><au>The, Jesse Van Griensven</au><au>Valipour, Mojtaba</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>SeeFar: Satellite Agnostic Multi-Resolution Dataset for Geospatial Foundation Models</atitle><date>2024-06-10</date><risdate>2024</risdate><abstract>SeeFar is an evolving collection of multi-resolution satellite images from public and commercial satellites. We specifically curated this dataset for training geospatial foundation models, unconstrained by satellite type. In recent years, advances in technology have made satellite imagery more accessible than ever. More earth-observing satellites have been launched in the last five years than in the previous fifty. Modern commercial satellites now offer up to 100 times the spatial resolution of public access satellites. However, the high cost and limited historical availability of commercial satellite imagery is a barrier to the training of foundational models, impacting what images can be used during inference. The SeeFar dataset represents a step towards training models that are satellite-agnostic by combining multi-resolution commercial and public access pre-processed images. This will enable users to utilize historical data alongside higher-resolution, more expensive satellite imagery, offering greater flexibility during inference. To achieve this, we describe a process for standardizing data from diverse satellite sources, normalizing different data formats, and aligning spectral bands to enhance interoperability. The SeeFar dataset includes images at a resolution of 384x384 pixels, spanning four spectral bands (Blue, Green, Red, and Near-Infrared) and expanding spatial resolutions (starting with 30, 10, 1.5, and 1.0 meters), all in cloud-optimized GeoTIFF format. It also provides consistent and comprehensive metadata to enhance data transparency and reliability. By aggregating data from multiple sources, SeeFar makes processed and consistent satellite data accessible to a wider range of users - from researchers to policymakers - fostering competition and innovation in satellite imagery analysis. The dataset is available at \url{coastalcarbon.ai/seefar}.</abstract><doi>10.48550/arxiv.2406.06776</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.2406.06776
ispartof
issn
language eng
recordid cdi_arxiv_primary_2406_06776
source arXiv.org
subjects Computer Science - Computer Vision and Pattern Recognition
Computer Science - Learning
title SeeFar: Satellite Agnostic Multi-Resolution Dataset for Geospatial Foundation Models
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-07T16%3A11%3A12IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=SeeFar:%20Satellite%20Agnostic%20Multi-Resolution%20Dataset%20for%20Geospatial%20Foundation%20Models&rft.au=Lowman,%20James&rft.date=2024-06-10&rft_id=info:doi/10.48550/arxiv.2406.06776&rft_dat=%3Carxiv_GOX%3E2406_06776%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true