Tangent Images for Mitigating Spherical Distortion

In this work, we propose "tangent images," a spherical image representation that facilitates transferable and scalable $360^\circ$ computer vision. Inspired by techniques in cartography and computer graphics, we render a spherical image to a set of distortion-mitigated, locally-planar imag...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Eder, Marc, Shvets, Mykhailo, Lim, John, Frahm, Jan-Michael
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Eder, Marc
Shvets, Mykhailo
Lim, John
Frahm, Jan-Michael
description In this work, we propose "tangent images," a spherical image representation that facilitates transferable and scalable $360^\circ$ computer vision. Inspired by techniques in cartography and computer graphics, we render a spherical image to a set of distortion-mitigated, locally-planar image grids tangent to a subdivided icosahedron. By varying the resolution of these grids independently of the subdivision level, we can effectively represent high resolution spherical images while still benefiting from the low-distortion icosahedral spherical approximation. We show that training standard convolutional neural networks on tangent images compares favorably to the many specialized spherical convolutional kernels that have been developed, while also scaling efficiently to handle significantly higher spherical resolutions. Furthermore, because our approach does not require specialized kernels, we show that we can transfer networks trained on perspective images to spherical data without fine-tuning and with limited performance drop-off. Finally, we demonstrate that tangent images can be used to improve the quality of sparse feature detection on spherical images, illustrating its usefulness for traditional computer vision tasks like structure-from-motion and SLAM.
doi_str_mv 10.48550/arxiv.1912.09390
format Article
fullrecord <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_1912_09390</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1912_09390</sourcerecordid><originalsourceid>FETCH-LOGICAL-a670-b77726923a193c002a94a304aeecd8b40f2c9e89684702718d4748d860399e223</originalsourceid><addsrcrecordid>eNotzr1uwjAUQGEvDBX0ATrhF0i4vjax74hCf5CoOpA9uiROsEQS5FhV-_ZVaaezHX1CPCnIjdtuYcPxK3zmihTmQJrgQWDFY-_HJA8D936W3RTle0ih5xTGXp5uFx9Dw1e5D3OaYgrTuBKLjq-zf_zvUlQvz1X5lh0_Xg_l7phxYSE7W2uxINSsSDcAyGRYg2Hvm9adDXTYkHdUOGMBrXKtsca1rgBN5BH1Uqz_tnd0fYth4Phd_-LrO17_AHp6PWI</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Tangent Images for Mitigating Spherical Distortion</title><source>arXiv.org</source><creator>Eder, Marc ; Shvets, Mykhailo ; Lim, John ; Frahm, Jan-Michael</creator><creatorcontrib>Eder, Marc ; Shvets, Mykhailo ; Lim, John ; Frahm, Jan-Michael</creatorcontrib><description>In this work, we propose "tangent images," a spherical image representation that facilitates transferable and scalable $360^\circ$ computer vision. Inspired by techniques in cartography and computer graphics, we render a spherical image to a set of distortion-mitigated, locally-planar image grids tangent to a subdivided icosahedron. By varying the resolution of these grids independently of the subdivision level, we can effectively represent high resolution spherical images while still benefiting from the low-distortion icosahedral spherical approximation. We show that training standard convolutional neural networks on tangent images compares favorably to the many specialized spherical convolutional kernels that have been developed, while also scaling efficiently to handle significantly higher spherical resolutions. Furthermore, because our approach does not require specialized kernels, we show that we can transfer networks trained on perspective images to spherical data without fine-tuning and with limited performance drop-off. Finally, we demonstrate that tangent images can be used to improve the quality of sparse feature detection on spherical images, illustrating its usefulness for traditional computer vision tasks like structure-from-motion and SLAM.</description><identifier>DOI: 10.48550/arxiv.1912.09390</identifier><language>eng</language><subject>Computer Science - Computer Vision and Pattern Recognition</subject><creationdate>2019-12</creationdate><rights>http://creativecommons.org/licenses/by/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,777,882</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/1912.09390$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.1912.09390$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Eder, Marc</creatorcontrib><creatorcontrib>Shvets, Mykhailo</creatorcontrib><creatorcontrib>Lim, John</creatorcontrib><creatorcontrib>Frahm, Jan-Michael</creatorcontrib><title>Tangent Images for Mitigating Spherical Distortion</title><description>In this work, we propose "tangent images," a spherical image representation that facilitates transferable and scalable $360^\circ$ computer vision. Inspired by techniques in cartography and computer graphics, we render a spherical image to a set of distortion-mitigated, locally-planar image grids tangent to a subdivided icosahedron. By varying the resolution of these grids independently of the subdivision level, we can effectively represent high resolution spherical images while still benefiting from the low-distortion icosahedral spherical approximation. We show that training standard convolutional neural networks on tangent images compares favorably to the many specialized spherical convolutional kernels that have been developed, while also scaling efficiently to handle significantly higher spherical resolutions. Furthermore, because our approach does not require specialized kernels, we show that we can transfer networks trained on perspective images to spherical data without fine-tuning and with limited performance drop-off. Finally, we demonstrate that tangent images can be used to improve the quality of sparse feature detection on spherical images, illustrating its usefulness for traditional computer vision tasks like structure-from-motion and SLAM.</description><subject>Computer Science - Computer Vision and Pattern Recognition</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotzr1uwjAUQGEvDBX0ATrhF0i4vjax74hCf5CoOpA9uiROsEQS5FhV-_ZVaaezHX1CPCnIjdtuYcPxK3zmihTmQJrgQWDFY-_HJA8D936W3RTle0ih5xTGXp5uFx9Dw1e5D3OaYgrTuBKLjq-zf_zvUlQvz1X5lh0_Xg_l7phxYSE7W2uxINSsSDcAyGRYg2Hvm9adDXTYkHdUOGMBrXKtsca1rgBN5BH1Uqz_tnd0fYth4Phd_-LrO17_AHp6PWI</recordid><startdate>20191219</startdate><enddate>20191219</enddate><creator>Eder, Marc</creator><creator>Shvets, Mykhailo</creator><creator>Lim, John</creator><creator>Frahm, Jan-Michael</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20191219</creationdate><title>Tangent Images for Mitigating Spherical Distortion</title><author>Eder, Marc ; Shvets, Mykhailo ; Lim, John ; Frahm, Jan-Michael</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a670-b77726923a193c002a94a304aeecd8b40f2c9e89684702718d4748d860399e223</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Computer Science - Computer Vision and Pattern Recognition</topic><toplevel>online_resources</toplevel><creatorcontrib>Eder, Marc</creatorcontrib><creatorcontrib>Shvets, Mykhailo</creatorcontrib><creatorcontrib>Lim, John</creatorcontrib><creatorcontrib>Frahm, Jan-Michael</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Eder, Marc</au><au>Shvets, Mykhailo</au><au>Lim, John</au><au>Frahm, Jan-Michael</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Tangent Images for Mitigating Spherical Distortion</atitle><date>2019-12-19</date><risdate>2019</risdate><abstract>In this work, we propose "tangent images," a spherical image representation that facilitates transferable and scalable $360^\circ$ computer vision. Inspired by techniques in cartography and computer graphics, we render a spherical image to a set of distortion-mitigated, locally-planar image grids tangent to a subdivided icosahedron. By varying the resolution of these grids independently of the subdivision level, we can effectively represent high resolution spherical images while still benefiting from the low-distortion icosahedral spherical approximation. We show that training standard convolutional neural networks on tangent images compares favorably to the many specialized spherical convolutional kernels that have been developed, while also scaling efficiently to handle significantly higher spherical resolutions. Furthermore, because our approach does not require specialized kernels, we show that we can transfer networks trained on perspective images to spherical data without fine-tuning and with limited performance drop-off. Finally, we demonstrate that tangent images can be used to improve the quality of sparse feature detection on spherical images, illustrating its usefulness for traditional computer vision tasks like structure-from-motion and SLAM.</abstract><doi>10.48550/arxiv.1912.09390</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.1912.09390
ispartof
issn
language eng
recordid cdi_arxiv_primary_1912_09390
source arXiv.org
subjects Computer Science - Computer Vision and Pattern Recognition
title Tangent Images for Mitigating Spherical Distortion
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-20T18%3A25%3A26IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Tangent%20Images%20for%20Mitigating%20Spherical%20Distortion&rft.au=Eder,%20Marc&rft.date=2019-12-19&rft_id=info:doi/10.48550/arxiv.1912.09390&rft_dat=%3Carxiv_GOX%3E1912_09390%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true