Painting-to-3D Model Alignment via Discriminative Visual Elements

This article describes a technique that can reliably align arbitrary 2D depictions of an architectural site, including drawings, paintings, and historical photographs, with a 3D model of the site. This is a tremendously difficult task, as the appearance and scene structure in the 2D depictions can b...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:ACM transactions on graphics 2014-03, Vol.33 (2), p.1-14
Hauptverfasser: AUBRY, Mathieu, RUSSELL, Bryan C, SIVIC, Josef
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 14
container_issue 2
container_start_page 1
container_title ACM transactions on graphics
container_volume 33
creator AUBRY, Mathieu
RUSSELL, Bryan C
SIVIC, Josef
description This article describes a technique that can reliably align arbitrary 2D depictions of an architectural site, including drawings, paintings, and historical photographs, with a 3D model of the site. This is a tremendously difficult task, as the appearance and scene structure in the 2D depictions can be very different from the appearance and geometry of the 3D model, for example, due to the specific rendering style, drawing error, age, lighting, or change of seasons. In addition, we face a hard search problem: the number of possible alignments of the painting to a large 3D model, such as a partial reconstruction of a city, is huge. To address these issues, we develop a new compact representation of complex 3D scenes. The 3D model of the scene is represented by a small set of discriminative visual elements that are automatically learned from rendered views. Similar to object detection, the set of visual elements, as well as the weights of individual features for each element, are learned in a discriminative fashion. We show that the learned visual elements are reliably matched in 2D depictions of the scene despite large variations in rendering style (e.g., watercolor, sketch, historical photograph) and structural changes (e.g., missing scene parts, large occluders) of the scene. We demonstrate an application of the proposed approach to automatic rephotography to find an approximate viewpoint of historical paintings and photographs with respect to a 3D model of the site. The proposed alignment procedure is validated via a human user study on a new database of paintings and sketches spanning several sites. The results demonstrate that our algorithm produces significantly better alignments than several baseline methods.
doi_str_mv 10.1145/2591009
format Article
fullrecord <record><control><sourceid>proquest_hal_p</sourceid><recordid>TN_cdi_hal_primary_oai_HAL_hal_00863615v3</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1793241282</sourcerecordid><originalsourceid>FETCH-LOGICAL-c421t-cf5526fe555dba3d01f5120af71f50fabe4bc0248b22e92eaffc5f9b406e81933</originalsourceid><addsrcrecordid>eNo90E1Lw0AQBuBFFKxV_Au5iHqIzn4mOYa2WqGiB_W6TLa7dWWb1Gxa8N-b0NLTDMPDy8wQck3hgVIhH5ksKEBxQkZUyizNuMpPyQgyDilwoOfkIsYfAFBCqBEp39HXna9XadekfJq8NksbkjL4Vb22dZfsPCZTH03r177Gzu9s8uXjFkMyC3YQ8ZKcOQzRXh3qmHw-zT4m83Tx9vwyKRepEYx2qXFSMuWslHJZIV8CdZIyQJf1DTisrKgMMJFXjNmCWXTOSFdUApTNacH5mNzvc78x6E2_D7Z_ukGv5-VCDzOAXHFF5W6wd3u7aZvfrY2dXvc32BCwts02apoVnAnKctbT2z01bRNja90xm4IeHqoPD-3lzSEUo8HgWqyNj0fOcgmKqYL_A1Focow</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1793241282</pqid></control><display><type>article</type><title>Painting-to-3D Model Alignment via Discriminative Visual Elements</title><source>ACM Digital Library</source><creator>AUBRY, Mathieu ; RUSSELL, Bryan C ; SIVIC, Josef</creator><creatorcontrib>AUBRY, Mathieu ; RUSSELL, Bryan C ; SIVIC, Josef</creatorcontrib><description>This article describes a technique that can reliably align arbitrary 2D depictions of an architectural site, including drawings, paintings, and historical photographs, with a 3D model of the site. This is a tremendously difficult task, as the appearance and scene structure in the 2D depictions can be very different from the appearance and geometry of the 3D model, for example, due to the specific rendering style, drawing error, age, lighting, or change of seasons. In addition, we face a hard search problem: the number of possible alignments of the painting to a large 3D model, such as a partial reconstruction of a city, is huge. To address these issues, we develop a new compact representation of complex 3D scenes. The 3D model of the scene is represented by a small set of discriminative visual elements that are automatically learned from rendered views. Similar to object detection, the set of visual elements, as well as the weights of individual features for each element, are learned in a discriminative fashion. We show that the learned visual elements are reliably matched in 2D depictions of the scene despite large variations in rendering style (e.g., watercolor, sketch, historical photograph) and structural changes (e.g., missing scene parts, large occluders) of the scene. We demonstrate an application of the proposed approach to automatic rephotography to find an approximate viewpoint of historical paintings and photographs with respect to a 3D model of the site. The proposed alignment procedure is validated via a human user study on a new database of paintings and sketches spanning several sites. The results demonstrate that our algorithm produces significantly better alignments than several baseline methods.</description><identifier>ISSN: 0730-0301</identifier><identifier>EISSN: 1557-7368</identifier><identifier>DOI: 10.1145/2591009</identifier><language>eng</language><publisher>New York, NY: Association for Computing Machinery</publisher><subject>Alignment ; Applied sciences ; Artificial intelligence ; Computer Science ; Computer science; control theory; systems ; Computer Vision and Pattern Recognition ; Exact sciences and technology ; Historic ; Mathematical models ; Pattern recognition. Digital image processing. Computational geometry ; Three dimensional models ; Two dimensional ; Visual</subject><ispartof>ACM transactions on graphics, 2014-03, Vol.33 (2), p.1-14</ispartof><rights>2015 INIST-CNRS</rights><rights>Distributed under a Creative Commons Attribution 4.0 International License</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c421t-cf5526fe555dba3d01f5120af71f50fabe4bc0248b22e92eaffc5f9b406e81933</citedby><cites>FETCH-LOGICAL-c421t-cf5526fe555dba3d01f5120af71f50fabe4bc0248b22e92eaffc5f9b406e81933</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>230,314,776,780,881,27901,27902</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&amp;idt=28506269$$DView record in Pascal Francis$$Hfree_for_read</backlink><backlink>$$Uhttps://inria.hal.science/hal-00863615$$DView record in HAL$$Hfree_for_read</backlink></links><search><creatorcontrib>AUBRY, Mathieu</creatorcontrib><creatorcontrib>RUSSELL, Bryan C</creatorcontrib><creatorcontrib>SIVIC, Josef</creatorcontrib><title>Painting-to-3D Model Alignment via Discriminative Visual Elements</title><title>ACM transactions on graphics</title><description>This article describes a technique that can reliably align arbitrary 2D depictions of an architectural site, including drawings, paintings, and historical photographs, with a 3D model of the site. This is a tremendously difficult task, as the appearance and scene structure in the 2D depictions can be very different from the appearance and geometry of the 3D model, for example, due to the specific rendering style, drawing error, age, lighting, or change of seasons. In addition, we face a hard search problem: the number of possible alignments of the painting to a large 3D model, such as a partial reconstruction of a city, is huge. To address these issues, we develop a new compact representation of complex 3D scenes. The 3D model of the scene is represented by a small set of discriminative visual elements that are automatically learned from rendered views. Similar to object detection, the set of visual elements, as well as the weights of individual features for each element, are learned in a discriminative fashion. We show that the learned visual elements are reliably matched in 2D depictions of the scene despite large variations in rendering style (e.g., watercolor, sketch, historical photograph) and structural changes (e.g., missing scene parts, large occluders) of the scene. We demonstrate an application of the proposed approach to automatic rephotography to find an approximate viewpoint of historical paintings and photographs with respect to a 3D model of the site. The proposed alignment procedure is validated via a human user study on a new database of paintings and sketches spanning several sites. The results demonstrate that our algorithm produces significantly better alignments than several baseline methods.</description><subject>Alignment</subject><subject>Applied sciences</subject><subject>Artificial intelligence</subject><subject>Computer Science</subject><subject>Computer science; control theory; systems</subject><subject>Computer Vision and Pattern Recognition</subject><subject>Exact sciences and technology</subject><subject>Historic</subject><subject>Mathematical models</subject><subject>Pattern recognition. Digital image processing. Computational geometry</subject><subject>Three dimensional models</subject><subject>Two dimensional</subject><subject>Visual</subject><issn>0730-0301</issn><issn>1557-7368</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2014</creationdate><recordtype>article</recordtype><recordid>eNo90E1Lw0AQBuBFFKxV_Au5iHqIzn4mOYa2WqGiB_W6TLa7dWWb1Gxa8N-b0NLTDMPDy8wQck3hgVIhH5ksKEBxQkZUyizNuMpPyQgyDilwoOfkIsYfAFBCqBEp39HXna9XadekfJq8NksbkjL4Vb22dZfsPCZTH03r177Gzu9s8uXjFkMyC3YQ8ZKcOQzRXh3qmHw-zT4m83Tx9vwyKRepEYx2qXFSMuWslHJZIV8CdZIyQJf1DTisrKgMMJFXjNmCWXTOSFdUApTNacH5mNzvc78x6E2_D7Z_ukGv5-VCDzOAXHFF5W6wd3u7aZvfrY2dXvc32BCwts02apoVnAnKctbT2z01bRNja90xm4IeHqoPD-3lzSEUo8HgWqyNj0fOcgmKqYL_A1Focow</recordid><startdate>20140301</startdate><enddate>20140301</enddate><creator>AUBRY, Mathieu</creator><creator>RUSSELL, Bryan C</creator><creator>SIVIC, Josef</creator><general>Association for Computing Machinery</general><scope>IQODW</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>1XC</scope><scope>VOOES</scope></search><sort><creationdate>20140301</creationdate><title>Painting-to-3D Model Alignment via Discriminative Visual Elements</title><author>AUBRY, Mathieu ; RUSSELL, Bryan C ; SIVIC, Josef</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c421t-cf5526fe555dba3d01f5120af71f50fabe4bc0248b22e92eaffc5f9b406e81933</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2014</creationdate><topic>Alignment</topic><topic>Applied sciences</topic><topic>Artificial intelligence</topic><topic>Computer Science</topic><topic>Computer science; control theory; systems</topic><topic>Computer Vision and Pattern Recognition</topic><topic>Exact sciences and technology</topic><topic>Historic</topic><topic>Mathematical models</topic><topic>Pattern recognition. Digital image processing. Computational geometry</topic><topic>Three dimensional models</topic><topic>Two dimensional</topic><topic>Visual</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>AUBRY, Mathieu</creatorcontrib><creatorcontrib>RUSSELL, Bryan C</creatorcontrib><creatorcontrib>SIVIC, Josef</creatorcontrib><collection>Pascal-Francis</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Hyper Article en Ligne (HAL)</collection><collection>Hyper Article en Ligne (HAL) (Open Access)</collection><jtitle>ACM transactions on graphics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>AUBRY, Mathieu</au><au>RUSSELL, Bryan C</au><au>SIVIC, Josef</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Painting-to-3D Model Alignment via Discriminative Visual Elements</atitle><jtitle>ACM transactions on graphics</jtitle><date>2014-03-01</date><risdate>2014</risdate><volume>33</volume><issue>2</issue><spage>1</spage><epage>14</epage><pages>1-14</pages><issn>0730-0301</issn><eissn>1557-7368</eissn><abstract>This article describes a technique that can reliably align arbitrary 2D depictions of an architectural site, including drawings, paintings, and historical photographs, with a 3D model of the site. This is a tremendously difficult task, as the appearance and scene structure in the 2D depictions can be very different from the appearance and geometry of the 3D model, for example, due to the specific rendering style, drawing error, age, lighting, or change of seasons. In addition, we face a hard search problem: the number of possible alignments of the painting to a large 3D model, such as a partial reconstruction of a city, is huge. To address these issues, we develop a new compact representation of complex 3D scenes. The 3D model of the scene is represented by a small set of discriminative visual elements that are automatically learned from rendered views. Similar to object detection, the set of visual elements, as well as the weights of individual features for each element, are learned in a discriminative fashion. We show that the learned visual elements are reliably matched in 2D depictions of the scene despite large variations in rendering style (e.g., watercolor, sketch, historical photograph) and structural changes (e.g., missing scene parts, large occluders) of the scene. We demonstrate an application of the proposed approach to automatic rephotography to find an approximate viewpoint of historical paintings and photographs with respect to a 3D model of the site. The proposed alignment procedure is validated via a human user study on a new database of paintings and sketches spanning several sites. The results demonstrate that our algorithm produces significantly better alignments than several baseline methods.</abstract><cop>New York, NY</cop><pub>Association for Computing Machinery</pub><doi>10.1145/2591009</doi><tpages>14</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0730-0301
ispartof ACM transactions on graphics, 2014-03, Vol.33 (2), p.1-14
issn 0730-0301
1557-7368
language eng
recordid cdi_hal_primary_oai_HAL_hal_00863615v3
source ACM Digital Library
subjects Alignment
Applied sciences
Artificial intelligence
Computer Science
Computer science
control theory
systems
Computer Vision and Pattern Recognition
Exact sciences and technology
Historic
Mathematical models
Pattern recognition. Digital image processing. Computational geometry
Three dimensional models
Two dimensional
Visual
title Painting-to-3D Model Alignment via Discriminative Visual Elements
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-20T19%3A55%3A08IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_hal_p&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Painting-to-3D%20Model%20Alignment%20via%20Discriminative%20Visual%20Elements&rft.jtitle=ACM%20transactions%20on%20graphics&rft.au=AUBRY,%20Mathieu&rft.date=2014-03-01&rft.volume=33&rft.issue=2&rft.spage=1&rft.epage=14&rft.pages=1-14&rft.issn=0730-0301&rft.eissn=1557-7368&rft_id=info:doi/10.1145/2591009&rft_dat=%3Cproquest_hal_p%3E1793241282%3C/proquest_hal_p%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1793241282&rft_id=info:pmid/&rfr_iscdi=true