Multi-NeuS: 3D Head Portraits from Single Image with Neural Implicit Functions

We present an approach for the reconstruction of textured 3D meshes of human heads from one or few views. Since such few-shot reconstruction is underconstrained, it requires prior knowledge which is hard to impose on traditional 3D reconstruction algorithms. In this work, we rely on the recently int...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:arXiv.org 2023-09
Hauptverfasser: Burkov, Egor, Rakhimov, Ruslan, Safin, Aleksandr, Burnaev, Evgeny, Lempitsky, Victor
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title arXiv.org
container_volume
creator Burkov, Egor
Rakhimov, Ruslan
Safin, Aleksandr
Burnaev, Evgeny
Lempitsky, Victor
description We present an approach for the reconstruction of textured 3D meshes of human heads from one or few views. Since such few-shot reconstruction is underconstrained, it requires prior knowledge which is hard to impose on traditional 3D reconstruction algorithms. In this work, we rely on the recently introduced 3D representation \(\unicode{x2013}\) neural implicit functions \(\unicode{x2013}\) which, being based on neural networks, allows to naturally learn priors about human heads from data, and is directly convertible to textured mesh. Namely, we extend NeuS, a state-of-the-art neural implicit function formulation, to represent multiple objects of a class (human heads in our case) simultaneously. The underlying neural net architecture is designed to learn the commonalities among these objects and to generalize to unseen ones. Our model is trained on just a hundred smartphone videos and does not require any scanned 3D data. Afterwards, the model can fit novel heads in the few-shot or one-shot modes with good results.
doi_str_mv 10.48550/arxiv.2209.04436
format Article
fullrecord <record><control><sourceid>proquest_arxiv</sourceid><recordid>TN_cdi_arxiv_primary_2209_04436</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2713090542</sourcerecordid><originalsourceid>FETCH-LOGICAL-a956-7e22c0ee8ef451bb98fd42d21332c258f6b83f8380131a6bb4b870eec4032f0c3</originalsourceid><addsrcrecordid>eNotj11LwzAYhYMgOOZ-gFcGvO5M3jdpU-9kug-YU9juS9omM6MfM039-PfWzasDh4fDeQi54WwqlJTsXvtv9zkFYOmUCYHxBRkBIo-UALgik647MMYgTkBKHJHNS18FF21Mv32g-ESXRpf0rfXBaxc6an1b061r9pWhq1rvDf1y4Z0OuNfV0BwrV7hA531TBNc23TW5tLrqzOQ_x2Q3f97NltH6dbGaPa4jnco4SgxAwYxRxgrJ8zxVthRQAkeEAqSyca7QKlSMI9dxnotcJQNfCIZgWYFjcnuePclmR-9q7X-yP-nsJD0Qd2fi6NuP3nQhO7S9b4ZPGSQcWcqkAPwFbylYkg</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2713090542</pqid></control><display><type>article</type><title>Multi-NeuS: 3D Head Portraits from Single Image with Neural Implicit Functions</title><source>arXiv.org</source><source>Free E- Journals</source><creator>Burkov, Egor ; Rakhimov, Ruslan ; Safin, Aleksandr ; Burnaev, Evgeny ; Lempitsky, Victor</creator><creatorcontrib>Burkov, Egor ; Rakhimov, Ruslan ; Safin, Aleksandr ; Burnaev, Evgeny ; Lempitsky, Victor</creatorcontrib><description>We present an approach for the reconstruction of textured 3D meshes of human heads from one or few views. Since such few-shot reconstruction is underconstrained, it requires prior knowledge which is hard to impose on traditional 3D reconstruction algorithms. In this work, we rely on the recently introduced 3D representation \(\unicode{x2013}\) neural implicit functions \(\unicode{x2013}\) which, being based on neural networks, allows to naturally learn priors about human heads from data, and is directly convertible to textured mesh. Namely, we extend NeuS, a state-of-the-art neural implicit function formulation, to represent multiple objects of a class (human heads in our case) simultaneously. The underlying neural net architecture is designed to learn the commonalities among these objects and to generalize to unseen ones. Our model is trained on just a hundred smartphone videos and does not require any scanned 3D data. Afterwards, the model can fit novel heads in the few-shot or one-shot modes with good results.</description><identifier>EISSN: 2331-8422</identifier><identifier>DOI: 10.48550/arxiv.2209.04436</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Algorithms ; Computer Science - Computer Vision and Pattern Recognition ; Finite element method ; Image reconstruction ; Neural networks</subject><ispartof>arXiv.org, 2023-09</ispartof><rights>2023. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><rights>http://creativecommons.org/licenses/by/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,784,885,27924</link.rule.ids><backlink>$$Uhttps://doi.org/10.48550/arXiv.2209.04436$$DView paper in arXiv$$Hfree_for_read</backlink><backlink>$$Uhttps://doi.org/10.1109/ACCESS.2023.3309412$$DView published paper (Access to full text may be restricted)$$Hfree_for_read</backlink></links><search><creatorcontrib>Burkov, Egor</creatorcontrib><creatorcontrib>Rakhimov, Ruslan</creatorcontrib><creatorcontrib>Safin, Aleksandr</creatorcontrib><creatorcontrib>Burnaev, Evgeny</creatorcontrib><creatorcontrib>Lempitsky, Victor</creatorcontrib><title>Multi-NeuS: 3D Head Portraits from Single Image with Neural Implicit Functions</title><title>arXiv.org</title><description>We present an approach for the reconstruction of textured 3D meshes of human heads from one or few views. Since such few-shot reconstruction is underconstrained, it requires prior knowledge which is hard to impose on traditional 3D reconstruction algorithms. In this work, we rely on the recently introduced 3D representation \(\unicode{x2013}\) neural implicit functions \(\unicode{x2013}\) which, being based on neural networks, allows to naturally learn priors about human heads from data, and is directly convertible to textured mesh. Namely, we extend NeuS, a state-of-the-art neural implicit function formulation, to represent multiple objects of a class (human heads in our case) simultaneously. The underlying neural net architecture is designed to learn the commonalities among these objects and to generalize to unseen ones. Our model is trained on just a hundred smartphone videos and does not require any scanned 3D data. Afterwards, the model can fit novel heads in the few-shot or one-shot modes with good results.</description><subject>Algorithms</subject><subject>Computer Science - Computer Vision and Pattern Recognition</subject><subject>Finite element method</subject><subject>Image reconstruction</subject><subject>Neural networks</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GOX</sourceid><recordid>eNotj11LwzAYhYMgOOZ-gFcGvO5M3jdpU-9kug-YU9juS9omM6MfM039-PfWzasDh4fDeQi54WwqlJTsXvtv9zkFYOmUCYHxBRkBIo-UALgik647MMYgTkBKHJHNS18FF21Mv32g-ESXRpf0rfXBaxc6an1b061r9pWhq1rvDf1y4Z0OuNfV0BwrV7hA531TBNc23TW5tLrqzOQ_x2Q3f97NltH6dbGaPa4jnco4SgxAwYxRxgrJ8zxVthRQAkeEAqSyca7QKlSMI9dxnotcJQNfCIZgWYFjcnuePclmR-9q7X-yP-nsJD0Qd2fi6NuP3nQhO7S9b4ZPGSQcWcqkAPwFbylYkg</recordid><startdate>20230908</startdate><enddate>20230908</enddate><creator>Burkov, Egor</creator><creator>Rakhimov, Ruslan</creator><creator>Safin, Aleksandr</creator><creator>Burnaev, Evgeny</creator><creator>Lempitsky, Victor</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PTHSS</scope><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20230908</creationdate><title>Multi-NeuS: 3D Head Portraits from Single Image with Neural Implicit Functions</title><author>Burkov, Egor ; Rakhimov, Ruslan ; Safin, Aleksandr ; Burnaev, Evgeny ; Lempitsky, Victor</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a956-7e22c0ee8ef451bb98fd42d21332c258f6b83f8380131a6bb4b870eec4032f0c3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Algorithms</topic><topic>Computer Science - Computer Vision and Pattern Recognition</topic><topic>Finite element method</topic><topic>Image reconstruction</topic><topic>Neural networks</topic><toplevel>online_resources</toplevel><creatorcontrib>Burkov, Egor</creatorcontrib><creatorcontrib>Rakhimov, Ruslan</creatorcontrib><creatorcontrib>Safin, Aleksandr</creatorcontrib><creatorcontrib>Burnaev, Evgeny</creatorcontrib><creatorcontrib>Lempitsky, Victor</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>Engineering Collection</collection><collection>arXiv Computer Science</collection><collection>arXiv.org</collection><jtitle>arXiv.org</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Burkov, Egor</au><au>Rakhimov, Ruslan</au><au>Safin, Aleksandr</au><au>Burnaev, Evgeny</au><au>Lempitsky, Victor</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Multi-NeuS: 3D Head Portraits from Single Image with Neural Implicit Functions</atitle><jtitle>arXiv.org</jtitle><date>2023-09-08</date><risdate>2023</risdate><eissn>2331-8422</eissn><abstract>We present an approach for the reconstruction of textured 3D meshes of human heads from one or few views. Since such few-shot reconstruction is underconstrained, it requires prior knowledge which is hard to impose on traditional 3D reconstruction algorithms. In this work, we rely on the recently introduced 3D representation \(\unicode{x2013}\) neural implicit functions \(\unicode{x2013}\) which, being based on neural networks, allows to naturally learn priors about human heads from data, and is directly convertible to textured mesh. Namely, we extend NeuS, a state-of-the-art neural implicit function formulation, to represent multiple objects of a class (human heads in our case) simultaneously. The underlying neural net architecture is designed to learn the commonalities among these objects and to generalize to unseen ones. Our model is trained on just a hundred smartphone videos and does not require any scanned 3D data. Afterwards, the model can fit novel heads in the few-shot or one-shot modes with good results.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><doi>10.48550/arxiv.2209.04436</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2023-09
issn 2331-8422
language eng
recordid cdi_arxiv_primary_2209_04436
source arXiv.org; Free E- Journals
subjects Algorithms
Computer Science - Computer Vision and Pattern Recognition
Finite element method
Image reconstruction
Neural networks
title Multi-NeuS: 3D Head Portraits from Single Image with Neural Implicit Functions
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-08T06%3A47%3A18IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_arxiv&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Multi-NeuS:%203D%20Head%20Portraits%20from%20Single%20Image%20with%20Neural%20Implicit%20Functions&rft.jtitle=arXiv.org&rft.au=Burkov,%20Egor&rft.date=2023-09-08&rft.eissn=2331-8422&rft_id=info:doi/10.48550/arxiv.2209.04436&rft_dat=%3Cproquest_arxiv%3E2713090542%3C/proquest_arxiv%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2713090542&rft_id=info:pmid/&rfr_iscdi=true