Multi-NeuS: 3D Head Portraits from Single Image with Neural Implicit Functions

We present an approach for the reconstruction of textured 3D meshes of human heads from one or few views. Since such few-shot reconstruction is underconstrained, it requires prior knowledge which is hard to impose on traditional 3D reconstruction algorithms. In this work, we rely on the recently int...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	arXiv.org 2023-09
Hauptverfasser:	Burkov, Egor, Rakhimov, Ruslan, Safin, Aleksandr, Burnaev, Evgeny, Lempitsky, Victor
Format:	Artikel
Sprache:	eng
Schlagworte:	Algorithms Computer Science - Computer Vision and Pattern Recognition Finite element method Image reconstruction Neural networks
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title	arXiv.org
container_volume
creator	Burkov, Egor Rakhimov, Ruslan Safin, Aleksandr Burnaev, Evgeny Lempitsky, Victor
description	We present an approach for the reconstruction of textured 3D meshes of human heads from one or few views. Since such few-shot reconstruction is underconstrained, it requires prior knowledge which is hard to impose on traditional 3D reconstruction algorithms. In this work, we rely on the recently introduced 3D representation $\unicode{x2013}$ neural implicit functions $\unicode{x2013}$ which, being based on neural networks, allows to naturally learn priors about human heads from data, and is directly convertible to textured mesh. Namely, we extend NeuS, a state-of-the-art neural implicit function formulation, to represent multiple objects of a class (human heads in our case) simultaneously. The underlying neural net architecture is designed to learn the commonalities among these objects and to generalize to unseen ones. Our model is trained on just a hundred smartphone videos and does not require any scanned 3D data. Afterwards, the model can fit novel heads in the few-shot or one-shot modes with good results.
doi_str_mv	10.48550/arxiv.2209.04436
format	Article
fullrecord	<record><control><sourceid>proquest_arxiv</sourceid><recordid>TN_cdi_arxiv_primary_2209_04436</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2713090542</sourcerecordid><originalsourceid>FETCH-LOGICAL-a956-7e22c0ee8ef451bb98fd42d21332c258f6b83f8380131a6bb4b870eec4032f0c3</originalsourceid><addsrcrecordid>eNotj11LwzAYhYMgOOZ-gFcGvO5M3jdpU-9kug-YU9juS9omM6MfM039-PfWzasDh4fDeQi54WwqlJTsXvtv9zkFYOmUCYHxBRkBIo-UALgik647MMYgTkBKHJHNS18FF21Mv32g-ESXRpf0rfXBaxc6an1b061r9pWhq1rvDf1y4Z0OuNfV0BwrV7hA531TBNc23TW5tLrqzOQ_x2Q3f97NltH6dbGaPa4jnco4SgxAwYxRxgrJ8zxVthRQAkeEAqSyca7QKlSMI9dxnotcJQNfCIZgWYFjcnuePclmR-9q7X-yP-nsJD0Qd2fi6NuP3nQhO7S9b4ZPGSQcWcqkAPwFbylYkg</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2713090542</pqid></control><display><type>article</type><title>Multi-NeuS: 3D Head Portraits from Single Image with Neural Implicit Functions</title><source>arXiv.org</source><source>Free E- Journals</source><creator>Burkov, Egor ; Rakhimov, Ruslan ; Safin, Aleksandr ; Burnaev, Evgeny ; Lempitsky, Victor</creator><creatorcontrib>Burkov, Egor ; Rakhimov, Ruslan ; Safin, Aleksandr ; Burnaev, Evgeny ; Lempitsky, Victor</creatorcontrib><description>We present an approach for the reconstruction of textured 3D meshes of human heads from one or few views. Since such few-shot reconstruction is underconstrained, it requires prior knowledge which is hard to impose on traditional 3D reconstruction algorithms. In this work, we rely on the recently introduced 3D representation $\unicode{x2013}$ neural implicit functions $\unicode{x2013}$ which, being based on neural networks, allows to naturally learn priors about human heads from data, and is directly convertible to textured mesh. Namely, we extend NeuS, a state-of-the-art neural implicit function formulation, to represent multiple objects of a class (human heads in our case) simultaneously. The underlying neural net architecture is designed to learn the commonalities among these objects and to generalize to unseen ones. Our model is trained on just a hundred smartphone videos and does not require any scanned 3D data. Afterwards, the model can fit novel heads in the few-shot or one-shot modes with good results.</description><identifier>EISSN: 2331-8422</identifier><identifier>DOI: 10.48550/arxiv.2209.04436</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Algorithms ; Computer Science - Computer Vision and Pattern Recognition ; Finite element method ; Image reconstruction ; Neural networks</subject><ispartof>arXiv.org, 2023-09</ispartof><rights>2023. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><rights>http://creativecommons.org/licenses/by/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,784,885,27924</link.rule.ids><backlink>$$Uhttps://doi.org/10.48550/arXiv.2209.04436$$DView paper in arXiv$$Hfree_for_read</backlink><backlink>$$Uhttps://doi.org/10.1109/ACCESS.2023.3309412$$DView published paper (Access to full text may be restricted)$$Hfree_for_read</backlink></links><search><creatorcontrib>Burkov, Egor</creatorcontrib><creatorcontrib>Rakhimov, Ruslan</creatorcontrib><creatorcontrib>Safin, Aleksandr</creatorcontrib><creatorcontrib>Burnaev, Evgeny</creatorcontrib><creatorcontrib>Lempitsky, Victor</creatorcontrib><title>Multi-NeuS: 3D Head Portraits from Single Image with Neural Implicit Functions</title><title>arXiv.org</title><description>We present an approach for the reconstruction of textured 3D meshes of human heads from one or few views. Since such few-shot reconstruction is underconstrained, it requires prior knowledge which is hard to impose on traditional 3D reconstruction algorithms. In this work, we rely on the recently introduced 3D representation $\unicode{x2013}$ neural implicit functions $\unicode{x2013}$ which, being based on neural networks, allows to naturally learn priors about human heads from data, and is directly convertible to textured mesh. Namely, we extend NeuS, a state-of-the-art neural implicit function formulation, to represent multiple objects of a class (human heads in our case) simultaneously. The underlying neural net architecture is designed to learn the commonalities among these objects and to generalize to unseen ones. Our model is trained on just a hundred smartphone videos and does not require any scanned 3D data. Afterwards, the model can fit novel heads in the few-shot or one-shot modes with good results.</description><subject>Algorithms</subject><subject>Computer Science - Computer Vision and Pattern Recognition</subject><subject>Finite element method</subject><subject>Image reconstruction</subject><subject>Neural networks</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GOX</sourceid><recordid>eNotj11LwzAYhYMgOOZ-gFcGvO5M3jdpU-9kug-YU9juS9omM6MfM039-PfWzasDh4fDeQi54WwqlJTsXvtv9zkFYOmUCYHxBRkBIo-UALgik647MMYgTkBKHJHNS18FF21Mv32g-ESXRpf0rfXBaxc6an1b061r9pWhq1rvDf1y4Z0OuNfV0BwrV7hA531TBNc23TW5tLrqzOQ_x2Q3f97NltH6dbGaPa4jnco4SgxAwYxRxgrJ8zxVthRQAkeEAqSyca7QKlSMI9dxnotcJQNfCIZgWYFjcnuePclmR-9q7X-yP-nsJD0Qd2fi6NuP3nQhO7S9b4ZPGSQcWcqkAPwFbylYkg</recordid><startdate>20230908</startdate><enddate>20230908</enddate><creator>Burkov, Egor</creator><creator>Rakhimov, Ruslan</creator><creator>Safin, Aleksandr</creator><creator>Burnaev, Evgeny</creator><creator>Lempitsky, Victor</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PTHSS</scope><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20230908</creationdate><title>Multi-NeuS: 3D Head Portraits from Single Image with Neural Implicit Functions</title><author>Burkov, Egor ; Rakhimov, Ruslan ; Safin, Aleksandr ; Burnaev, Evgeny ; Lempitsky, Victor</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a956-7e22c0ee8ef451bb98fd42d21332c258f6b83f8380131a6bb4b870eec4032f0c3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Algorithms</topic><topic>Computer Science - Computer Vision and Pattern Recognition</topic><topic>Finite element method</topic><topic>Image reconstruction</topic><topic>Neural networks</topic><toplevel>online_resources</toplevel><creatorcontrib>Burkov, Egor</creatorcontrib><creatorcontrib>Rakhimov, Ruslan</creatorcontrib><creatorcontrib>Safin, Aleksandr</creatorcontrib><creatorcontrib>Burnaev, Evgeny</creatorcontrib><creatorcontrib>Lempitsky, Victor</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>Engineering Collection</collection><collection>arXiv Computer Science</collection><collection>arXiv.org</collection><jtitle>arXiv.org</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Burkov, Egor</au><au>Rakhimov, Ruslan</au><au>Safin, Aleksandr</au><au>Burnaev, Evgeny</au><au>Lempitsky, Victor</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Multi-NeuS: 3D Head Portraits from Single Image with Neural Implicit Functions</atitle><jtitle>arXiv.org</jtitle><date>2023-09-08</date><risdate>2023</risdate><eissn>2331-8422</eissn><abstract>We present an approach for the reconstruction of textured 3D meshes of human heads from one or few views. Since such few-shot reconstruction is underconstrained, it requires prior knowledge which is hard to impose on traditional 3D reconstruction algorithms. In this work, we rely on the recently introduced 3D representation $\unicode{x2013}$ neural implicit functions $\unicode{x2013}$ which, being based on neural networks, allows to naturally learn priors about human heads from data, and is directly convertible to textured mesh. Namely, we extend NeuS, a state-of-the-art neural implicit function formulation, to represent multiple objects of a class (human heads in our case) simultaneously. The underlying neural net architecture is designed to learn the commonalities among these objects and to generalize to unseen ones. Our model is trained on just a hundred smartphone videos and does not require any scanned 3D data. Afterwards, the model can fit novel heads in the few-shot or one-shot modes with good results.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><doi>10.48550/arxiv.2209.04436</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	EISSN: 2331-8422
ispartof	arXiv.org, 2023-09
issn	2331-8422
language	eng
recordid	cdi_arxiv_primary_2209_04436
source	arXiv.org; Free E- Journals
subjects	Algorithms Computer Science - Computer Vision and Pattern Recognition Finite element method Image reconstruction Neural networks
title	Multi-NeuS: 3D Head Portraits from Single Image with Neural Implicit Functions
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-08T06%3A47%3A18IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_arxiv&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Multi-NeuS:%203D%20Head%20Portraits%20from%20Single%20Image%20with%20Neural%20Implicit%20Functions&rft.jtitle=arXiv.org&rft.au=Burkov,%20Egor&rft.date=2023-09-08&rft.eissn=2331-8422&rft_id=info:doi/10.48550/arxiv.2209.04436&rft_dat=%3Cproquest_arxiv%3E2713090542%3C/proquest_arxiv%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2713090542&rft_id=info:pmid/&rfr_iscdi=true