Methods and systems for image and voice processing

Systems and methods are disclosed configured to train an autoencoder using images that include faces, wherein the autoencoder comprises an input layer, an encoder configured to output a latent image from a corresponding input image, and a decoder configured to attempt to reconstruct the input image...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Bogan, III, Carl Davis, Laser, Jacob Myles, Berlin, Cody Gustave, Lande, Kenneth Michael, Øland, Anders, Lee, Brian Sung
Format:	Patent
Sprache:	eng
Schlagworte:	CALCULATING COMPUTING COUNTING ELECTRIC DIGITAL DATA PROCESSING IMAGE DATA PROCESSING OR GENERATION, IN GENERAL PHYSICS
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	Bogan, III, Carl Davis Laser, Jacob Myles Berlin, Cody Gustave Lande, Kenneth Michael Øland, Anders Lee, Brian Sung
description	Systems and methods are disclosed configured to train an autoencoder using images that include faces, wherein the autoencoder comprises an input layer, an encoder configured to output a latent image from a corresponding input image, and a decoder configured to attempt to reconstruct the input image from the latent image. An image sequence of a face exhibiting a plurality of facial expressions and transitions between facial expressions is generated and accessed. Images of the plurality of facial expressions and transitions between facial expressions are captured from a plurality of different angles and using different lighting. An autoencoder is trained using source images that include the face with different facial expressions captured at different angles with different lighting, and using destination images that include a destination face. The trained autoencoder is used to generate an output where the likeness of the face in the destination images is swapped with the likeness of the source face, while preserving expressions of the destination face.
format	Patent
fullrecord	<record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_US11670024B2</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>US11670024B2</sourcerecordid><originalsourceid>FETCH-epo_espacenet_US11670024B23</originalsourceid><addsrcrecordid>eNrjZDDyTS3JyE8pVkjMS1EoriwuSc0tVkjLL1LIzE1MTwWLluVnJqcqFBTlJ6cWF2fmpfMwsKYl5hSn8kJpbgZFN9cQZw_d1IL8-NTigsTk1LzUkvjQYENDM3MDAyMTJyNjYtQAALo2LAY</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Methods and systems for image and voice processing</title><source>esp@cenet</source><creator>Bogan, III, Carl Davis ; Laser, Jacob Myles ; Berlin, Cody Gustave ; Lande, Kenneth Michael ; Øland, Anders ; Lee, Brian Sung</creator><creatorcontrib>Bogan, III, Carl Davis ; Laser, Jacob Myles ; Berlin, Cody Gustave ; Lande, Kenneth Michael ; Øland, Anders ; Lee, Brian Sung</creatorcontrib><description>Systems and methods are disclosed configured to train an autoencoder using images that include faces, wherein the autoencoder comprises an input layer, an encoder configured to output a latent image from a corresponding input image, and a decoder configured to attempt to reconstruct the input image from the latent image. An image sequence of a face exhibiting a plurality of facial expressions and transitions between facial expressions is generated and accessed. Images of the plurality of facial expressions and transitions between facial expressions are captured from a plurality of different angles and using different lighting. An autoencoder is trained using source images that include the face with different facial expressions captured at different angles with different lighting, and using destination images that include a destination face. The trained autoencoder is used to generate an output where the likeness of the face in the destination images is swapped with the likeness of the source face, while preserving expressions of the destination face.</description><language>eng</language><subject>CALCULATING ; COMPUTING ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; IMAGE DATA PROCESSING OR GENERATION, IN GENERAL ; PHYSICS</subject><creationdate>2023</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20230606&DB=EPODOC&CC=US&NR=11670024B2$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,780,885,25563,76418</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20230606&DB=EPODOC&CC=US&NR=11670024B2$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>Bogan, III, Carl Davis</creatorcontrib><creatorcontrib>Laser, Jacob Myles</creatorcontrib><creatorcontrib>Berlin, Cody Gustave</creatorcontrib><creatorcontrib>Lande, Kenneth Michael</creatorcontrib><creatorcontrib>Øland, Anders</creatorcontrib><creatorcontrib>Lee, Brian Sung</creatorcontrib><title>Methods and systems for image and voice processing</title><description>Systems and methods are disclosed configured to train an autoencoder using images that include faces, wherein the autoencoder comprises an input layer, an encoder configured to output a latent image from a corresponding input image, and a decoder configured to attempt to reconstruct the input image from the latent image. An image sequence of a face exhibiting a plurality of facial expressions and transitions between facial expressions is generated and accessed. Images of the plurality of facial expressions and transitions between facial expressions are captured from a plurality of different angles and using different lighting. An autoencoder is trained using source images that include the face with different facial expressions captured at different angles with different lighting, and using destination images that include a destination face. The trained autoencoder is used to generate an output where the likeness of the face in the destination images is swapped with the likeness of the source face, while preserving expressions of the destination face.</description><subject>CALCULATING</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>ELECTRIC DIGITAL DATA PROCESSING</subject><subject>IMAGE DATA PROCESSING OR GENERATION, IN GENERAL</subject><subject>PHYSICS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2023</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZDDyTS3JyE8pVkjMS1EoriwuSc0tVkjLL1LIzE1MTwWLluVnJqcqFBTlJ6cWF2fmpfMwsKYl5hSn8kJpbgZFN9cQZw_d1IL8-NTigsTk1LzUkvjQYENDM3MDAyMTJyNjYtQAALo2LAY</recordid><startdate>20230606</startdate><enddate>20230606</enddate><creator>Bogan, III, Carl Davis</creator><creator>Laser, Jacob Myles</creator><creator>Berlin, Cody Gustave</creator><creator>Lande, Kenneth Michael</creator><creator>Øland, Anders</creator><creator>Lee, Brian Sung</creator><scope>EVB</scope></search><sort><creationdate>20230606</creationdate><title>Methods and systems for image and voice processing</title><author>Bogan, III, Carl Davis ; Laser, Jacob Myles ; Berlin, Cody Gustave ; Lande, Kenneth Michael ; Øland, Anders ; Lee, Brian Sung</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_US11670024B23</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>eng</language><creationdate>2023</creationdate><topic>CALCULATING</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>ELECTRIC DIGITAL DATA PROCESSING</topic><topic>IMAGE DATA PROCESSING OR GENERATION, IN GENERAL</topic><topic>PHYSICS</topic><toplevel>online_resources</toplevel><creatorcontrib>Bogan, III, Carl Davis</creatorcontrib><creatorcontrib>Laser, Jacob Myles</creatorcontrib><creatorcontrib>Berlin, Cody Gustave</creatorcontrib><creatorcontrib>Lande, Kenneth Michael</creatorcontrib><creatorcontrib>Øland, Anders</creatorcontrib><creatorcontrib>Lee, Brian Sung</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Bogan, III, Carl Davis</au><au>Laser, Jacob Myles</au><au>Berlin, Cody Gustave</au><au>Lande, Kenneth Michael</au><au>Øland, Anders</au><au>Lee, Brian Sung</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Methods and systems for image and voice processing</title><date>2023-06-06</date><risdate>2023</risdate><abstract>Systems and methods are disclosed configured to train an autoencoder using images that include faces, wherein the autoencoder comprises an input layer, an encoder configured to output a latent image from a corresponding input image, and a decoder configured to attempt to reconstruct the input image from the latent image. An image sequence of a face exhibiting a plurality of facial expressions and transitions between facial expressions is generated and accessed. Images of the plurality of facial expressions and transitions between facial expressions are captured from a plurality of different angles and using different lighting. An autoencoder is trained using source images that include the face with different facial expressions captured at different angles with different lighting, and using destination images that include a destination face. The trained autoencoder is used to generate an output where the likeness of the face in the destination images is swapped with the likeness of the source face, while preserving expressions of the destination face.</abstract><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier
ispartof
issn
language	eng
recordid	cdi_epo_espacenet_US11670024B2
source	esp@cenet
subjects	CALCULATING COMPUTING COUNTING ELECTRIC DIGITAL DATA PROCESSING IMAGE DATA PROCESSING OR GENERATION, IN GENERAL PHYSICS
title	Methods and systems for image and voice processing
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-08T19%3A14%3A57IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=Bogan,%20III,%20Carl%20Davis&rft.date=2023-06-06&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EUS11670024B2%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true