Methods and systems for image and voice processing

Systems and methods are disclosed configured to train an autoencoder using images that include faces, wherein the autoencoder comprises an input layer, an encoder configured to output a latent image from a corresponding input image, and a decoder configured to attempt to reconstruct the input image...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Bogan, III, Carl Davis, Laser, Jacob Myles, Berlin, Cody Gustave, Lande, Kenneth Michael, Øland, Anders, Lee, Brian Sung
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Bogan, III, Carl Davis
Laser, Jacob Myles
Berlin, Cody Gustave
Lande, Kenneth Michael
Øland, Anders
Lee, Brian Sung
description Systems and methods are disclosed configured to train an autoencoder using images that include faces, wherein the autoencoder comprises an input layer, an encoder configured to output a latent image from a corresponding input image, and a decoder configured to attempt to reconstruct the input image from the latent image. An image sequence of a face exhibiting a plurality of facial expressions and transitions between facial expressions is generated and accessed. Images of the plurality of facial expressions and transitions between facial expressions are captured from a plurality of different angles and using different lighting. An autoencoder is trained using source images that include the face with different facial expressions captured at different angles with different lighting, and using destination images that include a destination face. The trained autoencoder is used to generate an output where the likeness of the face in the destination images is swapped with the likeness of the source face, while preserving expressions of the destination face.
format Patent
fullrecord <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_US11670024B2</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>US11670024B2</sourcerecordid><originalsourceid>FETCH-epo_espacenet_US11670024B23</originalsourceid><addsrcrecordid>eNrjZDDyTS3JyE8pVkjMS1EoriwuSc0tVkjLL1LIzE1MTwWLluVnJqcqFBTlJ6cWF2fmpfMwsKYl5hSn8kJpbgZFN9cQZw_d1IL8-NTigsTk1LzUkvjQYENDM3MDAyMTJyNjYtQAALo2LAY</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Methods and systems for image and voice processing</title><source>esp@cenet</source><creator>Bogan, III, Carl Davis ; Laser, Jacob Myles ; Berlin, Cody Gustave ; Lande, Kenneth Michael ; Øland, Anders ; Lee, Brian Sung</creator><creatorcontrib>Bogan, III, Carl Davis ; Laser, Jacob Myles ; Berlin, Cody Gustave ; Lande, Kenneth Michael ; Øland, Anders ; Lee, Brian Sung</creatorcontrib><description>Systems and methods are disclosed configured to train an autoencoder using images that include faces, wherein the autoencoder comprises an input layer, an encoder configured to output a latent image from a corresponding input image, and a decoder configured to attempt to reconstruct the input image from the latent image. An image sequence of a face exhibiting a plurality of facial expressions and transitions between facial expressions is generated and accessed. Images of the plurality of facial expressions and transitions between facial expressions are captured from a plurality of different angles and using different lighting. An autoencoder is trained using source images that include the face with different facial expressions captured at different angles with different lighting, and using destination images that include a destination face. The trained autoencoder is used to generate an output where the likeness of the face in the destination images is swapped with the likeness of the source face, while preserving expressions of the destination face.</description><language>eng</language><subject>CALCULATING ; COMPUTING ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; IMAGE DATA PROCESSING OR GENERATION, IN GENERAL ; PHYSICS</subject><creationdate>2023</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20230606&amp;DB=EPODOC&amp;CC=US&amp;NR=11670024B2$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,780,885,25563,76418</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20230606&amp;DB=EPODOC&amp;CC=US&amp;NR=11670024B2$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>Bogan, III, Carl Davis</creatorcontrib><creatorcontrib>Laser, Jacob Myles</creatorcontrib><creatorcontrib>Berlin, Cody Gustave</creatorcontrib><creatorcontrib>Lande, Kenneth Michael</creatorcontrib><creatorcontrib>Øland, Anders</creatorcontrib><creatorcontrib>Lee, Brian Sung</creatorcontrib><title>Methods and systems for image and voice processing</title><description>Systems and methods are disclosed configured to train an autoencoder using images that include faces, wherein the autoencoder comprises an input layer, an encoder configured to output a latent image from a corresponding input image, and a decoder configured to attempt to reconstruct the input image from the latent image. An image sequence of a face exhibiting a plurality of facial expressions and transitions between facial expressions is generated and accessed. Images of the plurality of facial expressions and transitions between facial expressions are captured from a plurality of different angles and using different lighting. An autoencoder is trained using source images that include the face with different facial expressions captured at different angles with different lighting, and using destination images that include a destination face. The trained autoencoder is used to generate an output where the likeness of the face in the destination images is swapped with the likeness of the source face, while preserving expressions of the destination face.</description><subject>CALCULATING</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>ELECTRIC DIGITAL DATA PROCESSING</subject><subject>IMAGE DATA PROCESSING OR GENERATION, IN GENERAL</subject><subject>PHYSICS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2023</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZDDyTS3JyE8pVkjMS1EoriwuSc0tVkjLL1LIzE1MTwWLluVnJqcqFBTlJ6cWF2fmpfMwsKYl5hSn8kJpbgZFN9cQZw_d1IL8-NTigsTk1LzUkvjQYENDM3MDAyMTJyNjYtQAALo2LAY</recordid><startdate>20230606</startdate><enddate>20230606</enddate><creator>Bogan, III, Carl Davis</creator><creator>Laser, Jacob Myles</creator><creator>Berlin, Cody Gustave</creator><creator>Lande, Kenneth Michael</creator><creator>Øland, Anders</creator><creator>Lee, Brian Sung</creator><scope>EVB</scope></search><sort><creationdate>20230606</creationdate><title>Methods and systems for image and voice processing</title><author>Bogan, III, Carl Davis ; Laser, Jacob Myles ; Berlin, Cody Gustave ; Lande, Kenneth Michael ; Øland, Anders ; Lee, Brian Sung</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_US11670024B23</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>eng</language><creationdate>2023</creationdate><topic>CALCULATING</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>ELECTRIC DIGITAL DATA PROCESSING</topic><topic>IMAGE DATA PROCESSING OR GENERATION, IN GENERAL</topic><topic>PHYSICS</topic><toplevel>online_resources</toplevel><creatorcontrib>Bogan, III, Carl Davis</creatorcontrib><creatorcontrib>Laser, Jacob Myles</creatorcontrib><creatorcontrib>Berlin, Cody Gustave</creatorcontrib><creatorcontrib>Lande, Kenneth Michael</creatorcontrib><creatorcontrib>Øland, Anders</creatorcontrib><creatorcontrib>Lee, Brian Sung</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Bogan, III, Carl Davis</au><au>Laser, Jacob Myles</au><au>Berlin, Cody Gustave</au><au>Lande, Kenneth Michael</au><au>Øland, Anders</au><au>Lee, Brian Sung</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Methods and systems for image and voice processing</title><date>2023-06-06</date><risdate>2023</risdate><abstract>Systems and methods are disclosed configured to train an autoencoder using images that include faces, wherein the autoencoder comprises an input layer, an encoder configured to output a latent image from a corresponding input image, and a decoder configured to attempt to reconstruct the input image from the latent image. An image sequence of a face exhibiting a plurality of facial expressions and transitions between facial expressions is generated and accessed. Images of the plurality of facial expressions and transitions between facial expressions are captured from a plurality of different angles and using different lighting. An autoencoder is trained using source images that include the face with different facial expressions captured at different angles with different lighting, and using destination images that include a destination face. The trained autoencoder is used to generate an output where the likeness of the face in the destination images is swapped with the likeness of the source face, while preserving expressions of the destination face.</abstract><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier
ispartof
issn
language eng
recordid cdi_epo_espacenet_US11670024B2
source esp@cenet
subjects CALCULATING
COMPUTING
COUNTING
ELECTRIC DIGITAL DATA PROCESSING
IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
PHYSICS
title Methods and systems for image and voice processing
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-08T19%3A14%3A57IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=Bogan,%20III,%20Carl%20Davis&rft.date=2023-06-06&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EUS11670024B2%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true