Methods and systems for image and voice processing
Systems and methods are disclosed configured to train an autoencoder using images that include faces, wherein the autoencoder comprises an input layer, an encoder configured to output a latent image from a corresponding input image, and a decoder configured to attempt to reconstruct the input image...
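Main authors: | Bogan, III, Carl Davis; Laser, Jacob Myles; Berlin, Cody Gustave; Lande, Kenneth Michael; Øland, Anders; Lee, Brian Sung |
---|---|
Format: | Patent |
Language: | eng |
Subjects: | |
Online access: | Order full text |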
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | Bogan, III, Carl Davis ; Laser, Jacob Myles ; Berlin, Cody Gustave ; Lande, Kenneth Michael ; Øland, Anders ; Lee, Brian Sung |
description | Systems and methods are disclosed configured to train an autoencoder using images that include faces, wherein the autoencoder comprises an input layer, an encoder configured to output a latent image from a corresponding input image, and a decoder configured to attempt to reconstruct the input image from the latent image. An image sequence of a face exhibiting a plurality of facial expressions and transitions between facial expressions is generated and accessed. Images of the plurality of facial expressions and transitions between facial expressions are captured from a plurality of different angles and using different lighting. An autoencoder is trained using source images that include the face with different facial expressions captured at different angles with different lighting, and using destination images that include a destination face. The trained autoencoder is used to generate an output where the likeness of the face in the destination images is swapped with the likeness of the source face, while preserving expressions of the destination face. |
format | Patent |
fullrecord | <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_US11670024B2</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>US11670024B2</sourcerecordid><originalsourceid>FETCH-epo_espacenet_US11670024B23</originalsourceid><addsrcrecordid>eNrjZDDyTS3JyE8pVkjMS1EoriwuSc0tVkjLL1LIzE1MTwWLluVnJqcqFBTlJ6cWF2fmpfMwsKYl5hSn8kJpbgZFN9cQZw_d1IL8-NTigsTk1LzUkvjQYENDM3MDAyMTJyNjYtQAALo2LAY</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Methods and systems for image and voice processing</title><source>esp@cenet</source><creator>Bogan, III, Carl Davis ; Laser, Jacob Myles ; Berlin, Cody Gustave ; Lande, Kenneth Michael ; Øland, Anders ; Lee, Brian Sung</creator><creatorcontrib>Bogan, III, Carl Davis ; Laser, Jacob Myles ; Berlin, Cody Gustave ; Lande, Kenneth Michael ; Øland, Anders ; Lee, Brian Sung</creatorcontrib><description>Systems and methods are disclosed configured to train an autoencoder using images that include faces, wherein the autoencoder comprises an input layer, an encoder configured to output a latent image from a corresponding input image, and a decoder configured to attempt to reconstruct the input image from the latent image. An image sequence of a face exhibiting a plurality of facial expressions and transitions between facial expressions is generated and accessed. Images of the plurality of facial expressions and transitions between facial expressions are captured from a plurality of different angles and using different lighting. An autoencoder is trained using source images that include the face with different facial expressions captured at different angles with different lighting, and using destination images that include a destination face. 
The trained autoencoder is used to generate an output where the likeness of the face in the destination images is swapped with the likeness of the source face, while preserving expressions of the destination face.</description><language>eng</language><subject>CALCULATING ; COMPUTING ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; IMAGE DATA PROCESSING OR GENERATION, IN GENERAL ; PHYSICS</subject><creationdate>2023</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20230606&DB=EPODOC&CC=US&NR=11670024B2$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,780,885,25563,76418</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20230606&DB=EPODOC&CC=US&NR=11670024B2$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>Bogan, III, Carl Davis</creatorcontrib><creatorcontrib>Laser, Jacob Myles</creatorcontrib><creatorcontrib>Berlin, Cody Gustave</creatorcontrib><creatorcontrib>Lande, Kenneth Michael</creatorcontrib><creatorcontrib>Øland, Anders</creatorcontrib><creatorcontrib>Lee, Brian Sung</creatorcontrib><title>Methods and systems for image and voice processing</title><description>Systems and methods are disclosed configured to train an autoencoder using images that include faces, wherein the autoencoder comprises an input layer, an encoder configured to output a latent image from a corresponding input image, and a decoder configured to attempt to reconstruct the input image from the latent image. 
An image sequence of a face exhibiting a plurality of facial expressions and transitions between facial expressions is generated and accessed. Images of the plurality of facial expressions and transitions between facial expressions are captured from a plurality of different angles and using different lighting. An autoencoder is trained using source images that include the face with different facial expressions captured at different angles with different lighting, and using destination images that include a destination face. The trained autoencoder is used to generate an output where the likeness of the face in the destination images is swapped with the likeness of the source face, while preserving expressions of the destination face.</description><subject>CALCULATING</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>ELECTRIC DIGITAL DATA PROCESSING</subject><subject>IMAGE DATA PROCESSING OR GENERATION, IN GENERAL</subject><subject>PHYSICS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2023</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZDDyTS3JyE8pVkjMS1EoriwuSc0tVkjLL1LIzE1MTwWLluVnJqcqFBTlJ6cWF2fmpfMwsKYl5hSn8kJpbgZFN9cQZw_d1IL8-NTigsTk1LzUkvjQYENDM3MDAyMTJyNjYtQAALo2LAY</recordid><startdate>20230606</startdate><enddate>20230606</enddate><creator>Bogan, III, Carl Davis</creator><creator>Laser, Jacob Myles</creator><creator>Berlin, Cody Gustave</creator><creator>Lande, Kenneth Michael</creator><creator>Øland, Anders</creator><creator>Lee, Brian Sung</creator><scope>EVB</scope></search><sort><creationdate>20230606</creationdate><title>Methods and systems for image and voice processing</title><author>Bogan, III, Carl Davis ; Laser, Jacob Myles ; Berlin, Cody Gustave ; Lande, Kenneth Michael ; Øland, Anders ; Lee, Brian 
Sung</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_US11670024B23</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>eng</language><creationdate>2023</creationdate><topic>CALCULATING</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>ELECTRIC DIGITAL DATA PROCESSING</topic><topic>IMAGE DATA PROCESSING OR GENERATION, IN GENERAL</topic><topic>PHYSICS</topic><toplevel>online_resources</toplevel><creatorcontrib>Bogan, III, Carl Davis</creatorcontrib><creatorcontrib>Laser, Jacob Myles</creatorcontrib><creatorcontrib>Berlin, Cody Gustave</creatorcontrib><creatorcontrib>Lande, Kenneth Michael</creatorcontrib><creatorcontrib>Øland, Anders</creatorcontrib><creatorcontrib>Lee, Brian Sung</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Bogan, III, Carl Davis</au><au>Laser, Jacob Myles</au><au>Berlin, Cody Gustave</au><au>Lande, Kenneth Michael</au><au>Øland, Anders</au><au>Lee, Brian Sung</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Methods and systems for image and voice processing</title><date>2023-06-06</date><risdate>2023</risdate><abstract>Systems and methods are disclosed configured to train an autoencoder using images that include faces, wherein the autoencoder comprises an input layer, an encoder configured to output a latent image from a corresponding input image, and a decoder configured to attempt to reconstruct the input image from the latent image. An image sequence of a face exhibiting a plurality of facial expressions and transitions between facial expressions is generated and accessed. Images of the plurality of facial expressions and transitions between facial expressions are captured from a plurality of different angles and using different lighting. 
An autoencoder is trained using source images that include the face with different facial expressions captured at different angles with different lighting, and using destination images that include a destination face. The trained autoencoder is used to generate an output where the likeness of the face in the destination images is swapped with the likeness of the source face, while preserving expressions of the destination face.</abstract><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | |
ispartof | |
issn | |
language | eng |
recordid | cdi_epo_espacenet_US11670024B2 |
source | esp@cenet |
subjects | CALCULATING ; COMPUTING ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; IMAGE DATA PROCESSING OR GENERATION, IN GENERAL ; PHYSICS |
title | Methods and systems for image and voice processing |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-08T19%3A14%3A57IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=Bogan,%20III,%20Carl%20Davis&rft.date=2023-06-06&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EUS11670024B2%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |
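
Illustrative sketch (not part of the catalog record): the description field above outlines an autoencoder whose encoder maps an input image to a latent image and whose decoder attempts to reconstruct the input, trained on source and destination face images and then used to swap likenesses. The Python below is a minimal toy version of that idea, not the patented method. It rests on assumptions the abstract does not state: one shared encoder with a separate decoder per identity (an arrangement often used in face-swap autoencoders), purely linear layers, and 4-element vectors standing in for images. Every name in it (`FaceSwapSketch`, `train_step`, `mean_loss`, and so on) is hypothetical.

```python
import random


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def init_weights(rows, cols, rng):
    """Small random weights; the scale is an arbitrary choice for this toy."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


class FaceSwapSketch:
    """Linear autoencoder: one shared encoder, one decoder per identity (assumed)."""

    def __init__(self, dim, latent_dim, seed=0):
        rng = random.Random(seed)
        self.encoder = init_weights(latent_dim, dim, rng)
        self.decoders = {
            "source": init_weights(dim, latent_dim, rng),
            "destination": init_weights(dim, latent_dim, rng),
        }

    def encode(self, image):
        # Input image -> latent image.
        return matvec(self.encoder, image)

    def decode(self, latent, identity):
        # Latent image -> reconstruction in the likeness of `identity`.
        return matvec(self.decoders[identity], latent)

    def loss(self, image, identity):
        # Squared reconstruction error for one image.
        recon = self.decode(self.encode(image), identity)
        return sum((r - x) ** 2 for r, x in zip(recon, image))

    def train_step(self, image, identity, lr=0.05):
        # One gradient-descent step on the squared error
        # (the constant factor 2 is folded into the learning rate).
        dec = self.decoders[identity]
        latent = self.encode(image)
        err = [r - x for r, x in zip(matvec(dec, latent), image)]
        # Back-propagate the error to the latent before changing the decoder.
        grad_latent = [
            sum(dec[i][k] * err[i] for i in range(len(err)))
            for k in range(len(latent))
        ]
        for i, e in enumerate(err):
            for k, z in enumerate(latent):
                dec[i][k] -= lr * e * z
        for k, g in enumerate(grad_latent):
            for j, x in enumerate(image):
                self.encoder[k][j] -= lr * g * x


def mean_loss(model, images, identity):
    return sum(model.loss(img, identity) for img in images) / len(images)


# Toy data: random 4-element vectors stand in for face images.
data_rng = random.Random(1)
source_images = [[data_rng.random() for _ in range(4)] for _ in range(8)]
destination_images = [[data_rng.random() for _ in range(4)] for _ in range(8)]

model = FaceSwapSketch(dim=4, latent_dim=2)
loss_before = mean_loss(model, source_images, "source")
for _ in range(200):
    for img in source_images:
        model.train_step(img, "source")
    for img in destination_images:
        model.train_step(img, "destination")
loss_after = mean_loss(model, source_images, "source")

# Swap: encode a destination image, decode it with the source decoder.
swapped = model.decode(model.encode(destination_images[0]), "source")
```

In this sketch the swap step is the last line: a destination image is encoded by the shared encoder and decoded with the source decoder, which is one plausible reading of "the likeness of the face in the destination images is swapped with the likeness of the source face, while preserving expressions of the destination face." A real system would use convolutional layers and actual image data.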