Method for segmenting videos and audios into clips using speaker recognition

A method for segmenting video and audio into clips using speaker recognition is provided to segment audio according to speaker audio, and to make audio clips correspond to the audio and video signals to generate audio and video clips. The method instantly trains an independent speaker model by incre...
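
The abstract in this record describes a speaker model that is renewed as more of an unknown speaker's audio arrives, with the recognition result deciding the clip boundaries. The following is a minimal sketch of that general idea, not the patented method: it assumes each fixed-length audio window has already been reduced to a numeric feature vector, and it uses a running mean and a Euclidean distance threshold as stand-ins for a real speaker model. The names `SpeakerModel`, `segment_by_speaker`, `window_s`, and `threshold` are hypothetical.

```python
from dataclasses import dataclass
from math import dist


@dataclass
class SpeakerModel:
    """Running-mean model of one speaker's feature vectors (illustrative stand-in)."""
    mean: list
    count: int = 1

    def distance(self, feature):
        # Euclidean distance between the model mean and a new feature vector.
        return dist(self.mean, feature)

    def update(self, feature):
        # Incremental mean update: the model is renewed with each accepted window.
        self.count += 1
        self.mean = [m + (x - m) / self.count for m, x in zip(self.mean, feature)]


def segment_by_speaker(features, window_s=1.0, threshold=1.0):
    """Return (start_s, end_s) clips attributed to the speaker of the first window."""
    model = SpeakerModel(mean=list(features[0]))  # seed the model from the first window
    clips = []
    start = None
    for i, feature in enumerate(features):
        matches = i == 0 or model.distance(feature) <= threshold
        if matches:
            if i > 0:
                model.update(feature)  # renew the model without waiting for all audio
            if start is None:
                start = i
        elif start is not None:
            # The speaker stopped matching: close the current clip.
            clips.append((start * window_s, i * window_s))
            start = None
    if start is not None:
        clips.append((start * window_s, len(features) * window_s))
    return clips


windows = [[0.0, 0.0], [0.2, 0.1], [5.0, 5.0], [5.1, 4.9], [0.1, 0.0]]
clips = segment_by_speaker(windows, window_s=1.0, threshold=1.0)
```

Because the clips are expressed as time ranges, the same boundaries could be applied to a video track that shares the audio timeline, which is how the sketch relates to producing audio and video clips together.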

Detailed description

Bibliographic details
Main authors:	Wang Chun-Lin, Liu Chi-Shi, Lin Chih-Jung
Format:	Patent
Language:	eng
Subjects:	ACOUSTICS; MUSICAL INSTRUMENTS; PHYSICS; SPEECH ANALYSIS OR SYNTHESIS; SPEECH OR AUDIO CODING OR DECODING; SPEECH OR VOICE PROCESSING; SPEECH RECOGNITION

creator	Wang Chun-Lin ; Liu Chi-Shi ; Lin Chih-Jung
description	A method for segmenting video and audio into clips using speaker recognition is provided to segment audio according to speaker audio, and to make audio clips correspond to the audio and video signals to generate audio and video clips. The method instantly trains an independent speaker model by increasing an unknown speaker source audio signal, and the speaker recognition result is applied to determine the audio and video clips. Independent speaker clips of source audio are determined according to the speaker model, and the speaker model is renewed according to the independent speaker clips of source audio. This method segments audio by the speaker model without waiting for complete speaker feature audio signals to be collected. The method is also able to segment the audio and video into clips based on the recognition result of speaker audio, and can be used to segment TV audio and video into clips.
format	Patent
fullrecord	<record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_US9741345B2</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>US9741345B2</sourcerecordid><originalsourceid>FETCH-epo_espacenet_US9741345B23</originalsourceid><addsrcrecordid>eNqNyrEOAUEQh_FrFIJ3mBdQcCeiJURBhfqy2f3fmjgzm509z4_EA6i-r_iNq9MZ5a6BOs1kiE9IYYn04gA1chLIDYE_y1KUfM_JaLAvsQT3QKYMr1G4sMq0GnWuN8x-nVR02F93xzmStrDkPASlvV0262ZRN6vtsv6DvAFRYDXp</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Method for segmenting videos and audios into clips using speaker recognition</title><source>esp@cenet</source><creator>Wang Chun-Lin ; Liu Chi-Shi ; Lin Chih-Jung</creator><creatorcontrib>Wang Chun-Lin ; Liu Chi-Shi ; Lin Chih-Jung</creatorcontrib><description>A method for segmenting video and audio into clips using speaker recognition is provided to segment audio according to speaker audio, and to make audio clips correspond to the audio and video signals to generate audio and video clips. The method instantly trains an independent speaker model by increasing an unknown speaker source audio signal, and the speaker recognition result is applied to determine the audio and video clips. Independent speaker clips of source audio are determined according to the speaker model and the speaker model is renewed according the independent speaker clips of source audio. This method segments audio by the speaker model without waiting for complete speaker feature audio signals to be collected. 
The method is also able to segment the audio and video into clips based on the recognition result of speaker audio, and can be used to segment TV audio and video into clips.</description><language>eng</language><subject>ACOUSTICS ; MUSICAL INSTRUMENTS ; PHYSICS ; SPEECH ANALYSIS OR SYNTHESIS ; SPEECH OR AUDIO CODING OR DECODING ; SPEECH OR VOICE PROCESSING ; SPEECH RECOGNITION</subject><creationdate>2017</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20170822&DB=EPODOC&CC=US&NR=9741345B2$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,776,881,25544,76293</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20170822&DB=EPODOC&CC=US&NR=9741345B2$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>Wang Chun-Lin</creatorcontrib><creatorcontrib>Liu Chi-Shi</creatorcontrib><creatorcontrib>Lin Chih-Jung</creatorcontrib><title>Method for segmenting videos and audios into clips using speaker recognition</title><description>A method for segmenting video and audio into clips using speaker recognition is provided to segment audio according to speaker audio, and to make audio clips correspond to the audio and video signals to generate audio and video clips. The method instantly trains an independent speaker model by increasing an unknown speaker source audio signal, and the speaker recognition result is applied to determine the audio and video clips. Independent speaker clips of source audio are determined according to the speaker model and the speaker model is renewed according the independent speaker clips of source audio. 
This method segments audio by the speaker model without waiting for complete speaker feature audio signals to be collected. The method is also able to segment the audio and video into clips based on the recognition result of speaker audio, and can be used to segment TV audio and video into clips.</description><subject>ACOUSTICS</subject><subject>MUSICAL INSTRUMENTS</subject><subject>PHYSICS</subject><subject>SPEECH ANALYSIS OR SYNTHESIS</subject><subject>SPEECH OR AUDIO CODING OR DECODING</subject><subject>SPEECH OR VOICE PROCESSING</subject><subject>SPEECH RECOGNITION</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2017</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNqNyrEOAUEQh_FrFIJ3mBdQcCeiJURBhfqy2f3fmjgzm509z4_EA6i-r_iNq9MZ5a6BOs1kiE9IYYn04gA1chLIDYE_y1KUfM_JaLAvsQT3QKYMr1G4sMq0GnWuN8x-nVR02F93xzmStrDkPASlvV0262ZRN6vtsv6DvAFRYDXp</recordid><startdate>20170822</startdate><enddate>20170822</enddate><creator>Wang Chun-Lin</creator><creator>Liu Chi-Shi</creator><creator>Lin Chih-Jung</creator><scope>EVB</scope></search><sort><creationdate>20170822</creationdate><title>Method for segmenting videos and audios into clips using speaker recognition</title><author>Wang Chun-Lin ; Liu Chi-Shi ; Lin Chih-Jung</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_US9741345B23</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>eng</language><creationdate>2017</creationdate><topic>ACOUSTICS</topic><topic>MUSICAL INSTRUMENTS</topic><topic>PHYSICS</topic><topic>SPEECH ANALYSIS OR SYNTHESIS</topic><topic>SPEECH OR AUDIO CODING OR DECODING</topic><topic>SPEECH OR VOICE PROCESSING</topic><topic>SPEECH RECOGNITION</topic><toplevel>online_resources</toplevel><creatorcontrib>Wang Chun-Lin</creatorcontrib><creatorcontrib>Liu Chi-Shi</creatorcontrib><creatorcontrib>Lin Chih-Jung</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote 
Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Wang Chun-Lin</au><au>Liu Chi-Shi</au><au>Lin Chih-Jung</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Method for segmenting videos and audios into clips using speaker recognition</title><date>2017-08-22</date><risdate>2017</risdate><abstract>A method for segmenting video and audio into clips using speaker recognition is provided to segment audio according to speaker audio, and to make audio clips correspond to the audio and video signals to generate audio and video clips. The method instantly trains an independent speaker model by increasing an unknown speaker source audio signal, and the speaker recognition result is applied to determine the audio and video clips. Independent speaker clips of source audio are determined according to the speaker model and the speaker model is renewed according the independent speaker clips of source audio. This method segments audio by the speaker model without waiting for complete speaker feature audio signals to be collected. The method is also able to segment the audio and video into clips based on the recognition result of speaker audio, and can be used to segment TV audio and video into clips.</abstract><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
language	eng
recordid	cdi_epo_espacenet_US9741345B2
source	esp@cenet
subjects	ACOUSTICS ; MUSICAL INSTRUMENTS ; PHYSICS ; SPEECH ANALYSIS OR SYNTHESIS ; SPEECH OR AUDIO CODING OR DECODING ; SPEECH OR VOICE PROCESSING ; SPEECH RECOGNITION
title	Method for segmenting videos and audios into clips using speaker recognition
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-27T17%3A16%3A31IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=Wang%20Chun-Lin&rft.date=2017-08-22&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EUS9741345B2%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true