SYSTEMS AND METHODS FOR A TWO PASS DIARIZATION, AUTOMATIC SPEECH RECOGNITION, AND TRANSCRIPT GENERATION

In one embodiment, a method for transcript generation includes receiving an audio file and dividing it into a plurality of chunks. The method further includes sending each instance of the plurality of chunks to a speech service module. The method further includes converting speech to text for each i...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	ROBICHAUD, Jean-Philippe, SKURIKHIN, Alexei, STANISLAVOVICH, Petrov Evgeny, JETTÉ, Migüel
Format:	Patent
Sprache:	eng ; fre
Schlagworte:	ACOUSTICS MEASURING MEASURING FORCE, STRESS, TORQUE, WORK, MECHANICAL POWER,MECHANICAL EFFICIENCY, OR FLUID PRESSURE MUSICAL INSTRUMENTS PHYSICS SPEECH ANALYSIS OR SYNTHESIS SPEECH OR AUDIO CODING OR DECODING SPEECH OR VOICE PROCESSING SPEECH RECOGNITION TESTING
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	In one embodiment, a method for transcript generation includes receiving an audio file and dividing it into a plurality of chunks. The method further includes sending each instance of the plurality of chunks to a speech service module. The method further includes converting speech to text for each instance of the plurality of chunks and returning the text for each instance of the plurality of chunks. The method further includes merging the text for each instance of the plurality of chunks to yield an audio file transcript and sending the audio file and chunks to a diarization module. The method further includes performing first pass diarization on the chunks to yield a plurality of diarized chunks and performing second pass diarization on the plurality of diarized chunks and the audio file to yield a diarized audio file. The method further includes merging the files to yield a final transcript. Selon un mode de réalisation, la présente invention concerne un procédé de génération de transcription comprenant la réception d'un fichier audio et la division dudit fichier en une pluralité de segments. Le procédé consiste en outre à envoyer chaque instance de la pluralité de segments à un module de service vocal. Le procédé comprend en outre la conversion de la parole en texte pour chaque instance de la pluralité de fragments et le renvoi du texte pour chaque instance de la pluralité de segments. Le procédé comprend en outre la fusion du texte pour chaque instance de la pluralité de segments afin d'obtenir une transcription du fichier audio et l'envoi du fichier audio et des segments à un module de segmentation et regroupement. Le procédé comprend en outre la réalisation d'une opération de segmentation et regroupement de premier passage sur les segments pour obtenir une pluralité de segments segmentés et regroupés et la réalisation d'une seconde opération de segmentation et regroupement de second passage sur la pluralité de segments segmentés et regroupés et le fichier audio pour obtenir un fichier audio segmenté et regroupé. Le procédé comprend en outre la fusion des fichiers pour obtenir une transcription finale.