AUTOMATICALLY IDENTIFYING SPEAKERS IN REAL-TIME THROUGH MEDIA PROCESSING WITH DIALOG UNDERSTANDING SUPPORTED BY AI TECHNIQUES

Bibliographic details
Main authors: Salles Chevitarese, Daniel; Fontoura de Gusmao Cerqueira, Renato; Hultmann Ayala, Helon Vicente; Ferreira Moreno, Marcio; de Mello Brandao, Rafael R
Format: Patent
Language: eng
Description
Summary: Automatically identifying speakers in real time through media processing with dialog understanding. A plurality of audio streams may be received, an audio stream representing speech of a participant speaking during an online meeting. A voice characteristic of the voice corresponding to the participant's speech in the audio stream may be determined. The plurality of audio streams may be converted into text, and natural language processing may be performed to determine the content context of the dialog. The natural language processing infers a name to associate with the voice in the audio stream based on the determined content context. A data structure linking the name with the voice may be created and stored in a knowledge base. A user interface associated with the online meeting application is triggered to present the name or identity of the speaker.
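
The flow described in the summary (infer a name from the transcribed dialog, link it to a voice, store the link, and use it to label the speaker) can be illustrated with a minimal sketch. This is not the patented method: the regular expression is a toy stand-in for the natural language processing step and only catches self-introductions, `voice_id` is a hypothetical placeholder for a real voice characteristic, and `KnowledgeBase`, `infer_name`, and `label_utterance` are names invented here for illustration.

```python
import re
from dataclasses import dataclass, field
from typing import Dict, Optional

# Toy stand-in for the NLP step (an assumption, not the patent's technique):
# match a self-introduction phrase, case-insensitively, followed by a
# capitalized word that is taken to be the speaker's name.
_INTRO = re.compile(r"\b(?i:my name is|this is|i am|i'm)\s+([A-Z][a-z]+)")


def infer_name(transcript: str) -> Optional[str]:
    """Return the name a speaker gives for themselves, or None."""
    match = _INTRO.search(transcript)
    return match.group(1) if match else None


@dataclass
class KnowledgeBase:
    """Hypothetical store linking a voice identifier to an inferred name."""
    names: Dict[str, str] = field(default_factory=dict)

    def link(self, voice_id: str, name: str) -> None:
        self.names[voice_id] = name

    def lookup(self, voice_id: str) -> Optional[str]:
        return self.names.get(voice_id)


def label_utterance(kb: KnowledgeBase, voice_id: str, transcript: str) -> str:
    """Update the knowledge base from one transcribed utterance and
    return the label a meeting UI could show for this voice."""
    name = infer_name(transcript)
    if name is not None:
        kb.link(voice_id, name)
    return kb.lookup(voice_id) or "Unknown speaker"


kb = KnowledgeBase()
print(label_utterance(kb, "voice-1", "Hi, my name is Daniel."))
print(label_utterance(kb, "voice-1", "Next slide please."))
```

Once a name has been linked to a voice, later utterances from the same voice reuse the stored label even when they contain no name. A real system would replace `voice_id` with a characteristic computed from the audio and would also consider other cues in the dialog, such as one participant addressing another by name.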