Phonetic Transcription using Speech Recognition Technique Considering Variations in Pronunciation

We propose a new approach for performing phonetic transcription of speech and text that combines automatic speech recognition (ASR) and grapheme-to-phoneme (G2P) techniques. By augmenting the text with speech and using automatic speech recognition with a sausage searching net constructed from multip...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Min-Siong Liang, Ren-Yuan Lyu, Yuang-Chin Chiang
Format:	Tagungsbericht
Sprache:	eng
Schlagworte:	Automatic Phonetic Transcription Automatic speech recognition Chinese Computer science Dialect Error analysis Flowcharts Humans Pronunciation Variation Spatial databases Speech processing Speech recognition Statistics Taiwanese Vocabulary
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	We propose a new approach for performing phonetic transcription of speech and text that combines automatic speech recognition (ASR) and grapheme-to-phoneme (G2P) techniques. By augmenting the text with speech and using automatic speech recognition with a sausage searching net constructed from multiple text pronunciations corresponding to human speech utterance, we are able to reduce the effort for phonetic transcription. By using a multiple pronunciation lexicon, a transcription error rate of 12.74% was achieved. Further improvement can be achieved by adapting the pronunciation lexicon with pronunciation variation (PV) rules and an error rate reduction of 17.11 % could be achieved.
ISSN:	1520-6149 2379-190X
DOI:	10.1109/ICASSP.2007.367175