ME-Match: Tonal Grouping Based Approach in Cross-Script Name Matching

Even though matching between different scripts could be immensely useful for news organizations, author recognition with cross-script matches in digital libraries and homeland security, it is impossible to automatically match. Now, we propose a new approach, ME-match, for matching the proper names a...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Phyu, K.Z.Z., Tun, K.M.L.
Format: Tagungsbericht
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Even though matching between different scripts could be immensely useful for news organizations, author recognition with cross-script matches in digital libraries and homeland security, it is impossible to automatically match. Now, we propose a new approach, ME-match, for matching the proper names across different scripts. The foremost concept of our approach is to match them via phoneme strings. The main steps in ME-match are creation of bilingual pronouncing mapping, tokenization of query names, transformation of query names to IPA forms based on tonal grouping approach, searching possible various words in both scripts for each query IPA phoneme string, combination of various words to become full name strings and then searching names. The performance is measured by standard information-retrieval metrics: recall, precision, and f-measures.
DOI:10.1109/ICFCC.2009.24