Machine learning for transliteration

Methods, systems, and apparatus, including computer program products, for automatically identifying transliteration pairs are disclosed. In one implementation, a method is provided. The method includes receiving a plurality of resources, the plurality of resources including a plurality of anchor tex...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: ICHIKAWA HIROSHI, BILAC SLAVEN
Format: Patent
Sprache:chi ; eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Methods, systems, and apparatus, including computer program products, for automatically identifying transliteration pairs are disclosed. In one implementation, a method is provided. The method includes receiving a plurality of resources, the plurality of resources including a plurality of anchor text; determining one or more potential transliterations from the plurality of anchor text; and identifying one or more potential transliteration pairs from the one or more potential transliterations, where each potential transliteration pair includes a first anchor text in a first writing system and a second anchor text in a second writing system, the second anchor text and the first anchor text identifying a same resource or location.