Comparison of Chinese Treebanks for Corpus-oriented HPSG Grammar Development

Comparing with the traditional way of manually developing grammar based on lin- guistic theory, corpus-oriented grammar development is more promising. To develop HPSG grammar through the corpus-oriented way, a treebank is an indispensable part. This paper first compares existing Chinese treebanks an...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Information and Media Technologies 2010, Vol.5(2), pp.910-929
Hauptverfasser: Yu, Kun, Miyao, Yusuke, Matsuzaki, Takuya, Wang, Xiangli, Zhang, Yaozhong, Uchimoto, Kiyotaka, Tsujii, Junichi
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Comparing with the traditional way of manually developing grammar based on lin- guistic theory, corpus-oriented grammar development is more promising. To develop HPSG grammar through the corpus-oriented way, a treebank is an indispensable part. This paper first compares existing Chinese treebanks and chooses one of them as the basic resource for HPSG grammar development. Then it proposes a new design of part-of-speech tags based on the assumption that it is not only simple enough to re-duce ambiguity of morphological analysis as much as possible, but also rich enough for HPSG grammar development. Finally, it introduces some on-going work about utilizing a Chinese scientific paper treebank in HPSG grammar development.
ISSN:1881-0896
DOI:10.11185/imt.5.910