Comparison of Chinese Treebanks for Corpus-oriented HPSG Grammar Development
Comparing with the traditional way of manually developing grammar based on lin- guistic theory, corpus-oriented grammar development is more promising. To develop HPSG grammar through the corpus-oriented way, a treebank is an indispensable part. This paper first compares existing Chinese treebanks an...
Gespeichert in:
Veröffentlicht in: | Information and Media Technologies 2010, Vol.5(2), pp.910-929 |
---|---|
Hauptverfasser: | , , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Comparing with the traditional way of manually developing grammar based on lin- guistic theory, corpus-oriented grammar development is more promising. To develop HPSG grammar through the corpus-oriented way, a treebank is an indispensable part. This paper first compares existing Chinese treebanks and chooses one of them as the basic resource for HPSG grammar development. Then it proposes a new design of part-of-speech tags based on the assumption that it is not only simple enough to re-duce ambiguity of morphological analysis as much as possible, but also rich enough for HPSG grammar development. Finally, it introduces some on-going work about utilizing a Chinese scientific paper treebank in HPSG grammar development. |
---|---|
ISSN: | 1881-0896 |
DOI: | 10.11185/imt.5.910 |