Kuala Lumpur Travel blogs Dataset
This dataset contains three folders: 1) Training: The first sub-folder "raw training files" contains travel text extracted from 36 travel blog posts related to Kuala Lumpur. The second sub-folder "labeled files" consists of .xml version of raw text files containing 500 annotated...
Gespeichert in:
1. Verfasser: | |
---|---|
Format: | Dataset |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | This dataset contains three folders: 1) Training: The first sub-folder "raw training files" contains travel text extracted from 36 travel blog posts related to Kuala Lumpur. The second sub-folder "labeled files" consists of .xml version of raw text files containing 500 annotated spatial triplets as "trajector, spatial indicator, landmark" for spatial relation extraction. 2) Testing: The first sub-folder "raw testing files" contains travel text extracted from 10 travel blog posts related to Kuala Lumpur. The second sub-folder "labeled files" is the gold standard for evaluation consists of .xml version of raw text files containing 200 annotated spatial triplets as "trajector, spatial relation, landmark". 3) Related files: This folder contains annotation scheme definition (.xml) for training and testing files. |
---|---|
DOI: | 10.17632/9wb5rv45j5.1 |