Kuala Lumpur Travel blogs Dataset

This dataset contains three folders: 1) Training: The first sub-folder "raw training files" contains travel text extracted from 36 travel blog posts related to Kuala Lumpur. The second sub-folder "labeled files" consists of .xml version of raw text files containing 500 annotated...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
1. Verfasser: Haris, Erum
Format: Dataset
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:This dataset contains three folders: 1) Training: The first sub-folder "raw training files" contains travel text extracted from 36 travel blog posts related to Kuala Lumpur. The second sub-folder "labeled files" consists of .xml version of raw text files containing 500 annotated spatial triplets as "trajector, spatial indicator, landmark" for spatial relation extraction. 2) Testing: The first sub-folder "raw testing files" contains travel text extracted from 10 travel blog posts related to Kuala Lumpur. The second sub-folder "labeled files" is the gold standard for evaluation consists of .xml version of raw text files containing 200 annotated spatial triplets as "trajector, spatial relation, landmark". 3) Related files: This folder contains annotation scheme definition (.xml) for training and testing files.
DOI:10.17632/9wb5rv45j5.1