Text Segmentation Based on PLSA-TextTiling Model

Text segmentation is very important for many fields including information retrieval, summarization, language modeling, anaphora resolution and so on. Text segmentation based on PLSA-TextTiling associates different latent topic swith observable pairs of word and sentence. In the experiments, the whol...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Applied Mechanics and Materials 2014-05, Vol.556-562, p.4018-4022
1. Verfasser: Zheng, Yu Chao
Format: Artikel
Sprache:eng
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Text segmentation is very important for many fields including information retrieval, summarization, language modeling, anaphora resolution and so on. Text segmentation based on PLSA-TextTiling associates different latent topic swith observable pairs of word and sentence. In the experiments, the whole sentences are taken as elementary blocks. PLSA model is used to calculated similarity metric basing on the idea of TestTiling and several approaches to discovering boundaries are tried. The results show the Pμ value is 0.87, which is better than that of other algorithms of text segmentation.
ISSN:1660-9336
1662-7482
1662-7482
DOI:10.4028/www.scientific.net/AMM.556-562.4018