Combining a Segmentation-Like Approach and a Density-Based Approach in Content Extraction

Density-based approaches in content extraction, whose task is to extract contents from Web pages, are commonly used to obtain page contents that are critical to many Web mining applications. How- ever, traditional density-based approaches cannot effectively manage pages that contain short contents a...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Tsinghua science and technology 2012-06, Vol.17 (3), p.256-264
Hauptverfasser: Lin, Shuang, Chen, Jie, Niu, Zhendong
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!