Web page segmentation method and web page segmentation device
Main Authors: | , , , , , |
---|---|
Format: | Patent |
Language: | Chinese ; English |
Subjects: | |
Summary: | The invention relates to internet technology and provides a web page segmentation method and a web page segmentation device that address two defects of prior page segmentation techniques: damage to the web page structure and low segmentation efficiency. The web page segmentation method comprises three steps: DOM tree construction, venation aggregation construction, and combination. The DOM tree construction step builds a DOM tree corresponding to the original web page. The venation aggregation construction step builds one venation aggregation for each leaf node of the DOM tree; each venation aggregation contains the root node, that leaf node, and all intermediate nodes between the root node and the leaf node. The invention also provides the web page segmentation device, which segments a web page by constructing the DOM tree and working from it. The web page segmentation method and the web page segmentat |
---|
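
The first two steps named in the summary (DOM tree construction and venation aggregation construction) can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `Node`, `DomBuilder`, `build_dom`, `leaves`, `venation`, and `venation_aggregations` are hypothetical, the tree is built with the standard-library `html.parser`, and treating text nodes as leaves is an assumption. The combination step is not described in the available summary text, so it is left out.

```python
from html.parser import HTMLParser

# Elements that never have children or a closing tag (assumed subset of HTML void elements).
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


class Node:
    """A DOM tree node that knows its parent and children (hypothetical helper)."""

    def __init__(self, tag, parent=None):
        self.tag = tag
        self.parent = parent
        self.children = []
        if parent is not None:
            parent.children.append(self)


class DomBuilder(HTMLParser):
    """Builds a simplified DOM tree; attributes are ignored in this sketch."""

    def __init__(self):
        super().__init__()
        self.root = Node("#document")
        self.current = self.root

    def handle_starttag(self, tag, attrs):
        node = Node(tag, self.current)
        if tag not in VOID_TAGS:
            self.current = node  # descend into the newly opened element

    def handle_endtag(self, tag):
        # Walk up to the nearest open element with this tag; ignore stray end tags.
        node = self.current
        while node is not self.root and node.tag != tag:
            node = node.parent
        if node is not self.root:
            self.current = node.parent

    def handle_data(self, data):
        if data.strip():
            Node("#text", self.current)  # assumption: text nodes count as leaves


def build_dom(html):
    """Step 1: construct a DOM tree for the original page."""
    builder = DomBuilder()
    builder.feed(html)
    builder.close()
    return builder.root


def leaves(node):
    """Yield the leaf nodes of the tree in document order."""
    if not node.children:
        yield node
    for child in node.children:
        yield from leaves(child)


def venation(leaf):
    """Return the root, all intermediate nodes, and the leaf, ordered root first."""
    path = []
    node = leaf
    while node is not None:
        path.append(node)
        node = node.parent
    path.reverse()
    return path


def venation_aggregations(root):
    """Step 2: one venation aggregation per leaf node."""
    return [venation(leaf) for leaf in leaves(root)]
```

For a small page such as `<html><body><div><p>Hi</p><img src='a.png'></div><span>x</span></body></html>`, calling `venation_aggregations(build_dom(page))` yields three root-to-leaf paths, one ending at the text inside `p`, one at `img`, and one at the text inside `span`. Storing a parent pointer on every node keeps the path lookup linear in the depth of the leaf.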