Text node determination method and apparatus
The invention provides a text node determination method and apparatus. The method comprises the steps of forming at least one webpage template; obtaining at least two target webpages corresponding to target webpage templates; obtaining node information corresponding to each webpage node in at least...
Gespeichert in:
Hauptverfasser: | , |
---|---|
Format: | Patent |
Sprache: | chi ; eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | The invention provides a text node determination method and apparatus. The method comprises the steps of forming at least one webpage template; obtaining at least two target webpages corresponding to target webpage templates; obtaining node information corresponding to each webpage node in at least two target webpages; according to the node information corresponding to each webpage node, calculating a text density ratio corresponding to each webpage node; according to the target webpage templates and the text density ratio corresponding to each webpage node, calculating an average text density ratio corresponding to each group of webpage nodes corresponding to one another, and determining one group of webpage nodes with the maximum average text density ratio; and according to the group of the webpage nodes with the maximum average text density ratio, determining a text node corresponding to each webpage corresponding to the target webpage template. Through the technical scheme provided by the invention, the t |
---|