Information capturing method and device
The embodiment of the invention provides an information capturing method and device. The information capturing method comprises the following steps: counting an information website list, and saving a list page corresponding to an information website in a list page database in a first database, where...
Gespeichert in:
Hauptverfasser: | , , , , , , , , , |
---|---|
Format: | Patent |
Sprache: | chi ; eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | The embodiment of the invention provides an information capturing method and device. The information capturing method comprises the following steps: counting an information website list, and saving a list page corresponding to an information website in a list page database in a first database, wherein contrasting relation between the information website and a corresponding URL address is saved in the list page; reading contents of the list page from the first database, capturing detail page link addresses conform to a default capturing strategy, and saving the captured detail page link addresses in a detail page database in the first database; allocating the detail page link addresses to different capturing machines for capturing, and saving captured webpage detail data in a second database; and capturing corresponding webpage detail data from the second database according to a database status code in the first database, extracting a target field, and saving the target field in a target format. According to t |
---|