Information capturing method and device

The embodiment of the invention provides an information capturing method and device. The information capturing method comprises the following steps: counting an information website list, and saving a list page corresponding to an information website in a list page database in a first database, where...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: LI HONGMEI, LUO YONGJIAN, WU HAO, XIE JINGPENG, DU XIAOMENG, LIU YU, DANG TUO, ZHANG YANG, TAN SHUGUO, ZHANG JIANZHI
Format: Patent
Sprache:chi ; eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The embodiment of the invention provides an information capturing method and device. The information capturing method comprises the following steps: counting an information website list, and saving a list page corresponding to an information website in a list page database in a first database, wherein contrasting relation between the information website and a corresponding URL address is saved in the list page; reading contents of the list page from the first database, capturing detail page link addresses conform to a default capturing strategy, and saving the captured detail page link addresses in a detail page database in the first database; allocating the detail page link addresses to different capturing machines for capturing, and saving captured webpage detail data in a second database; and capturing corresponding webpage detail data from the second database according to a database status code in the first database, extracting a target field, and saving the target field in a target format. According to t