COORDINATING SCHEDULES OF CRAWLING DOCUMENTS BASED ON METADATA ADDED TO THE DOCUMENTS BY TEXT MINING

A computer-implemented method, a computer program product, and a computer system for coordinating schedules of crawling documents based on metadata added to documents by text mining. A computer system determines whether added metadata in internal documents or metadata in original documents necessita...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: WATANABE, KENTA, Kurokawa, Yoshitaka, MIYAMOTO, TAIHEI, Tashiro, Takahito
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:A computer-implemented method, a computer program product, and a computer system for coordinating schedules of crawling documents based on metadata added to documents by text mining. A computer system determines whether added metadata in internal documents or metadata in original documents necessitates that the original documents in at least two of respective data sources be crawled by an application with a same crawling schedule. A computer system changes respective crawling schedules of at least two of the respective data sources to the same crawling schedule, in response to determining that the same crawling schedule is needed. A computer system crawls the original documents in at least two of the respective data sources, according to the same crawling schedule.