AGENT SYSTEM AND METHOD FOR CRAWLING SCHEDULE ON THE WEB

Bibliographic details
Main authors:	CHOI, GYUNG RIM; PARK, NAM KYU; KIM, HYUN SOO; KANG, MOO HONG
Format:	Patent
Language:	English
Subjects:	CALCULATING; COMPUTING; COUNTING; ELECTRIC DIGITAL DATA PROCESSING; PHYSICS

Description
Summary:	An agent system and method for crawling schedule information on the web are provided. By actively searching for and recognizing web pages that provide schedule information, the system collects and offers the schedule information of each transportation mode in real time, supporting the design of an effective complex transportation network. A schedule information providing page searcher (200) extracts URLs (Uniform Resource Locators) from HTML (HyperText Markup Language) documents based on rules, searches HTML pages, and recognizes schedule information providing pages. A rule base (210) provides the rules needed for recognizing schedule information providing pages, detecting data changes in the recognized pages, and collecting the schedule information. A schedule page information database (220) stores information on the schedule information providing pages recognized by the searcher. A schedule information collector (230) collects the schedule information by detecting data changes in the stored, recognized pages. A schedule database (240) stores the collected schedule information.
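
The five components named in the summary can be illustrated with a minimal Python sketch. It is not the patented implementation: the keyword rules, the row pattern, the SHA-256 change check, the in-memory dictionaries standing in for the two databases, and every class and function name below are assumptions made for illustration. The fetch function is passed in by the caller so the sketch runs without network access.

```python
import hashlib
import re
from collections import deque
from dataclasses import dataclass
from html.parser import HTMLParser
from urllib.parse import urljoin


@dataclass
class RuleBase:
    """Stand-in for the rule base (210); all rules here are assumed."""
    page_keywords: tuple = ("schedule", "timetable", "departure", "arrival")
    min_hits: int = 2  # keywords required to recognize a schedule page
    # Hypothetical collection rule: "<origin> <destination> <HH:MM>".
    row_pattern: str = r"\b(\w+)\s+(\w+)\s+(\d{2}:\d{2})\b"


class LinkExtractor(HTMLParser):
    """Collects href values of <a> tags, resolved against a base URL."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(urljoin(self.base_url, href))


class TextExtractor(HTMLParser):
    """Collects the text content of a page, ignoring the markup."""

    def __init__(self):
        super().__init__()
        self.parts = []

    def handle_data(self, data):
        self.parts.append(data)


def page_text(html):
    """Return the page text with whitespace collapsed to single spaces."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(" ".join(parser.parts).split())


def is_schedule_page(html, rules):
    """Recognize a schedule page by counting rule keywords in its text."""
    text = page_text(html).lower()
    return sum(kw in text for kw in rules.page_keywords) >= rules.min_hits


def search_schedule_pages(seed_url, fetch, rules, max_pages=50):
    """Page searcher (200): breadth-first crawl returning recognized URLs."""
    queue, seen, found, fetched = deque([seed_url]), {seed_url}, [], 0
    while queue and fetched < max_pages:
        url = queue.popleft()
        html = fetch(url)  # fetch returns the HTML string, or None on failure
        fetched += 1
        if html is None:
            continue
        if is_schedule_page(html, rules):
            found.append(url)
        extractor = LinkExtractor(url)
        extractor.feed(html)
        extractor.close()
        for link in extractor.links:
            if link not in seen:
                seen.add(link)
                queue.append(link)
    return found


def collect_if_changed(url, fetch, rules, page_db, schedule_db):
    """Collector (230): re-collect only when the page's hash has changed.

    page_db stands in for the schedule page information database (220),
    schedule_db for the schedule database (240); both are plain dicts.
    """
    html = fetch(url)
    if html is None:
        return False
    digest = hashlib.sha256(html.encode("utf-8")).hexdigest()
    if page_db.get(url) == digest:
        return False  # no data change detected
    page_db[url] = digest
    schedule_db[url] = re.findall(rules.row_pattern, page_text(html))
    return True
```

In this sketch a caller would first run search_schedule_pages from a seed URL, then call collect_if_changed periodically for each recognized URL. Hashing the whole page is the simplest change detector; it also fires on changes unrelated to the schedule, so a rule that hashes only the schedule table would be a natural refinement.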