AGENT SYSTEM AND METHOD FOR CRAWLING SCHEDULE ON THE WEB

An agent system for crawling schedule information in the web and a method thereof are provided to collect and offer the schedule information of each transportation mode in real-time to design an effective complex transportation network by actively searching and recognizing pages providing the schedu...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: CHOI, GYUNG RIM, PARK, NAM KYU, KIM, HYUN SOO, KANG, MOO HONG
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:An agent system for crawling schedule information in the web and a method thereof are provided to collect and offer the schedule information of each transportation mode in real-time to design an effective complex transportation network by actively searching and recognizing pages providing the schedule information in the web. A schedule information providing page searcher(200) extracts a URL(Uniform Resource Locator) from an HTML(HyperText Markup Language) document based on a rule, searches an HTML page, and recognizes a schedule information providing page. A rule-base(210) provides the rules needed for recognizing the schedule information providing page, detecting data change of the recognized schedule information providing page, and collecting the schedule information. A schedule page information database(220) stores the schedule providing page information recognized by the schedule information providing page searcher. A schedule information collector(230) collects the schedule information by detecting the data change in the stored recognized schedule information providing page. A schedule database(240) stores the collected schedule information.