AGENT SYSTEM AND METHOD FOR CRAWLING SCHEDULE ON THE WEB
An agent system for crawling schedule information in the web and a method thereof are provided to collect and offer the schedule information of each transportation mode in real-time to design an effective complex transportation network by actively searching and recognizing pages providing the schedu...
Gespeichert in:
Hauptverfasser: | , , , |
---|---|
Format: | Patent |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | An agent system for crawling schedule information in the web and a method thereof are provided to collect and offer the schedule information of each transportation mode in real-time to design an effective complex transportation network by actively searching and recognizing pages providing the schedule information in the web. A schedule information providing page searcher(200) extracts a URL(Uniform Resource Locator) from an HTML(HyperText Markup Language) document based on a rule, searches an HTML page, and recognizes a schedule information providing page. A rule-base(210) provides the rules needed for recognizing the schedule information providing page, detecting data change of the recognized schedule information providing page, and collecting the schedule information. A schedule page information database(220) stores the schedule providing page information recognized by the schedule information providing page searcher. A schedule information collector(230) collects the schedule information by detecting the data change in the stored recognized schedule information providing page. A schedule database(240) stores the collected schedule information. |
---|