Exploiting agent and database technologies for biological data collection

Web data sources constitute an important resource for biological research. A simple tool that can retrieve information from different Web sites through a single interface and store the extracted data in a standardized format for efficient future use is critical to scientific discovery. We discuss an...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Davulcu, H., Lacroix, Z., Parekh, K., Ramakrishnan, I.V., Julasana, N.
Format: Tagungsbericht
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Web data sources constitute an important resource for biological research. A simple tool that can retrieve information from different Web sites through a single interface and store the extracted data in a standardized format for efficient future use is critical to scientific discovery. We discuss an approach that combines agent and database technologies for biological data integration. To illustrate this, we employ two software tools: WinAgent, for building agents, and dbXML, for XML data management. WinAgent learns from its users by recording a browsing session on Web sites and successive data extraction from regions of interest on retrieved Web pages. The results are stored in a XML document and can be managed, queried and updated using a native XML database system such as dbXML. This approach is currently being evaluated at the Brain Tumor Cancer Unit of the Translational Genomics Research Institute (TGen), Phoenix, Arizona.
ISSN:1529-4188
2378-3915
DOI:10.1109/DEXA.2004.1333503