A Collectional Data Model for Scientific Workflow Composition

Modern scientific computations are usually data intensive, involving large-scale, heterogeneous and structured scientific datasets. Modeling, organizing, and processing scientific data have become key challenges for scientific workflow management systems (SWFMSs). In contrast to business data, which...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Xubo Fei, Shiyong Lu
Format: Tagungsbericht
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Modern scientific computations are usually data intensive, involving large-scale, heterogeneous and structured scientific datasets. Modeling, organizing, and processing scientific data have become key challenges for scientific workflow management systems (SWFMSs). In contrast to business data, which is usually relational and stored in databases, scientific data is often hierarchically organized and collection oriented. Although several data models have been proposed for SWFMSs, none of them provides a formal data model with a set of well-defined operators. In this paper, we take a first step towards formalizing a collection-oriented data model, called collectional data model, to model hierarchical collection oriented scientific data, and a set of well-defined operators to manipulate and query such data. We then apply the collectional data model to VIEW, a dataflow-based scientific workflow composition framework, whose workflow constructs are extended to support collections. We implement our techniques and validate them by a case study in a biological simulation project.
DOI:10.1109/ICWS.2010.93