Rank Aggregation for Automatic Schema Matching

Schema matching is a basic operation of data integration, and several tools for automating it have been proposed and evaluated in the database community. Research in this area reveals that there is no single schema matcher that is guaranteed to succeed in finding a good mapping for all possible doma...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE transactions on knowledge and data engineering 2007-04, Vol.19 (4), p.538-553
Hauptverfasser:	Domshlak, C., Gal, A., Roitman, H.
Format:	Artikel
Sprache:	eng
Schlagworte:	Agglomeration Algorithm design and analysis Algorithms Applied sciences Communities Composing Computer science control theory systems Data processing. List processing. Character string processing Database integration Exact sciences and technology Floods HTML Humans Information systems. Data bases Large-scale systems Mapping Matching Memory organisation. Data processing Performance analysis rank aggregation Ranking schema matching Semantic Web Software Studies Web services XML
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Schema matching is a basic operation of data integration, and several tools for automating it have been proposed and evaluated in the database community. Research in this area reveals that there is no single schema matcher that is guaranteed to succeed in finding a good mapping for all possible domains and, thus, an ensemble of schema matchers should be considered. In this paper, we introduce schema metamatching, a general framework for composing an arbitrary ensemble of schema matchers and generating a list of best ranked schema mappings. Informally, schema metamatching stands for computing a "consensus" ranking of alternative mappings between two schemata, given the "individual" graded rankings provided by several schema matchers. We introduce several algorithms for this problem, varying from adaptations of some standard techniques for general quantitative rank aggregation to novel techniques specific to the problem of schema matching, and to combinations of both. We provide a formal analysis of the applicability and relative performance of these algorithms and evaluate them empirically on a set of real-world schemata
ISSN:	1041-4347 1558-2191
DOI:	10.1109/TKDE.2007.1010