Federated Learning over Harmonized Data Silos
Main authors: | , |
---|---|
Format: | Article |
Language: | eng |
Subjects: | |
Summary: | Federated Learning is a distributed machine learning approach that enables
geographically distributed data silos to collaboratively learn a joint machine
learning model without sharing data. Most of the existing work operates on
unstructured data, such as images or text, or on structured data assumed to be
consistent across the different sites. However, sites often have different
schemata, data formats, data values, and access patterns. The field of data
integration has developed many methods to address these challenges, including
techniques for data exchange and query rewriting using declarative schema
mappings, and for entity linkage. Therefore, we propose an architectural vision
for an end-to-end Federated Learning and Integration system, incorporating the
critical steps of data harmonization and data imputation, to spur further
research at the intersection of data management, information systems, and machine
learning. |
---|---|
DOI: | 10.48550/arxiv.2305.08985 |