A Systematic Review of Intermediate Fusion in Multimodal Deep Learning for Biomedical Applications
Deep learning has revolutionized biomedical research by providing sophisticated methods to handle complex, high-dimensional data. Multimodal deep learning (MDL) further enhances this capability by integrating diverse data types such as imaging, textual data, and genetic information, leading to more...
Gespeichert in:
Hauptverfasser: | , , , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Deep learning has revolutionized biomedical research by providing
sophisticated methods to handle complex, high-dimensional data. Multimodal deep
learning (MDL) further enhances this capability by integrating diverse data
types such as imaging, textual data, and genetic information, leading to more
robust and accurate predictive models. In MDL, differently from early and late
fusion methods, intermediate fusion stands out for its ability to effectively
combine modality-specific features during the learning process. This systematic
review aims to comprehensively analyze and formalize current intermediate
fusion methods in biomedical applications. We investigate the techniques
employed, the challenges faced, and potential future directions for advancing
intermediate fusion methods. Additionally, we introduce a structured notation
to enhance the understanding and application of these methods beyond the
biomedical domain. Our findings are intended to support researchers, healthcare
professionals, and the broader deep learning community in developing more
sophisticated and insightful multimodal models. Through this review, we aim to
provide a foundational framework for future research and practical applications
in the dynamic field of MDL. |
---|---|
DOI: | 10.48550/arxiv.2408.02686 |