Enhancing Biomedical Knowledge Discovery for Diseases: An Open-Source Framework Applied on Rett Syndrome and Alzheimer's Disease
The rapidly increasing number of biomedical publications presents a significant challenge for efficient knowledge discovery. To address this, we introduce an open-source, end-to-end framework designed to automatically extract and construct knowledge about specific diseases directly from unstructured...
Gespeichert in:
Veröffentlicht in: | IEEE access 2024, Vol.12, p.180652-180673 |
---|---|
Hauptverfasser: | , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | The rapidly increasing number of biomedical publications presents a significant challenge for efficient knowledge discovery. To address this, we introduce an open-source, end-to-end framework designed to automatically extract and construct knowledge about specific diseases directly from unstructured text. To facilitate research in disease-related knowledge discovery, we create two annotated datasets focused on Rett syndrome (RS) and Alzheimer's disease (AD), enabling the identification of semantic relations between various biomedical entities. We perform extensive benchmarking to evaluate different approaches for representing relations and entities, providing insights into optimal modeling strategies for semantic relation detection and highlighting language models' competence in knowledge discovery. To gain a deeper understanding of the internal mechanisms of transformer models, we also conduct probing experiments, analyzing different layer representations and attention scores, to explore transformers' ability to capture semantic relations within the text. Both the code and the datasets will be publicly available at https://github.com/christos42/Enhancing-Biomedical-Knowledge-Discovery-for-Diseases , encouraging further research in biomedical knowledge discovery. |
---|---|
ISSN: | 2169-3536 2169-3536 |
DOI: | 10.1109/ACCESS.2024.3509714 |