Identifying "Many-to-Many" Relationships between Gene-Expression Data and Drug-Response Data via Sparse Binary Matching
Published in: | IEEE/ACM transactions on computational biology and bioinformatics 2020-01, Vol.17 (1), p.165-176 |
---|---|
Format: | Article |
Language: | English |
Abstract: | Identifying gene-drug patterns is a critical step in pharmacology for unveiling disease mechanisms and for drug discovery. High-throughput technologies have produced massive pharmacological and genomic datasets, providing a substantial new opportunity to understand how oncogenic genes and therapeutic drugs relate to each other. However, most previous studies used only the pharmacological and genomic datasets, without any prior knowledge, to infer gene-drug patterns. Here, we propose a novel network-guided sparse binary matching model (NSBM) to decode the relationships hidden in these datasets. Our method jointly analyzes large-scale gene-expression data and drug-response data, and it also integrates additional prior information about genes and drugs in the form of network-based regularization. The essential structure of the NSBM model is a convex quadratic minimization problem with network-based penalties. In extensive experiments on both synthetic and empirical data, NSBM outperformed two benchmark methods. Posterior validation, including gene-ontology and enrichment analysis, confirmed the effectiveness of NSBM in revealing gene-drug patterns in a large-scale heterogeneous data source. |
---|---|
ISSN: | 1545-5963 1557-9964 |
DOI: | 10.1109/TCBB.2018.2849708 |