NodeCoder: a graph-based machine learning platform to predict active sites of modeled protein structures
While accurate protein structure predictions are now available for nearly every observed protein sequence, predicted structures lack much of the functional context offered by experimental structure determination. We address this gap with NodeCoder, a task-independent platform that maps residue-based...
Gespeichert in:
Hauptverfasser: | , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | While accurate protein structure predictions are now available for nearly
every observed protein sequence, predicted structures lack much of the
functional context offered by experimental structure determination. We address
this gap with NodeCoder, a task-independent platform that maps residue-based
datasets onto 3D protein structures, embeds the resulting structural feature
into a contact network, and models residue classification tasks with a Graph
Convolutional Network (GCN). We demonstrate the versatility of this strategy by
modeling six separate tasks, with some labels derived from other experimental
structure studies (ligand, peptide, ion, and nucleic acid binding sites) and
other labels derived from annotation databases (post-translational modification
and transmembrane regions). Moreover, A NodeCoder model trained to identify
ligand binding site residues was able to outperform P2Rank, a widely-used
software developed specifically for ligand binding site detection. NodeCoder is
available as an open-source python package at
https://pypi.org/project/NodeCoder/. |
---|---|
DOI: | 10.48550/arxiv.2302.03590 |