Prediction of models for ordered solvent in macromolecular structures by a classifier based upon resolution‐independent projections of local feature data

Current software tools for the automated building of models for macromolecular X‐ray crystal structures are capable of assembling high‐quality models for ordered macromolecule and small‐molecule scattering components with minimal or no user supervision. Many of these tools also incorporate robust fu...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Acta crystallographica. Section D, Biological crystallography. Biological crystallography., 2019-08, Vol.75 (8), p.696-717
Hauptverfasser: Jones, Laurel, Tynes, Michael, Smith, Paul
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Current software tools for the automated building of models for macromolecular X‐ray crystal structures are capable of assembling high‐quality models for ordered macromolecule and small‐molecule scattering components with minimal or no user supervision. Many of these tools also incorporate robust functionality for modelling the ordered water molecules that are found in nearly all macromolecular crystal structures. However, no current tools focus on differentiating these ubiquitous water molecules from other frequently occurring multi‐atom solvent species, such as sulfate, or the automated building of models for such species. PeakProbe has been developed specifically to address the need for such a tool. PeakProbe predicts likely solvent models for a given point (termed a `peak') in a structure based on analysis (`probing') of its local electron density and chemical environment. PeakProbe maps a total of 19 resolution‐dependent features associated with electron density and two associated with the local chemical environment to a two‐dimensional score space that is independent of resolution. Peaks are classified based on the relative frequencies with which four different classes of solvent (including water) are observed within a given region of this score space as determined by large‐scale sampling of solvent models in the Protein Data Bank. Designed to classify peaks generated from difference density maxima, PeakProbe also incorporates functionality for identifying peaks associated with model errors or clusters of peaks likely to correspond to multi‐atom solvent, and for the validation of existing solvent models using solvent‐omit electron‐density maps. When tasked with classifying peaks into one of four distinct solvent classes, PeakProbe achieves greater than 99% accuracy for both peaks derived directly from the atomic coordinates of existing solvent models and those based on difference density maxima. While the program is still under development, a fully functional version is publicly available. PeakProbe makes extensive use of cctbx libraries, and requires a PHENIX licence and an up‐to‐date phenix.python environment for execution. PeakProbe facilitates the automated modelling of ordered solvent in macromolecular crystal structures by analysing features of the electron density and chemical environment surrounding a given coordinate. The extracted data are transformed to a resolution‐independent score space and likely solvent models are predicted based on t
ISSN:2059-7983
0907-4449
2059-7983
1399-0047
DOI:10.1107/S2059798319008933