Establishing a training set through the visual analysis of crystallization trials. Part I: ~150000 images

Bibliographic details
Published in: Acta Crystallographica. Section D, Biological Crystallography, 2008-11, Vol. 64 (11), p. 1123-1130
Main authors: Snell, Edward H, Luft, Joseph R, Potter, Stephen A, Lauricella, Angela M, Gulde, Stacey M, Malkowski, Michael G, Koszelak-Rosenblum, Mary, Said, Meriem I, Smith, Jennifer L, Veatch, Christina K, Collins, Robert J, Franks, Geoff, Thayer, Max, Cumbaa, Christian, Jurisica, Igor, DeTitta, George T
Format: Article
Language: English
Online access: Full text
Description
Summary: Structural crystallography aims to provide a three-dimensional representation of macromolecules. Many parts of the multistep process that produces the three-dimensional structural model have been automated, especially through various structural genomics projects. A key step is the production of crystals for diffraction. The target macromolecule is combined with a large and chemically diverse set of cocktails, some of which ideally, but infrequently, lead to crystallization. These screening experiments produce a variety of outcomes that typically require human interpretation for classification. Human interpretation is neither scalable nor objective, which highlights the need for automated computer-based image classification. As a first step towards automated image classification, 147456 images representing crystallization experiments from 96 different macromolecular samples were manually classified. Each image was classified by three experts into seven predefined categories or their combinations. The subset of the data on which all three observers agree provides one component of a truth set for the development and rigorous testing of automated image-classification systems, and it also provides information about the chemical cocktails used for crystallization. This paper presents the details of the study. Received 1 July 2008, accepted 2 September 2008.
ISSN:0907-4449
DOI:10.1107/S0907444908028047
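
Illustrative sketch: The summary states that each image was scored by three experts, that a score may be one category or a combination of categories, and that the truth-set component is the portion on which all three observers agree. The following is a minimal Python sketch of that selection rule, not the authors' software. The function name `unanimous_truth_set`, the image identifiers, and the category labels used here are hypothetical placeholders; the summary does not name the seven categories. The sketch also assumes that agreement means all three observers chose exactly the same combination.

```python
def unanimous_truth_set(scores):
    """Keep only the images on which all three observers agree exactly.

    scores maps an image id to a list of three label collections,
    one per observer. A collection with several labels represents
    a combination of categories. (Hypothetical data layout.)
    """
    truth = {}
    for image_id, per_observer in scores.items():
        # frozenset makes the comparison independent of label order.
        label_sets = [frozenset(labels) for labels in per_observer]
        if len(label_sets) == 3 and len(set(label_sets)) == 1:
            truth[image_id] = label_sets[0]
    return truth


# Placeholder labels, assumed for illustration only.
scores = {
    "img_0001": [{"crystal"}, {"crystal"}, {"crystal"}],
    "img_0002": [{"precipitate"}, {"precipitate", "crystal"}, {"precipitate"}],
    "img_0003": [{"clear"}, {"clear"}, {"clear"}],
    "img_0004": [{"precipitate", "skin"}, {"skin", "precipitate"}, {"precipitate", "skin"}],
}

truth = unanimous_truth_set(scores)

# Arithmetic from the summary: 147456 images over 96 samples.
images_per_sample = 147456 // 96
```

In this sketch `img_0002` is excluded because one observer added a second category, while `img_0004` is kept because all three observers chose the same combination. Dividing the image count from the summary by the number of samples gives 1536 images per sample.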