Protein crystallization analysis on the World Community Grid

J Struct Funct Genomics. 2010 Mar;11(1):61-9. doi: 10.1007/s10969-009-9076-9. Epub 2010 Jan 14.

Abstract

We have developed an image-analysis and classification system for automatically scoring images from high-throughput protein crystallization trials. Image analysis for this system is performed by the Help Conquer Cancer (HCC) project on the World Community Grid. HCC calculates 12,375 distinct image features on microbatch-under-oil images from the Hauptman-Woodward Medical Research Institute's High-Throughput Screening Laboratory. Using HCC-computed image features and a massive training set of 165,351 hand-scored images, we have trained multiple Random Forest classifiers that accurately recognize multiple crystallization outcomes, including crystals, clear drops, precipitate, and others. The system successfully recognizes 80% of crystal-bearing images, 89% of precipitate images, and 98% of clear drops.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Crystallization / methods
  • Diagnostic Imaging
  • Proteins / chemistry*
  • Proteins / classification

Substances

  • Proteins