Proclass protein family database: new version with motif alignments

Pac Symp Biocomput. 1998:719-30.

Abstract

ProClass is a protein family database which organizes non-redundant sequence entries into families defined collectively by the ProSite patterns and PIR superfamilies. The database consists of about 100,000 entries, more than half of which are classified in about 3,000 families. The new version includes links to various protein family/domain and structural class databases and contains gapped motif alignments for all ProSite patterns. The motif sequences are retrieved from both SwissProt and PIR-international databases, including numerous new members detected by our GeneFIND family identification system. The motif collection represents a 50% increase from those catalogued in ProSite. The ProClass database can be used to maximize family information retrieval, help organize protein sequence databases, and support full-scale genomic annotation. The database and its query program are freely available for on-line record retrieval and direct file transfer from our WWW server at http:/(/)diana.uthct.edu/proclass.html+ ++.

Publication types

  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Amino Acid Sequence
  • Animals
  • DNA-Binding Proteins / chemistry
  • DNA-Binding Proteins / classification
  • DNA-Binding Proteins / genetics
  • Databases, Factual*
  • Humans
  • Molecular Sequence Data
  • Protein Conformation
  • Proteins / chemistry*
  • Proteins / classification*
  • Proteins / genetics
  • Sequence Alignment*
  • Sequence Homology, Amino Acid
  • Software
  • Zinc Fingers

Substances

  • DNA-Binding Proteins
  • Proteins