|
GenProtEC
is dedicated to the functions encoded by the Escherichia coli
K-12 (strain MG1655) genome defined in the GenBank Accession No. NC_000913.2
deposit. Our annotation work includes multiple types of information:
1. Sequence similarity to orthologues as defined by Darwin (start and end of aligned region, identity, and
PAM distance).
2. Resolution of fused proteins into modular units with independent
functions.
3. Identification of sequence similar protein groups within E.
coli that are clustered by transitive relationships. The sequence similarity
is limited to PAM 200 and an alignment of at least 83 amino acids.
4. Updated literature references.
5. Classification of gene products by their gene type and by their
cellular role(s). The MultiFun classification system for cellular roles
is used to assign gene products to one or more roles. MultiFun has been converted to Gene Ontology terms.
6. Familes of proteins related by structure and biochemical reaction
mechanisms (work in progress).
7. SCOP superfamily identification and location (e.g. binding site
domains) for E. coli proteins.
Gene/Protein Query
Search by Gene Name, B-number, ECK Number, Swiss-Prot Accession Number
and ID, Enzyme Nomenclature (E.C. number), Protein Name, Gene Type,
or Physiological Role.
Overview of E. coli genome by Gene
Type distribution
Protein Modules
A list of fused E. coli proteins separated into modular units.
Protein Groups
Sequence related groups of E. coli proteins.
Structurally and biochemically related groups of E. coli
proteins.
MultiFun Classification System
MultiFun a classification system for cellular/physiological roles of gene
products.
MultiFun2GO conversion of Multifun to Gene Ontology terms.
SCOP superfamily assignments to E.
coli proteins
SCOP superfamily distribution.
SCOP assignments.
|