Conserved Domain Database

CDD
Content
DescriptionConserved Domain Database for the functional annotation of proteins.
Contact
Research centerNational Center for Biotechnology Information
AuthorsAron Marchler-Bauer
Primary citationMarchler-Bauer & al. (2013)[1]
Release date2003
Access
Websitehttps://www.ncbi.nlm.nih.gov/Structure/cdd/cdd.shtml

The Conserved Domain Database (CDD) is a database of well-annotated multiple sequence alignment models and derived database search models, for ancient domains and full-length proteins.[1] The database consists of position-specific score matrices and serves as resource for protein annotation such as identification of conserved domain or inference of functional site.[2]

Philosophy

Domains can be thought of as distinct functional and/or structural units of a protein. These two classifications coincide rather often, as a matter of fact, and what is found as an independently folding unit of a polypeptide chain also carries specific function. Domains are often identified as recurring (sequence or structure) units, which may exist in various contexts. In molecular evolution such domains may have been utilized as building blocks, and may have been recombined in different arrangements to modulate protein function. CDD defines conserved domains as recurring units in molecular evolution, the extents of which can be determined by sequence and structure analysis.

The goal of the NCBI conserved domain curation project is to provide database users with insights into how patterns of residue conservation and divergence in a family relate to functional properties, and to provide useful links to more detailed information that may help to understand those sequence/structure/function relationships. To do this, CDD Curators include the following types of information in order to supplement and enrich the traditional multiple sequence alignments that form the foundation of domain models: 3-dimensional structures and conserved core motifs, conserved features/sites, phylogenetic organization, links to electronic literature resources.

Content

CDD content includes NCBI manually curated domain models and domain models imported from a number of external source databases (Pfam, SMART, COG, PRK, TIGRFAMs). What is unique about NCBI-curated domains is that they use 3D-structure information to explicitly define domain boundaries, align blocks, amend alignment details, and provide insights into sequence/structure/function relationships. Manually curated models are organized hierarchically if they describe domain families that are clearly related by common descent. To provide a non-redundant view of the data, CDD clusters similar domain models from various sources into superfamilies.

Searching the database

The collection is also part of NCBI's Entrez query and retrieval system, crosslinked to numerous other resources. CDD provides annotation of domain footprints and conserved functional sites on protein sequences. Precalculated domain annotation can be retrieved for protein sequences tracked in NCBI's Entrez system, and CDD's collection of models can be queried with novel protein sequences via * "the CD-Search service". United States National Center for Biotechnology Information., or at* "the Batch CD-Search". United States National Center for Biotechnology Information., that allows the computation and download of annotation for large sets of protein queries.

References

  1. ^ a b Marchler-Bauer, A.; Zheng, C.; Chitsaz, F.; Derbyshire, M. K.; Geer, L. Y.; Geer, R. C.; Gonzales, N. R.; Gwadz, M.; Hurwitz, D. I.; Lanczycki, C. J.; Lu, F.; Lu, S.; Marchler, G. H.; Song, J. S.; Thanki, N.; Yamashita, R. A.; Zhang, D.; Bryant, S. H. (2012). "CDD: Conserved domains and protein three-dimensional structure". Nucleic Acids Research. 41 (Database issue): D348–D352. doi:10.1093/nar/gks1243. PMC 3531192. PMID 23197659.
  2. ^ Marchler-Bauer, Aron; Lu, Shennan; Anderson, John B.; Chitsaz, Farideh; Derbyshire, Myra K.; DeWeese-Scott, Carol; Fong, Jessica H.; Geer, Lewis Y.; Geer, Renata C.; Gonzales, Noreen R.; Gwadz, Marc; Hurwitz, David I.; Jackson, John D.; Ke, Zhaoxi; Lanczycki, Christopher J. (2011-01-01). "CDD: a Conserved Domain Database for the functional annotation of proteins". Nucleic Acids Research. 39 (suppl_1): D225–D229. doi:10.1093/nar/gkq1189. ISSN 0305-1048. PMC 3013737.

Content Disclaimer

Informasi ini disarikan dari Wikipedia dan disajikan kembali untuk tujuan edukasi. Konten tersedia di bawah lisensi CC BY-SA 3.0. Kami tidak bertanggung jawab atas ketidakakuratan data yang bersumber dari kontribusi publik tersebut.

  1. The information displayed on this website is sourced in part or in whole from Wikipedia and has been adapted for the purpose of restating it. We strive to provide accurate and relevant information, however:
  2. There is no guarantee of absolute accuracy. Wikipedia is an open, collaborative project that can be edited by anyone, so information is subject to change.
  3. It is not intended to constitute professional advice. The content displayed is for informational and educational purposes only. For important decisions (e.g., medical, legal, or financial), please consult a professional.
  4. Content copyright. Wikipedia is licensed under the Creative Commons Attribution-ShareAlike License (CC BY-SA). This means that content may be reused with appropriate attribution and shared under a similar license.
  5. Responsible use. Any risk arising from the use of information from this website is entirely the responsibility of the user.