Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.
Nh3D: A Reference Dataset of Structures of Non-homologous Proteins (RRID:SCR_008212)Copy Citation Copied
URL: http://www.schematikon.org/Nh3D.html
Proper Citation: Nh3D: A Reference Dataset of Structures of Non-homologous Proteins (RRID:SCR_008212)
Description: THIS RESOURCE IS NO LONGER IN SERVICE, documented on July 17, 2013. It is freely available as a reference dataset for the statistical analysis of sequence and structure features of proteins in the PDB. It is a dataset of structurally dissimilar proteins. This dataset has been compiled by selecting well resolved representatives from the Topology level of the CATH database which hierarchically classifies all protein structures. These have been been pruned to remove: i) domains that may contain homologous elements (by pairwise sequence comparison and structural superposition of aligned residues) ii) internal duplications (by repeat detection) iii) regions with high B-Factor The statistical analysis of protein structures requires datasets in which structural features can be considered independently distributed, i.e. not related through common ancestry, and that fulfill minimal requirements regarding the experimental quality of the structures it contains. However, non-redundant datasets based on sequence similarity invariably contain distantly related homologues. Here a reference dataset of non-homologous protein domains is provided, assuming that structural dissimilarity at the topology level is incompatible with recognizable common ancestry. It contains the best refined representatives of each Topology level, validates structural dissimilarity and removes internally duplicated fragments. The compilation of Nh3D is fully scripted. The current Nh3D list contains 570 domains with a total of 90780 residues. It covers more than 70% of folds at the Topology level of the CATH database and represents more than 90% of the structures in the PDB that have been classified by CATH. Even though all protein pairs are structurally dissimilar, some pairwise sequence identities after global alignment are greater than 30%. Nh3D is freely available as a reference dataset for the statistical analysis of sequence and structure features of proteins in the PDB.
Abbreviations: Nh3D
Resource Type: data or information resource, database
Keywords: duplication, element, feature, fragment, align, alignment, analysis, b-factor, dissimilar, homologous, protein, protein structure databases, residue, sequence, statistical, structurally, structure, topology
Expand Allhas parent organization |
We found {{ ctrl2.mentions.total_count }} mentions in open access literature.
We have not found any literature mentions for this resource.
We are searching literature mentions for this resource.
Most recent articles:
{{ mention._source.dc.creators[0].familyName }} {{ mention._source.dc.creators[0].initials }}, et al. ({{ mention._source.dc.publicationYear }}) {{ mention._source.dc.title }} {{ mention._source.dc.publishers[0].name }}, {{ mention._source.dc.publishers[0].volume }}({{ mention._source.dc.publishers[0].issue }}), {{ mention._source.dc.publishers[0].pagination }}. (PMID:{{ mention._id.replace('PMID:', '') }})
A list of researchers who have used the resource and an author search tool
A list of researchers who have used the resource and an author search tool. This is available for resources that have literature mentions.
No rating or validation information has been found for Nh3D: A Reference Dataset of Structures of Non-homologous Proteins.
No alerts have been found for Nh3D: A Reference Dataset of Structures of Non-homologous Proteins.
Source: SciCrunch Registry