Protein domain assignment from the recurrence of locally similar structures

Proteins 79(3): 853-866

Domains are basic units of protein structure and are essential for exploring protein fold space and structure evolution. With the structural genomics initiative, the number of protein structures in the Protein Data Bank (PDB) is increasing dramatically, and domain assignments need to be done automatically. Most existing structural domain assignment programs define domains using the compactness of the domains and/or the number and strength of intra-domain versus inter-domain contacts. Here we present a different approach based on the recurrence of locally similar structural pieces (LSSPs) found by one-against-all structure comparisons with a dataset of 6373 protein chains from the PDB. Residues of the query protein are clustered using LSSPs via three different procedures to define domains. This approach gives results that are comparable to those of several existing programs that use geometrical and other structural information explicitly. Remarkably, most of the proteins that contribute the LSSPs defining a domain do not themselves contain the domain of interest. This study shows that domains can be defined by a collection of relatively small locally similar structural pieces containing, on average, four secondary structure elements. In addition, it indicates that domains are indeed made of recurrent small structural pieces that are used to build protein structures of many different folds, as suggested by recent studies.
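The abstract does not specify the three clustering procedures, so the following is only an illustrative sketch of the general idea of grouping query residues by how often they are covered by the same LSSP. Everything in it is an assumption made for illustration: the representation of an LSSP as a set of query residue indices, the function names `cooccurrence` and `cluster_residues`, the `min_shared` threshold, and the use of single-linkage clustering, which is a stand-in technique and not the authors' method.

```python
from collections import Counter
from itertools import combinations


def cooccurrence(lssps):
    """Count, for each residue pair, how many LSSPs cover both residues.

    Each LSSP is modelled (hypothetically) as an iterable of query residue indices.
    """
    counts = Counter()
    for piece in lssps:
        for i, j in combinations(sorted(set(piece)), 2):
            counts[(i, j)] += 1
    return counts


def cluster_residues(n_residues, lssps, min_shared=2):
    """Single-linkage clustering: join residues that share >= min_shared LSSPs.

    This is a stand-in procedure, not one of the three used in the paper.
    """
    parent = list(range(n_residues))

    def find(x):
        # Union-find lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for (i, j), shared in cooccurrence(lssps).items():
        if shared >= min_shared:
            parent[find(i)] = find(j)

    groups = {}
    for residue in range(n_residues):
        groups.setdefault(find(residue), []).append(residue)
    return sorted(groups.values())


# Toy input (invented): two groups of recurrent pieces plus one weak bridging piece.
toy_lssps = [
    range(0, 5), range(0, 5), range(2, 5),   # pieces covering residues 0-4
    range(5, 10), range(5, 10),              # pieces covering residues 5-9
    range(4, 6),                             # a single piece spanning the boundary
]
domains = cluster_residues(10, toy_lssps)
print(domains)
```

In this toy case the bridging piece is seen only once, so with `min_shared=2` residues 0-4 and 5-9 stay in separate clusters; lowering the threshold to 1 merges them into a single cluster.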

Accession: 055250107

PMID: 21287617

DOI: 10.1002/prot.22923
