EurekaMag
+ Translate
+ Most Popular
Cunninghamia lanceolata plantations in China
Mammalian lairs in paleo ecological studies and palynology
Studies on technological possibilities in utilization of anhydrous milk fat for production of recombined butter-like products
Should right-sided fibroelastomas be operated upon?
Large esophageal lipoma
Apoptosis in the mammalian thymus during normal histogenesis and under various in vitro and in vivo experimental conditions
Poissons characoides nouveaux ou non signales de l'Ilha do Bananal, Bresil
Desensitizing efficacy of Colgate Sensitive Maximum Strength and Fresh Mint Sensodyne dentifrices
Administration of fluid by subcutaneous infusion: revival of a forgotten method
Tundra mosquito control - an impossible dream?
Schizophrenia for primary care providers: how to contribute to the care of a vulnerable patient population
Geochemical pattern analysis; method of describing the Southeastern limestone regional aquifer system
Incidence of low birth weights in a hospital of Mexico City
Tabanidae
Graded management intensity of grassland systems for enhancing floristic diversity
Microbiology and biochemistry of cheese and fermented milk
The ember tetra: a new pygmy characid tetra from the Rio das Mortes, Brazil, Hyphessobrycon amandae sp. n. (Pisces, Characoidei)
Risk factors of contrast-induced nephropathy in patients after coronary artery intervention
Renovation of onsite domestic wastewater in a poorly drained soil
Observations of the propagation velocity and formation mechanism of burst fractures caused by gunshot
Systolic blood pressure in a population of infants in the first year of life: the Brompton study
Haematological studies in rats fed with metanil yellow
Studies on pasteurellosis. I. A new species of Pasteurella encountered in chronic fowl cholera
Dormancy breaking and germination of Acacia salicina Lindl. seeds
therapy of lupus nephritis. a two-year prospective study

Capturing protein sequence-structure specificity using computational sequence design


Capturing protein sequence-structure specificity using computational sequence design



Proteins 81(9): 1556-1570



ISSN/ISBN: 0887-3585

PMID: 23609941

DOI: 10.1002/prot.24307

It is well known that protein fold recognition can be greatly improved if models for the underlying evolution history of the folds are taken into account. The improvement, however, exists only if such evolutionary information is available. To circumvent this limitation for protein families that only have a small number of representatives in current sequence databases, we follow an alternate approach in which the benefits of including evolutionary information can be recreated by using sequences generated by computational protein design algorithms. We explore this strategy on a large database of protein templates with 1747 members from different protein families. An automated method is used to design sequences for these templates. We use the backbones from the experimental structures as fixed templates, thread sequences on these backbones using a self-consistent mean field approach, and score the fitness of the corresponding models using a semi-empirical physical potential. Sequences designed for one template are translated into a hidden Markov model-based profile. We describe the implementation of this method, the optimization of its parameters, and its performance. When the native sequences of the protein templates were tested against the library of these profiles, the class, fold, and family memberships of a large majority (>90%) of these sequences were correctly recognized for an E-value threshold of 1. In contrast, when homologous sequences were tested against the same library, a much smaller fraction (35%) of sequences were recognized; The structural classification of protein families corresponding to these sequences, however, are correctly recognized (with an accuracy of >88%).

Please choose payment method:






(PDF emailed within 0-6 h: $19.90)

Accession: 051932731

Download citation: RISBibTeXText

Related references

Computational design of the sequence and structure of a protein-binding peptide. Journal of the American Chemical Society 133(12): 4190-4192, 2011

Improving computational protein design by using structure-derived sequence profile. Proteins 78(10): 2338-2348, 2010

Full-sequence computational design and solution structure of a thermostable protein variant. Journal of Molecular Biology 372(1): 1-6, 2007

Engineering enzyme specificity using computational design of a defined-sequence library. Chemistry and Biology 17(12): 1306-1315, 2010

An Integrated Sequence-Structure Database incorporating matching mRNA sequence, amino acid sequence and protein three-dimensional structure data. Nucleic Acids Research 26(1): 327-331, 1998

Computational survey of sequence specificity for protein terminal tags covering nine organisms and its application to protein identification. Journal of Proteome Research 14(2): 756-767, 2015

Geometric Potentials for Computational Protein Sequence Design. Methods in Molecular Biology 1529: 125-138, 2017

Prediction of protein-protein interface sequence diversity using flexible backbone computational protein design. Structure 16(12): 1777-1788, 2009

Exposing the co-adaptive potential of protein-protein interfaces through computational sequence design. Bioinformatics 26(18): 2266-2272, 2010

Combined sequence and sequence-structure-based methods for analyzing RAAS gene SNPs: a computational approach. Journal of Receptor and Signal Transduction Research 34(6): 513-526, 2014

Analysis of the RNA Binding Specificity Landscape of C5 Protein Reveals Structure and Sequence Preferences that Direct RNase P Specificity. Cell Chemical Biology 23(10): 1271-1281, 2016

Use of residue pairs in protein sequence-sequence and sequence-structure alignments. Protein Science: a Publication of the Protein Society 9(8): 1576-1588, 2000

Solution structure of the first three zinc fingers of TFIIIA bound to the cognate DNA sequence: determinants of affinity and sequence specificity. Journal of Molecular Biology 273(1): 183-206, 1997

Computational Prediction of Protein Secondary Structure from Sequence. Literature Cited 86: 2.3.1-2.3.10, 2016

Computational design of a single amino acid sequence that can switch between two distinct protein folds. Journal of the American Chemical Society 128(4): 1154-1161, 2006