+ Site Statistics
+ Search Articles
+ Subscribe to Site Feeds
EurekaMag Most Shared ContentMost Shared
EurekaMag PDF Full Text ContentPDF Full Text
+ PDF Full Text
Request PDF Full TextRequest PDF Full Text
+ Follow Us
Follow on FacebookFollow on Facebook
Follow on TwitterFollow on Twitter
Follow on LinkedInFollow on LinkedIn

+ Translate

RAPHAEL: recognition, periodicity and insertion assignment of solenoid protein structures

RAPHAEL: recognition, periodicity and insertion assignment of solenoid protein structures

Bioinformatics 28(24): 3257-3264

Repeat proteins form a distinct class of structures where folding is greatly simplified. Several classes have been defined, with solenoid repeats of periodicity between ca. 5 and 40 being the most challenging to detect. Such proteins evolve quickly and their periodicity may be rapidly hidden at sequence level. From a structural point of view, finding solenoids may be complicated by the presence of insertions or multiple domains. To the best of our knowledge, no automated methods are available to characterize solenoid repeats from structure. Here we introduce RAPHAEL, a novel method for the detection of solenoids in protein structures. It reliably solves three problems of increasing difficulty: (1) recognition of solenoid domains, (2) determination of their periodicity and (3) assignment of insertions. RAPHAEL uses a geometric approach mimicking manual classification, producing several numeric parameters that are optimized for maximum performance. The resulting method is very accurate, with 89.5% of solenoid proteins and 97.2% of non-solenoid proteins correctly classified. RAPHAEL periodicities have a Spearman correlation coefficient of 0.877 against the manually established ones. A baseline algorithm for insertion detection in identified solenoids has a Q(2) value of 79.8%, suggesting room for further improvement. RAPHAEL finds 1931 highly confident repeat structures not previously annotated as solenoids in the Protein Data Bank records.

(PDF emailed within 0-6 h: $19.90)

Accession: 055333314

Download citation: RISBibTeXText

PMID: 22962341

DOI: 10.1093/bioinformatics/bts550

Related references

Autoregressive and iterative hidden Markov models for periodicity detection and solenoid structure recognition in protein sequences. IEEE Journal of Biomedical and Health Informatics 17(2): 436-441, 2014

Solenoid and non-solenoid protein recognition using stationary wavelet packet transform. Bioinformatics 26(18): I467-I473, 2010

REPETITA: detection and discrimination of the periodicity of protein solenoid repeats by discrete Fourier transform. Bioinformatics 25(12): I289-I295, 2009

When protein folding is simplified to protein coiling: the continuum of solenoid protein structures. Trends in Biochemical Sciences 25(10): 509-515, 2000

Periodicity, planarity, and pixel (3P): a program using the intrinsic residual dipolar coupling periodicity-to-peptide plane correlation and phi/psi angles to derive protein backbone structures. Journal of Magnetic Resonance 189(1): 90-103, 2007

ConSole: using modularity of contact maps to locate solenoid domains in protein structures. Bmc Bioinformatics 15(): 119-119, 2014

Quantitative structural analysis of importin-β flexibility: paradigm for solenoid protein structures. Structure 18(9): 1171-1183, 2011

SABA (secondary structure assignment program based on only alpha carbons): a novel pseudo center geometrical criterion for accurate assignment of protein secondary structures. Bmb Reports 44(2): 118-122, 2011

The humanist Raphael, the structures of the face and their esthetic appreciation. L' Orthodontie Francaise 62 Pt 3: 1043-1062, 1991

Influence of assignment on the prediction of transmembrane helices in protein structures. Amino Acids 39(5): 1241-1254, 2011

Protein domain assignment from the recurrence of locally similar structures. Proteins 79(3): 853-866, 2011

4SCOPmap: automated assignment of protein structures to evolutionary superfamilies. Bmc Bioinformatics 5(): 197-197, 2004

Dissecting the protein-RNA interface: the role of protein surface shapes and RNA secondary structures in protein-RNA recognition. Nucleic Acids Research 40(8): 3299-3306, 2012

Fast assignment of protein structures to sequences using the intermediate sequence library PDB-ISL. Bioinformatics 16(2): 117-124, 2000