+ Site Statistics
References:
52,654,530
Abstracts:
29,560,856
PMIDs:
28,072,755
+ Search Articles
+ Subscribe to Site Feeds
Most Shared
PDF Full Text
+ PDF Full Text
Request PDF Full Text
+ Follow Us
Follow on Facebook
Follow on Twitter
Follow on LinkedIn
+ Translate
+ Recently Requested

FALCON@home: a high-throughput protein structure prediction server based on remote homologue recognition



FALCON@home: a high-throughput protein structure prediction server based on remote homologue recognition



Bioinformatics 32(3): 462-464



The protein structure prediction approaches can be categorized into template-based modeling (including homology modeling and threading) and free modeling. However, the existing threading tools perform poorly on remote homologous proteins. Thus, improving fold recognition for remote homologous proteins remains a challenge. Besides, the proteome-wide structure prediction poses another challenge of increasing prediction throughput. In this study, we presented FALCON@home as a protein structure prediction server focusing on remote homologue identification. The design of FALCON@home is based on the observation that a structural template, especially for remote homologous proteins, consists of conserved regions interweaved with highly variable regions. The highly variable regions lead to vague alignments in threading approaches. Thus, FALCON@home first extracts conserved regions from each template and then aligns a query protein with conserved regions only rather than the full-length template directly. This helps avoid the vague alignments rooted in highly variable regions, improving remote homologue identification. We implemented FALCON@home using the Berkeley Open Infrastructure of Network Computing (BOINC) volunteer computing protocol. With computation power donated from over 20,000 volunteer CPUs, FALCON@home shows a throughput as high as processing of over 1000 proteins per day. In the Critical Assessment of protein Structure Prediction (CASP11), the FALCON@home-based prediction was ranked the 12th in the template-based modeling category. As an application, the structures of 880 mouse mitochondria proteins were predicted, which revealed the significant correlation between protein half-lives and protein structural factors. FALCON@home is freely available at http://protein.ict.ac.cn/FALCON/. shuaicli@cityu.edu.hk, dbu@ict.ac.cn Supplementary data are available at Bioinformatics online.

(PDF emailed within 0-6 h: $19.90)

Accession: 057851984

Download citation: RISBibTeXText

PMID: 26454278

DOI: 10.1093/bioinformatics/btv581


Related references

RNAex: an RNA secondary structure prediction server enhanced by high-throughput structure-probing data. Nucleic Acids Research 44(W1): W294-W301, 2017

BeStSel: a web server for accurate protein secondary structure prediction and fold recognition from the circular dichroism spectra. Nucleic Acids Research 46(W1): W315-W322, 2018

(PS)2-v2: template-based protein structure prediction server. Bmc Bioinformatics 10: 366, 2010

PROTEUS2: a web server for comprehensive protein structure prediction and structure-based annotation. Nucleic Acids Research 36(Web Server Issue): W202-W209, 2008

MAESTROweb: a web server for structure-based protein stability prediction. Bioinformatics 32(9): 1414-1416, 2017

CABS-fold: Server for the de novo and consensus-based prediction of protein structure. Nucleic Acids Research 41(Web Server Issue): W406-W411, 2013

GalaxyGemini: a web server for protein homo-oligomer structure prediction based on similarity. Bioinformatics 29(8): 1078-1080, 2013

The IntFOLD server: an integrated web resource for protein fold recognition, 3D model quality assessment, intrinsic disorder prediction, domain prediction and ligand binding site prediction. Nucleic Acids Research 39(Web Server Issue): W171-W176, 2011

UNRES server for physics-based coarse-grained simulations and prediction of protein structure, dynamics and thermodynamics. Nucleic Acids Research 46(W1): W304-W309, 2018

Protein Sequence Annotation Tool (PSAT): a centralized web-based meta-server for high-throughput sequence annotations. Bmc Bioinformatics 17: 43, 2016

ELASPIC web-server: proteome-wide structure-based prediction of mutation effects on protein stability and binding affinity. Bioinformatics 32(10): 1589-1591, 2017

PPIMpred: a web server for high-throughput screening of small molecules targeting protein-protein interaction. Royal Society Open Science 4(4): 160501, 2017

A Bayesian network model for protein fold and remote homologue recognition. Bioinformatics 18(6): 788-801, 2002

Dinosolve: a protein disulfide bonding prediction server using context-based features to enhance prediction accuracy. Bmc Bioinformatics 14 Suppl 13: S9, 2014

PESDserv: a server for high-throughput comparison of protein binding site surfaces. Bioinformatics 26(15): 1913-1914, 2010