EurekaMag.com logo
+ Site Statistics
References:
53,869,633
Abstracts:
29,686,251
+ Search Articles
+ Subscribe to Site Feeds
EurekaMag Most Shared ContentMost Shared
EurekaMag PDF Full Text ContentPDF Full Text
+ PDF Full Text
Request PDF Full TextRequest PDF Full Text
+ Follow Us
Follow on FacebookFollow on Facebook
Follow on TwitterFollow on Twitter
Follow on LinkedInFollow on LinkedIn

+ Translate

MergeAlign: improving multiple sequence alignment performance by dynamic reconstruction of consensus multiple sequence alignments



MergeAlign: improving multiple sequence alignment performance by dynamic reconstruction of consensus multiple sequence alignments



Bmc Bioinformatics 13(): 117-117



The generation of multiple sequence alignments (MSAs) is a crucial step for many bioinformatic analyses. Thus improving MSA accuracy and identifying potential errors in MSAs is important for a wide range of post-genomic research. We present a novel method called MergeAlign which constructs consensus MSAs from multiple independent MSAs and assigns an alignment precision score to each column. Using conventional benchmark tests we demonstrate that on average MergeAlign MSAs are more accurate than MSAs generated using any single matrix of sequence substitution. We show that MergeAlign column scores are related to alignment precision and hence provide an ab initio method of estimating alignment precision in the absence of curated reference MSAs. Using two novel and independent alignment performance tests that utilise a large set of orthologous gene families we demonstrate that increasing MSA performance leads to an increase in the performance of downstream phylogenetic analyses. Using multiple tests of alignment performance we demonstrate that this novel method has broad general application in biological research.

(PDF emailed within 0-6 h: $19.90)

Accession: 054332334

Download citation: RISBibTeXText

PMID: 22646090

DOI: 10.1186/1471-2105-13-117



Related references

Viewing multiple sequence alignments with the JavaScript Sequence Alignment Viewer (JSAV). F1000research 3: 249-249, 2015

Combining multiple structure and sequence alignments to improve sequence detection and alignment: application to the SH2 domains of Janus kinases. Proceedings of the National Academy of Sciences of the United States of America 98(26): 14796-14801, 2001

Multiple Sequence Alignment Based on a Suffix Tree and Center-Star Strategy: A Linear Method for Multiple Nucleotide Sequence Alignment on Spark Parallel Framework. Journal of Computational Biology, 2017

Improving pairwise sequence alignment accuracy using near-optimal protein sequence alignments. Bmc Bioinformatics 11(): 146-146, 2010

Sequence alignment of citrate synthase proteins using a multiple sequence alignment algorithm and multiple scoring matrices. Protein Engineering 2(8): 597-604, 1989

SPEM: improving multiple sequence alignment with sequence profiles and predicted secondary structures. Bioinformatics 21(18): 3615-3621, 2005

A method of alignment masking for refining the phylogenetic signal of multiple sequence alignments. Molecular Biology and Evolution 30(3): 689-712, 2013

Formatt: Correcting protein multiple structural alignments by incorporating sequence alignment. Bmc Bioinformatics 13(): 259-259, 2013

CLUSTAL W: Improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Research 22(22): 4673-4680, 1994

Improving the performance of Rosetta using multiple sequence alignment information and global measures of hydrophobic core formation. Proteins 43(1): 1-11, April 1, 2001

ReformAlign: improved multiple sequence alignments using a profile-based meta-alignment approach. Bmc Bioinformatics 15(): 265-265, 2014

Elision: a method for accommodating multiple molecular sequence alignments with alignment-ambiguous sites. Molecular Phylogenetics and Evolution 4(1): 1-9, 1995

GenomeCons: a web server for manipulating multiple genome sequence alignments and their consensus sequences. Bioinformatics 31(8): 1293-1295, 2015

Energy-based RNA consensus secondary structure prediction in multiple sequence alignments. Methods in Molecular Biology 1097: 125-141, 2014

Advantages of using multiple sequence alignments over pairwise alignments when sequence similarity is low. Abstracts of Papers American Chemical Society 203(1-3): BIOL60, 1992