Sequence Diversity Diagram for comparative analysis of multiple sequence alignments

BMC Proceedings 8(Suppl 2 Proceedings of the 3rd Annual Symposium on Biologica): S9-S9

The sequence logo is a graphical representation of a set of aligned sequences, commonly used to depict conservation of amino acid or nucleotide sequences. Although it effectively communicates the amount of information present at every position, it falls short when the task is to compare two or more sets of aligned sequences. We present a new visual representation called the Sequence Diversity Diagram and validate our design choices with a case study. Our software was developed in Processing, an open-source programming environment. It loads multiple sequence alignment FASTA files and a configuration file, which can be modified as needed to change the visualization. The redesigned figure improves the visual comparison of two or more sets and additionally encodes information on sequential position conservation. In our case study of the adenylate kinase lid domain, the Sequence Diversity Diagram reveals unexpected patterns and new insights, for example the identification of subgroups within the protein subfamily. Our future work will integrate this visual encoding into interactive visualization tools to support higher-level data exploration tasks.
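For readers unfamiliar with what a sequence logo encodes, the per-position "amount of information" can be sketched in a few lines of Python. This is an illustration only, not the authors' Processing software: the function names `parse_fasta` and `column_information` are hypothetical, the formula is the standard Shannon-based measure (log2 of the alphabet size minus the column entropy) without the small-sample correction that logo tools often apply, and gap characters are assumed to be `-` or `.` and are ignored.

```python
import math
from collections import Counter


def parse_fasta(text):
    """Parse FASTA-formatted text into a list of (header, sequence) pairs.

    Hypothetical helper for illustration; sequence lines that follow a
    '>' header are concatenated until the next header.
    """
    records = []
    header, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                records.append((header, "".join(chunks)))
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        records.append((header, "".join(chunks)))
    return records


def column_information(sequences, alphabet_size=20):
    """Return the information content (bits) of each alignment column.

    Assumes all sequences have equal length (i.e. they are aligned).
    Uses log2(alphabet_size) - H(column) with no small-sample correction;
    gap characters ('-' and '.') are ignored. A column containing only
    gaps is reported as 0.0 bits.
    """
    if not sequences:
        return []
    length = len(sequences[0])
    if any(len(seq) != length for seq in sequences):
        raise ValueError("sequences must be aligned to the same length")

    max_bits = math.log2(alphabet_size)
    info = []
    for i in range(length):
        counts = Counter(seq[i].upper() for seq in sequences if seq[i] not in "-.")
        total = sum(counts.values())
        if total == 0:
            info.append(0.0)
            continue
        entropy = -sum((n / total) * math.log2(n / total) for n in counts.values())
        info.append(max_bits - entropy)
    return info


if __name__ == "__main__":
    # Toy nucleotide alignment (alphabet size 4), invented for this example.
    fasta = ">seq1\nACGT\n>seq2\nACGA\n"
    seqs = [seq for _, seq in parse_fasta(fasta)]
    print(column_information(seqs, alphabet_size=4))
```

A fully conserved column scores the maximum (2 bits for nucleotides, about 4.32 bits for amino acids), while a column split evenly between two residues loses one bit. Computing this profile separately for each set of aligned sequences gives numbers that can be compared, but it does not by itself show which residues differ between sets, which is the comparison task the abstract says the logo handles poorly.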

Accession: 055718803

PMID: 25237396

DOI: 10.1186/1753-6561-8-S2-S9
