+ Site Statistics
+ Search Articles
+ Subscribe to Site Feeds
EurekaMag Most Shared ContentMost Shared
EurekaMag PDF Full Text ContentPDF Full Text
+ PDF Full Text
Request PDF Full TextRequest PDF Full Text
+ Follow Us
Follow on FacebookFollow on Facebook
Follow on TwitterFollow on Twitter
Follow on LinkedInFollow on LinkedIn

+ Translate

Exploratory visual analysis of conserved domains on multiple sequence alignments

Exploratory visual analysis of conserved domains on multiple sequence alignments

Bmc Bioinformatics 10 Suppl 11(): S7-S7

Multiple alignment of protein sequences can provide insight into sequence conservation across many species and thus allow identification of those sections of the sequence most critical to protein function. This insight can be augmented by joint display of conserved domains along the sequences. By fusing this metadata visually, biologists can analyze sequence conservation and functional motifs simultaneously and efficiently. We present MSAVis, a new approach combining luminance and hue for simultaneous visualization of conserved motifs and sequence alignment. Input for the algorithm is a multiple sequence alignment in a standard format. The NCBI Conserved Domain Database (CDD) is used for finding conserved domains along the alignment. The visualization quickly identifies conserved domains, and allows both macro (sequence-long) and micro (small amino-acid neighborhood) views. MSAVis utilizes two visual cues, luminance and hue, to facilitate at-a-glance summary of the conservation of a user-provided protein alignment while enabling multiple comparisons among functional domains. These visual cues are preattentive and separable so that the relationship between conservation strength and domain membership can be understood. The MSAVis software, written in Python and using BioPython and OpenGL, can be found at

(PDF emailed within 0-6 h: $19.90)

Accession: 053138682

Download citation: RISBibTeXText

PMID: 19811691

DOI: 10.1186/1471-2105-10-S11-S7

Related references

Exploratory analysis of multiple sequence alignments using phylogenies. Computer Applications in the Biosciences 10(3): 243-247, 1994

Revealing highly conserved regions in the E6 protein among distinct human papillomavirus types using comparative analysis of multiple sequence alignments. Brazilian Journal of Biology 73(2): 449-450, 2014

Combining multiple structure and sequence alignments to improve sequence detection and alignment: application to the SH2 domains of Janus kinases. Proceedings of the National Academy of Sciences of the United States of America 98(26): 14796-14801, 2001

Refining multiple sequence alignments with conserved core regions. Nucleic Acids Research 34(9): 2598-2606, 2006

Algorithms for locating extremely conserved elements in multiple sequence alignments. Bmc Bioinformatics 10(): 432-432, 2010

Pfam: Multiple sequence alignments and HMM-profiles of protein domains. Nucleic Acids Research 26(1): 320-322, 1998

Searching databases of conserved sequence regions by aligning protein multiple-alignments. Nucleic Acids Research 24(19): 3836-3845, 1996

AlignMiner: a Web-based tool for detection of divergent regions in multiple sequence alignments of conserved sequences. Algorithms for Molecular Biology 5(): 24-24, 2010

SeqFIRE: a web application for automated extraction of indel regions and conserved blocks from protein multiple sequence alignments. Nucleic Acids Research 40(Web Server Issue): W340-W347, 2012

Fingerprint: visual depiction of variation in multiple sequence alignments. Molecular ecology notes 7(6): 908-914, 2007

IVisTMSA: Interactive Visual Tools for Multiple Sequence Alignments. Evolutionary Bioinformatics Online 11: 35-42, 2015

Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis. Molecular Biology and Evolution 17(4): 540-552, 2000

Analysis of conserved domains and sequence motifs in cellular regulatory proteins and locus control regions using new software tools for multiple alignment and visualization. New Biologist 4(3): 247-260, 1992

Advantages of using multiple sequence alignments over pairwise alignments when sequence similarity is low. Abstracts of Papers American Chemical Society 203(1-3): BIOL60, 1992

Optimizing multiple sequence alignments using a genetic algorithm based on three objectives: structural information, non-gaps percentage and totally conserved columns. Bioinformatics 29(17): 2112-2121, 2014