The utility of PacBio circular consensus sequencing for characterizing complex gene families in non-model organisms


BMC Genomics 15: 720

Molecular characterization of highly diverse gene families can be time-consuming, expensive, and difficult, especially when considering the potential for relatively large numbers of paralogs and/or pseudogenes. Here we investigate the utility of Pacific Biosciences single molecule real-time (SMRT) circular consensus sequencing (CCS) as an alternative to traditional cloning and Sanger sequencing of PCR amplicons for gene family characterization. We target vomeronasal receptor genes, one of the most diverse gene families in mammals, with the goal of better understanding intra-specific V1R diversity of the gray mouse lemur (Microcebus murinus). Our study compares intragenomic variation for two V1R subfamilies found in the mouse lemur. Specifically, we compare gene copy variation within and between two individuals of M. murinus as characterized by different methods for nucleotide sequencing. By including the same individual animal from which the M. murinus draft genome was derived, we are able to cross-validate gene copy estimates from Sanger sequencing versus CCS methods.

We generated 34,088 high-quality circular consensus sequences of two diverse V1R subfamilies (here referred to as V1RI and V1RIX) from two individuals of M. murinus. Using a minimum threshold of 7× coverage, we recovered approximately 90% of V1RI sequences previously identified in the draft M. murinus genome (59% being identical at all nucleotide positions). When low-coverage sequences were considered (i.e., < 7× coverage), 100% of V1RI sequences identified in the draft genome were recovered. At least 13 putatively novel V1R loci were also identified using CCS technology.

Recent upgrades to the Pacific Biosciences RS instrument have improved the CCS technology and offer an alternative to traditional sequencing approaches. Our results suggest that the M. murinus V1R repertoire has been underestimated in the draft genome. In addition to providing an improved understanding of V1R diversity in the mouse lemur, this study demonstrates the utility of CCS technology for characterizing complex regions of the genome. We anticipate that long-read sequencing technologies such as PacBio SMRT will allow for the assembly of multigene family clusters and serve to more accurately characterize patterns of gene copy variation in large gene families, thus revealing novel micro-evolutionary patterns within non-model organisms.
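
The coverage threshold and recovery figures quoted above can be illustrated with a small sketch. The abstract does not say how coverage was computed or how matches to the draft genome were scored, so the following rests on two assumptions: coverage is taken as the number of CCS reads sharing an exact sequence, and a reference sequence counts as recovered only when it matches a supported sequence at every position (the stricter "identical at all nucleotide positions" case). All function names and the toy data are hypothetical and are not taken from the study.

```python
from collections import Counter

MIN_COVERAGE = 7  # the 7x threshold quoted in the abstract


def coverage_by_sequence(ccs_reads):
    """Count how many reads share each exact sequence (assumed coverage measure)."""
    return Counter(read.upper() for read in ccs_reads)


def supported_sequences(ccs_reads, min_coverage=MIN_COVERAGE):
    """Return the distinct sequences seen in at least min_coverage reads."""
    coverage = coverage_by_sequence(ccs_reads)
    return {seq for seq, n in coverage.items() if n >= min_coverage}


def recovered_fraction(reference_seqs, ccs_reads, min_coverage=MIN_COVERAGE):
    """Fraction of reference sequences matched exactly by a supported sequence."""
    refs = {ref.upper() for ref in reference_seqs}
    if not refs:
        return 0.0
    return len(refs & supported_sequences(ccs_reads, min_coverage)) / len(refs)


def novel_candidates(reference_seqs, ccs_reads, min_coverage=MIN_COVERAGE):
    """Supported sequences that are absent from the reference set."""
    refs = {ref.upper() for ref in reference_seqs}
    return supported_sequences(ccs_reads, min_coverage) - refs


# Toy data only: three distinct reads with coverage 8, 3 and 7.
toy_reads = ["ACGT"] * 8 + ["ACGA"] * 3 + ["TTGA"] * 7
toy_refs = ["ACGT", "ACGA", "GGGG"]

strict = recovered_fraction(toy_refs, toy_reads)                   # 7x threshold
relaxed = recovered_fraction(toy_refs, toy_reads, min_coverage=1)  # keep low coverage
candidates = novel_candidates(toy_refs, toy_reads)
```

Lowering `min_coverage` mirrors the abstract's observation that recovery rises once low-coverage sequences are admitted, at the cost of admitting more sequences that may reflect sequencing error. A realistic analysis would also need near-match alignment rather than exact string equality, which this sketch deliberately leaves out.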


Accession: 056530585


PMID: 25159659

DOI: 10.1186/1471-2164-15-720
