
PennCNV: an integrated hidden Markov model designed for high-resolution copy number variation detection in whole-genome SNP genotyping data






Genome Research 17(11): 1665-1674



Comprehensive identification and cataloging of copy number variations (CNVs) is required to provide a complete view of human genetic variation. The resolution of CNV detection in previous experimental designs has been limited to tens or hundreds of kilobases. Here we present PennCNV, a hidden Markov model (HMM) based approach, for kilobase-resolution detection of CNVs from Illumina high-density SNP genotyping data. This algorithm incorporates multiple sources of information, including total signal intensity and allelic intensity ratio at each SNP marker, the distance between neighboring SNPs, the allele frequency of SNPs, and the pedigree information where available. We applied PennCNV to genotyping data generated for 112 HapMap individuals; on average, we detected approximately 27 CNVs for each individual with a median size of approximately 12 kb. Excluding common rearrangements in lymphoblastoid cell lines, the fraction of CNVs in offspring not detected in parents (CNV-NDPs) was 3.3%. Our results demonstrate the feasibility of whole-genome fine-mapping of CNVs via high-density SNP genotyping.
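The abstract describes an HMM that combines per-SNP signal intensity with the distance between neighboring SNPs. The sketch below illustrates that general idea only and is not the PennCNV model itself: it decodes a three-state path (loss, normal, gain) from total signal intensity with the Viterbi algorithm, using a transition probability that grows with inter-SNP distance. The state set, Gaussian emission parameters, prior, and the `p_max` and `scale_bp` constants are all assumptions chosen for illustration; the allelic intensity ratio, allele frequency, and pedigree information mentioned in the abstract are left out.

```python
import math

# Hypothetical copy-number states and Gaussian emission parameters for the
# total signal intensity (illustrative values, not taken from the paper).
STATES = ("loss", "normal", "gain")
MEANS = {"loss": -0.6, "normal": 0.0, "gain": 0.4}
SD = 0.2
PRIOR = {"loss": 0.01, "normal": 0.98, "gain": 0.01}


def log_emission(state, intensity):
    """Log density of a Gaussian centred on the state's expected intensity."""
    z = (intensity - MEANS[state]) / SD
    return -0.5 * z * z - math.log(SD * math.sqrt(2.0 * math.pi))


def log_transition(prev, curr, distance_bp, p_max=0.01, scale_bp=100_000.0):
    """Distance-aware transition: nearby SNPs are more likely to share a state.

    The functional form and both constants are assumptions for this sketch.
    """
    p_change = p_max * (1.0 - math.exp(-distance_bp / scale_bp))
    p_change = max(p_change, 1e-12)  # avoid log(0) when SNPs share a position
    if prev == curr:
        return math.log(1.0 - p_change)
    return math.log(p_change / (len(STATES) - 1))


def viterbi(positions, intensities):
    """Return the most likely state path for SNPs sorted by position."""
    if not positions:
        return []
    score = {s: math.log(PRIOR[s]) + log_emission(s, intensities[0])
             for s in STATES}
    backpointers = []
    for i in range(1, len(positions)):
        dist = positions[i] - positions[i - 1]
        new_score, pointers = {}, {}
        for curr in STATES:
            # Pick the predecessor state that maximises the path score.
            best_prev = max(
                STATES,
                key=lambda p: score[p] + log_transition(p, curr, dist),
            )
            new_score[curr] = (score[best_prev]
                               + log_transition(best_prev, curr, dist)
                               + log_emission(curr, intensities[i]))
            pointers[curr] = best_prev
        score = new_score
        backpointers.append(pointers)
    # Trace back from the best final state.
    state = max(STATES, key=score.get)
    path = [state]
    for pointers in reversed(backpointers):
        state = pointers[state]
        path.append(state)
    path.reverse()
    return path


# Toy data: 12 SNPs spaced 1 kb apart, with a run of six low-intensity markers.
positions = [1_000 * i for i in range(12)]
intensities = [0.0, 0.02, -0.01] + [-0.6] * 6 + [0.01, 0.0, -0.02]
calls = viterbi(positions, intensities)
```

Because the transition penalty is large for closely spaced SNPs, a single outlying marker is unlikely to be called as a CNV in this sketch, while a run of consistent markers is. A model closer to the one the abstract describes would add a second emission term for the allelic intensity ratio and weight it by SNP allele frequency.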


Accession: 021535895


PMID: 17921354

DOI: 10.1101/gr.6861907

