+ Site Statistics
+ Search Articles
+ PDF Full Text Service
How our service works
Request PDF Full Text
+ Follow Us
Follow on Facebook
Follow on Twitter
Follow on LinkedIn
+ Subscribe to Site Feeds
Most Shared
PDF Full Text
+ Translate
+ Recently Requested

Density based pruning for identification of differentially expressed genes from microarray data



Density based pruning for identification of differentially expressed genes from microarray data



Bmc Genomics 11(Suppl. 2): S3



Identification of differentially expressed genes from microarray datasets is one of the most important analyses for microarray data mining. Popular algorithms such as statistical t-test rank genes based on a single statistics. The false positive rate of these methods can be improved by considering other features of differentially expressed genes. We proposed a pattern recognition strategy for identifying differentially expressed genes. Genes are mapped to a two dimension feature space composed of average difference of gene expression and average expression levels. A density based pruning algorithm (DB Pruning) is developed to screen out potential differentially expressed genes usually located in the sparse boundary region. Biases of popular algorithms for identifying differentially expressed genes are visually characterized. Experiments on 17 datasets from Gene Omnibus Database (GEO) with experimentally verified differentially expressed genes showed that DB pruning can significantly improve the prediction accuracy of popular identification algorithms such as t-test, rank product, and fold change. Density based pruning of non-differentially expressed genes is an effective method for enhancing statistical testing based algorithms for identifying differentially expressed genes. It improves t-test, rank product, and fold change by 11% to 50% in the numbers of identified true differentially expressed genes. The source code of DB pruning is freely available on our website http://mleg.cse.sc.edu/degprune.

Please choose payment method:






(PDF emailed within 0-6 h: $19.90)

Accession: 052486240

Download citation: RISBibTeXText

PMID: 21047384

DOI: 10.1186/1471-2164-11-s2-s3


Related references

Identification of differentially expressed genes for time-course microarray data based on modified RM ANOVA. Ieee/Acm Transactions on Computational Biology and Bioinformatics 9(2): 451-466, 2012

CIT: identification of differentially expressed clusters of genes from microarray data. Bioinformatics 18(1): 205-206, 2002

Mixture-model based estimation of gene expression variance from public database improves identification of differentially expressed genes in small sized microarray data. Bioinformatics 26(4): 486-492, 2010

Identification of differentially expressed genes in gastric cancer by high density cDNA microarray. Cancer Genetics 205(4): 147-155, 2012

A novel Mixture Model Method for identification of differentially expressed genes from DNA microarray data. Bmc Bioinformatics 5: 201-201, 2004

Identification of differentially expressed genes in microarray data in a principal component space. Springerplus 2(1): 60, 2013

TTCA: an R package for the identification of differentially expressed genes in time course microarray data. Bmc Bioinformatics 18(1): 33, 2017

Identification of differentially expressed genes in pituitary adenomas by integrating analysis of microarray data. International Journal of Endocrinology 2015: 164087, 2015

A power law global error model for the identification of differentially expressed genes in microarray data. Bmc Bioinformatics 5: 203-203, 2004

Identification of differentially expressed genes by meta-analysis of microarray data on breast cancer. In Silico Biology 8(5-6): 383-411, 2008

Analysis of differentially expressed genes based on microarray data of glioma. International Journal of Clinical and Experimental Medicine 8(10): 17321-17332, 2015

Microarray-based identification of differentially expressed genes in extramammary Paget's disease. International Journal of Clinical and Experimental Medicine 8(5): 7251-7260, 2015

Analysis of differentially co-expressed genes based on microarray data of hepatocellular carcinoma. Neoplasma 64(2): 216-221, 2017

Microarray-based identification of differentially expressed growth- and metastasis-associated genes in pancreatic cancer. Cellular and Molecular Life Sciences 60(6): 1180-1199, 2003

On the identification of differentially expressed genes: improving the generalized F-statistics for Affymetrix microarray gene expression data. Computational Biology and Chemistry 30(5): 321-326, 2006