+ Site Statistics
+ Search Articles
+ Subscribe to Site Feeds
Most Shared
PDF Full Text
+ PDF Full Text
Request PDF Full Text
+ Follow Us
Follow on Facebook
Follow on Twitter
Follow on LinkedIn
+ Translate
+ Recently Requested

Regression for skewed biomarker outcomes subject to pooling

Regression for skewed biomarker outcomes subject to pooling

Biometrics 70(1): 202-211

Epidemiological studies involving biomarkers are often hindered by prohibitively expensive laboratory tests. Strategically pooling specimens prior to performing these lab assays has been shown to effectively reduce cost with minimal information loss in a logistic regression setting. When the goal is to perform regression with a continuous biomarker as the outcome, regression analysis of pooled specimens may not be straightforward, particularly if the outcome is right-skewed. In such cases, we demonstrate that a slight modification of a standard multiple linear regression model for poolwise data can provide valid and precise coefficient estimates when pools are formed by combining biospecimens from subjects with identical covariate values. When these x-homogeneous pools cannot be formed, we propose a Monte Carlo expectation maximization (MCEM) algorithm to compute maximum likelihood estimates (MLEs). Simulation studies demonstrate that these analytical methods provide essentially unbiased estimates of coefficient parameters as well as their standard errors when appropriate assumptions are met. Furthermore, we show how one can utilize the fully observed covariate data to inform the pooling strategy, yielding a high level of statistical efficiency at a fraction of the total lab cost.

(PDF emailed within 0-6 h: $19.90)

Accession: 055446717

Download citation: RISBibTeXText

PMID: 24521420

DOI: 10.1111/biom.12134

Related references

Semiparametric regression models for a right-skewed outcome subject to pooling. American Journal of Epidemiology 181(7): 541-548, 2015

Logistic regression analysis of biomarker data subject to pooling and dichotomization. Statistics in Medicine 31(22): 2473-2484, 2013

Within-subject Pooling of Biological Samples to Reduce Exposure Misclassification in Biomarker-based Studies. Epidemiology 27(3): 378-388, 2016

Assessment of skewed exposure in case-control studies with pooling. Statistics in Medicine 31(22): 2461-2472, 2013

A comparison of time dependent Cox regression, pooled logistic regression and cross sectional pooling with simulations and an application to the Framingham Heart Study. Bmc Medical Research Methodology 16(1): 148-148, 2016

Pooling data when analyzing biomarkers subject to a limit of detection. Methods in Molecular Biology 477: 421-426, 2009

Evaluation of regression calibration and SIMEX methods in logistic regression when one of the predictors is subject to additive measurement error. Journal of Epidemiology and Biostatistics 4(2): 65-74, 2000

Modelling visitor attendance levels subject to day of the week and weather: A comparison between linear regression models and regression trees. Journal for Nature Conservation (Jena) 11(4): 297-308, 2003

Pooling biomarker data from different studies of disease risk, with a focus on endogenous hormones. Cancer Epidemiology, Biomarkers & Prevention 19(4): 960-965, 2010

Methods for improving regression analysis for skewed continuous or counted responses. Annual Review of Public Health 28: 95-111, 2006

Approximate median regression for complex survey data with skewed response. Biometrics: -, 2016

Modeling Change in Skewed Variables Using Mixed Beta Regression Models. Research in Human Development 7(1): 9-26, 2010

A highly efficient design strategy for regression with outcome pooling. Statistics in Medicine 33(28): 5028-5040, 2015

Fiberprint: A subject fingerprint based on sparse code pooling for white matter fiber analysis. Neuroimage 158: 242-259, 2017

Estimation of ROC curves based on stably distributed biomarkers subject to measurement error and pooling mixtures. Statistics in Medicine 27(2): 280-296, 2007