+ Site Statistics
+ Search Articles
+ PDF Full Text Service
How our service works
Request PDF Full Text
+ Follow Us
Follow on Facebook
Follow on Twitter
Follow on LinkedIn
+ Subscribe to Site Feeds
Most Shared
PDF Full Text
+ Translate
+ Recently Requested

Clinical research: a novel approach to regression substitution for handling missing data

Clinical research: a novel approach to regression substitution for handling missing data

American Journal of Therapeutics 20(5): 514-519

In clinical research, missing data are common. Imputed data are not real data but constructed values that should increase the sensitivity of testing. Regression substitution for the purpose of data imputation often did not provide a better sensitivity than did other methods. The objective of this study was to compare different methods of missing data imputation with that of regression substitution taking into account particular quality measures. A real data example with a 105-value file was used. After randomly removing 5 values from the file, mean imputation and hot deck imputation were compared with regression substitution, taking account of the following requirements: (1) at least 2 independent variables be present in the equation, (2) no more than 1 datum per patient be missing, (3) no more than 5% of the data be missing, (4) more than 5% of the data be missing after randomly choosing 5% for regression-substitution deletion of the remainder, (5) only statistically significant variables be present in the regression model, and (6) no random errors be added to the imputed data. The test statistics after regression substitution were much better than those after the other 2 methods with F-values of 44.1 vs 29.4 and 30.1, and t-values of 7.6 vs 5.6 and 5.7, and 3.0 vs 1.7 and 1.8. We conclude that regression substitution is a very sensitive method for imputing missing data provided particular quality measures are taken into account.

Please choose payment method:

(PDF emailed within 0-6 h: $19.90)

Accession: 052129398

Download citation: RISBibTeXText

PMID: 21866042

DOI: 10.1097/MJT.0b013e3181ff7a7b

Related references

Multiple imputation as a flexible tool for missing data handling in clinical research. Behaviour Research and Therapy 98: 4-18, 2016

Missing data in clinical research: an integrated approach. British Journal of Dermatology 177(6): 1463-1465, 2018

Effectiveness of Different Missing Data Treatments in Surveys with Likert-Type Data: Introducing the Relative Mean Substitution Approach. Educational and Psychological Measurement 59(5): 725-748, 1999

Handling missing data in survey research. Statistical Methods in Medical Research 5(3): 215-238, 1996

A hierarchical Bayesian approach for handling missing classification data. Ecology and Evolution 9(6): 3130-3140, 2019

Methods for handling missing data in palliative care research. Palliative Medicine 20(8): 791-798, 2006

Handling missing data in nursing research with multiple imputation. Nursing Research 50(6): 384-389, 2001

A heuristic approach to handling missing data in biologics manufacturing databases. Bioprocess and Biosystems Engineering 2019, 2019

Estimating missing data: an iterative regression approach. Journal of Human Evolution 39(3): 289-296, 2000

Missing Data: The Importance and Impact of Missing Data from Clinical Research. Brain Impairment 15(01): 1-9, 2014

Demonstrating Methods for Handling Missing Patient Reported Outcome (Pro) Data in Clinical Trial Data Analysis. Value in Health 18(7): A711, 2015

Characterization of missing values in untargeted MS-based metabolomics data and evaluation of missing data handling strategies. Metabolomics 14(10): 128, 2019

The handling of missing data in clinical trials. Clinical Investigation 3(3): 241-250, 2013

Handling missing data: a commonly encountered problem in quantitative research. Clinical Nurse Specialist Cns 20(6): 273-276, 2006

Approaches to handling missing data within developmental and behavioral pediatric research. Journal of Developmental and Behavioral Pediatrics 31(1): 54-60, 2010