posted on 2019-05-31, 14:34 authored by Oliver Pain, Frank Dudbridge, Angelica Ronald
Many statistical tests rely on the assumption that the residuals of a model are normally distributed. Rank-based inverse normal transformation (INT) of the dependent variable is one of the most popular approaches to satisfy the normality assumption. When covariates are included in the analysis, a common approach is to first adjust for the covariates and then normalize the residuals. This study investigated the effect of regressing the dependent variable on covariates and then applying rank-based INT to the residuals. The correlation between the dependent variable and covariates at each stage of processing was assessed. An alternative approach was tested in which rank-based INT was applied to the dependent variable before regressing out covariates. Analyses based on both simulated and real data examples demonstrated that applying rank-based INT to the residuals of the dependent variable after regressing out covariates re-introduces a linear correlation between the dependent variable and covariates, increasing type-I errors and reducing power. On the other hand, when rank-based INT was applied prior to controlling for covariate effects, residuals were normally distributed and linearly uncorrelated with covariates. This latter approach is therefore recommended in situations where normality of the dependent variable is required.
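The two processing orders compared in the abstract can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' code: the Blom offset (c = 3/8), the use of average ranks for ties, the single simulated covariate, the log-normal dependent variable, and the function names rank_int, residualize, pearson and compare_orders are all choices made here for the example.

```python
import math
import random
from statistics import NormalDist, mean


def rank_int(values, c=3.0 / 8):
    """Rank-based INT. The offset c = 3/8 (Blom) is an assumption of this sketch."""
    n = len(values)
    order = sorted(range(n), key=lambda i: values[i])
    ranks = [0.0] * n
    start = 0
    while start < n:
        # Find the run of tied values and give each the average 1-based rank.
        end = start
        while end + 1 < n and values[order[end + 1]] == values[order[start]]:
            end += 1
        shared_rank = (start + end) / 2 + 1
        for pos in range(start, end + 1):
            ranks[order[pos]] = shared_rank
        start = end + 1
    normal = NormalDist()
    return [normal.inv_cdf((r - c) / (n - 2 * c + 1)) for r in ranks]


def residualize(y, x):
    """Residuals from a least-squares regression of y on a single covariate x."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    slope = sxy / sxx
    return [(b - my) - slope * (a - mx) for a, b in zip(x, y)]


def pearson(a, b):
    """Pearson correlation between two equal-length sequences."""
    ma, mb = mean(a), mean(b)
    cov = sum((u - ma) * (v - mb) for u, v in zip(a, b))
    var_a = sum((u - ma) ** 2 for u in a)
    var_b = sum((v - mb) ** 2 for v in b)
    return cov / math.sqrt(var_a * var_b)


def compare_orders(n=2000, seed=1):
    """Correlation with the covariate under each processing order (simulated data)."""
    rng = random.Random(seed)
    x = [rng.gauss(0, 1) for _ in range(n)]
    # Hypothetical skewed dependent variable that depends on the covariate.
    y = [math.exp(0.5 * a + rng.gauss(0, 1)) for a in x]
    raw_residuals = residualize(y, x)
    int_after = rank_int(raw_residuals)        # regress first, then INT
    int_before = residualize(rank_int(y), x)   # INT first, then regress
    return {
        "raw_residuals": pearson(raw_residuals, x),
        "int_after_regression": pearson(int_after, x),
        "int_before_regression": pearson(int_before, x),
    }


correlations = compare_orders()
for name, r in correlations.items():
    print(f"{name}: r = {r:.4f}")
```

Least-squares residuals are orthogonal to the covariate by construction, so the first and third correlations are zero up to rounding error. The second value, where INT is applied after regression, is the quantity the abstract reports as a re-introduced linear correlation; its size in this sketch depends entirely on the simulated data.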
Funding
We thank Robert Plomin, Andrew McMillan, the TEDS research team, and their participants for providing TEDS data for use in this study. The TEDS study was funded by the Medical Research Council grant MR/M021475/1. This study was funded by the Medical Research Council grant G1100559 to Angelica Ronald and a Bloomsbury Colleges studentship to Oliver Pain.
Citation
European Journal of Human Genetics, 2018, 26 (8), pp. 1194-1201
Author affiliation
/Organisation/COLLEGE OF LIFE SCIENCES/School of Medicine/Department of Health Sciences
Version
VoR (Version of Record)
Published in
European Journal of Human Genetics
Publisher
Springer Nature for European Society of Human Genetics
The online version of this article (https://doi.org/10.1038/s41431-018-0159-6) contains supplementary material, which is available to authorized users.