posted on 2019-03-01, 08:50authored byS Diver, CE Brightling
‘Big data’ is on trend and the term is used in equal measure to reflect both one of the greatest challenges and likeliest solutions to future scientific advances from fundamental understanding in astrophysics, climate change, economics, health and disease. Like many trends, it means different things to different people. In medicine, it is used to describe the data derived from large populations in epidemiology studies, high fidelity multiscale ‘omic datasets across spatial scales within individuals or sometimes a combination of the two. Big data will often capture information at a single time point. Typically, it does not address temporal scales of chronic disease including day-to-day variability, response to perturbations such as intercurrent infection, decompensation of the disease or response to therapeutic interventions and is rarely obtained over a life course. Observations will therefore always be limited by what is measured, when and in whom and will only ever provide estimates of what is ‘real’ within the larger group from which the sample is taken. Big data that includes large populations makes interpretations more robust and generalisable. Indeed, as the population studied or sample size approaches a majority, or at least a sizeable minority, of the whole population then the observations begin to no longer be estimates, but simply a description of the population.
History
Citation
Thorax, 2018, 73 (4), pp. 311-312
Author affiliation
/Organisation/COLLEGE OF LIFE SCIENCES/School of Medicine/Department of Infection, Immunity and Inflammation