Skip to main content
Top
Published in: BMC Medical Research Methodology 1/2019

Open Access 01-12-2019 | Debate

Does big data require a methodological change in medical research?

Authors: Amke Caliebe, Friedhelm Leverkus, Gerd Antes, Michael Krawczak

Published in: BMC Medical Research Methodology | Issue 1/2019

Login to get access

Abstract

Background

Use of big data is becoming increasingly popular in medical research. Since big data-based projects differ notably from classical research studies, both in terms of scope and quality, a debate is apt as to whether big data require new approaches to scientific reasoning different from those established in statistics and philosophy of science.

Main text

The progressing digitalization of our societies generates vast amounts of data that also become available for medical research. Here, the big promise of big data is to facilitate major improvements in the treatment, diagnosis and prevention of diseases. An ongoing examination of the idiosyncrasies of big data is therefore essential to ensure that the field stays congruent with the principles of evidence-based medicine. We discuss the inherent challenges and opportunities of big data in medicine from a methodological point of view, particularly highlighting the relative importance of causality and correlation in commercial and medical research settings. We make a strong case for upholding the distinction between exploratory data analysis facilitating hypothesis generation and confirmatory approaches involving hypothesis validation. An independent verification of research results will be ever more important in the context of big data, where data quality is often hampered by a lack of standardization and structuring.

Conclusions

We argue that it would be both unnecessary and dangerous to discard long-established principles of data generation, analysis and interpretation in the age of big data. While many medical research areas may reasonably benefit from big data analyses, they should nevertheless be complemented by carefully designed (prospective) studies.
Literature
2.
go back to reference Institute of Medicine (US). Roundtable on evidence-based medicine, the learning healthcare system: workshop summary. Washington (DC): National Academies Press (US); 2007. Institute of Medicine (US). Roundtable on evidence-based medicine, the learning healthcare system: workshop summary. Washington (DC): National Academies Press (US); 2007.
4.
go back to reference Almeder R: Pragmatism and science. In: The Routledge Companion to Philosophy of Science. Edited by Curd M, Psillos S: Routledge; 2008: 91–99. Almeder R: Pragmatism and science. In: The Routledge Companion to Philosophy of Science. Edited by Curd M, Psillos S: Routledge; 2008: 91–99.
5.
go back to reference Mazzocchi F. Could big data be the end of theory in science? EMBO Rep. 2015;16(10):1250–5.CrossRef Mazzocchi F. Could big data be the end of theory in science? EMBO Rep. 2015;16(10):1250–5.CrossRef
6.
go back to reference Ioannidis JPA. Why most published research findings are false. PLoS Med. 2005;2(8):e124.CrossRef Ioannidis JPA. Why most published research findings are false. PLoS Med. 2005;2(8):e124.CrossRef
7.
go back to reference Popper KR: Logik der Forschung: Springer; 1935. Popper KR: Logik der Forschung: Springer; 1935.
8.
go back to reference Moulines CU: Die Entwicklung der modernen Wissenschaftstheorie (1890–2000): eine historische Einführung: Lit; 2008. Moulines CU: Die Entwicklung der modernen Wissenschaftstheorie (1890–2000): eine historische Einführung: Lit; 2008.
10.
11.
go back to reference Mayer-Schönberger V, Cukier K: Big data: a revolution that will transform how we live, work, and think: Houghton Mifflin Harcourt; 2013. Mayer-Schönberger V, Cukier K: Big data: a revolution that will transform how we live, work, and think: Houghton Mifflin Harcourt; 2013.
12.
go back to reference Pearl J. An introduction to causal inference. Int J Biostat. 2010;6(2):Article 7.CrossRef Pearl J. An introduction to causal inference. Int J Biostat. 2010;6(2):Article 7.CrossRef
13.
go back to reference Broadbent A. Inferring causation in epidemiology: mechanisms, black boxes, and contrasts. In: Illari PM, Russo F, Williamson J, editors. Causality in the Sciences. Oxford: Oxford University Press; 2011. p. 45–69.CrossRef Broadbent A. Inferring causation in epidemiology: mechanisms, black boxes, and contrasts. In: Illari PM, Russo F, Williamson J, editors. Causality in the Sciences. Oxford: Oxford University Press; 2011. p. 45–69.CrossRef
16.
17.
go back to reference Schork NJ. Personalized medicine: time for one-person trials. Nature. 2015;520(7549):609–11.CrossRef Schork NJ. Personalized medicine: time for one-person trials. Nature. 2015;520(7549):609–11.CrossRef
18.
go back to reference Kuhn TS: The structure of scientific revolutions: 2nd edition, University of Chicago Press; 1970. Kuhn TS: The structure of scientific revolutions: 2nd edition, University of Chicago Press; 1970.
19.
go back to reference Kambli R: The primary challenge is to improve the usability of big data by healthcare sector. Express Healthcare http://wwwexpressbpdcom/healthcare/it-healthcare/the-primary-challenge-is-to-improve-the-usability-of-big-data-by-healthcare-sector/386292/ 8 June 2017. Kambli R: The primary challenge is to improve the usability of big data by healthcare sector. Express Healthcare http://wwwexpressbpdcom/healthcare/it-healthcare/the-primary-challenge-is-to-improve-the-usability-of-big-data-by-healthcare-sector/386292/ 8 June 2017.
20.
go back to reference Khera AV, Chaffin M, Aragam KG, Haas ME, Roselli C, Choi SH, Natarajan P, Lander ES, Lubitz SA, Ellinor PT, et al. Genome-wide polygenic scores for common diseases identify individuals with risk equivalent to monogenic mutations. Nat Genet. 2018;50(9):1219–24.CrossRef Khera AV, Chaffin M, Aragam KG, Haas ME, Roselli C, Choi SH, Natarajan P, Lander ES, Lubitz SA, Ellinor PT, et al. Genome-wide polygenic scores for common diseases identify individuals with risk equivalent to monogenic mutations. Nat Genet. 2018;50(9):1219–24.CrossRef
22.
go back to reference Schork AJ, Schork MA, Schork NJ. Genetic risks and clinical rewards. Nat Genet. 2018;50(9):1210–1.CrossRef Schork AJ, Schork MA, Schork NJ. Genetic risks and clinical rewards. Nat Genet. 2018;50(9):1210–1.CrossRef
Metadata
Title
Does big data require a methodological change in medical research?
Authors
Amke Caliebe
Friedhelm Leverkus
Gerd Antes
Michael Krawczak
Publication date
01-12-2019
Publisher
BioMed Central
Published in
BMC Medical Research Methodology / Issue 1/2019
Electronic ISSN: 1471-2288
DOI
https://doi.org/10.1186/s12874-019-0774-0

Other articles of this Issue 1/2019

BMC Medical Research Methodology 1/2019 Go to the issue