Skip to main content
Top
Published in: BMC Medical Informatics and Decision Making 1/2019

Open Access 01-12-2019 | Research article

Feasibility analysis of conducting observational studies with the electronic health record

Authors: Marcel von Lucadou, Thomas Ganslandt, Hans-Ulrich Prokosch, Dennis Toddenroth

Published in: BMC Medical Informatics and Decision Making | Issue 1/2019

Login to get access

Abstract

Background

The secondary use of electronic health records (EHRs) promises to facilitate medical research. We reviewed general data requirements in observational studies and analyzed the feasibility of conducting observational studies with structured EHR data, in particular diagnosis and procedure codes.

Methods

After reviewing published observational studies from the University Hospital of Erlangen for general data requirements, we identified three different study populations for the feasibility analysis with eligibility criteria from three exemplary observational studies. For each study population, we evaluated the availability of relevant patient characteristics in our EHR, including outcome and exposure variables. To assess data quality, we computed distributions of relevant patient characteristics from the available structured EHR data and compared them to those of the original studies. We implemented computed phenotypes for patient characteristics where necessary. In random samples, we evaluated how well structured patient characteristics agreed with a gold standard from manually interpreted free texts. We categorized our findings using the four data quality dimensions “completeness”, “correctness”, “currency” and “granularity”.

Results

Reviewing general data requirements, we found that some investigators supplement routine data with questionnaires, interviews and follow-up examinations. We included 847 subjects in the feasibility analysis (Study 1 n = 411, Study 2 n = 423, Study 3 n = 13). All eligibility criteria from two studies were available in structured data, while one study required computed phenotypes in eligibility criteria. In one study, we found that all necessary patient characteristics were documented at least once in either structured or unstructured data. In another study, all exposure and outcome variables were available in structured data, while in the other one unstructured data had to be consulted. The comparison of patient characteristics distributions, as computed from structured data, with those from the original study yielded similar distributions as well as indications of underreporting. We observed violations in all four data quality dimensions.

Conclusions

While we found relevant patient characteristics available in structured EHR data, data quality problems may entail that it remains a case-by-case decision whether diagnosis and procedure codes are sufficient to underpin observational studies. Free-text data or subsequently supplementary study data may be important to complement a comprehensive patient history.
Appendix
Available only for authorised users
Literature
2.
go back to reference Röhrig B, et al. Types of study in medical research: part 3 of a series on evaluation of scientific publications. Dtsch Arztebl Int. 2009;106(15):262–8.PubMedPubMedCentral Röhrig B, et al. Types of study in medical research: part 3 of a series on evaluation of scientific publications. Dtsch Arztebl Int. 2009;106(15):262–8.PubMedPubMedCentral
3.
go back to reference Frieden TR. Evidence for health decision making — beyond randomized, controlled trials. N Engl J Med. 2017;377(5):465–75.CrossRefPubMed Frieden TR. Evidence for health decision making — beyond randomized, controlled trials. N Engl J Med. 2017;377(5):465–75.CrossRefPubMed
4.
go back to reference Vandenbroucke JP. When are observational studies as credible as randomised trials? Lancet. 2004;363(9422):1728–31. Vandenbroucke JP. When are observational studies as credible as randomised trials? Lancet. 2004;363(9422):1728–31.
5.
go back to reference Concato J, Shah N, Horwitz RI. Randomized, controlled trials, observational studies, and the hierarchy of research designs. N Engl J Med. 2000;342(25):1887–92.CrossRefPubMedPubMedCentral Concato J, Shah N, Horwitz RI. Randomized, controlled trials, observational studies, and the hierarchy of research designs. N Engl J Med. 2000;342(25):1887–92.CrossRefPubMedPubMedCentral
6.
go back to reference Rothman KJ, GS, Lash TL. Modern epidemiology. Philadelphia: Lippincott Williams & Wilkins; 2008. p. 95–7. Rothman KJ, GS, Lash TL. Modern epidemiology. Philadelphia: Lippincott Williams & Wilkins; 2008. p. 95–7.
7.
go back to reference Prokosch HU, Ganslandt T. Perspectives for medical informatics. Reusing the electronic medical record for clinical research. Methods Inf Med. 2009;48(1):38–44.CrossRefPubMed Prokosch HU, Ganslandt T. Perspectives for medical informatics. Reusing the electronic medical record for clinical research. Methods Inf Med. 2009;48(1):38–44.CrossRefPubMed
8.
go back to reference Köpcke F, Prokosch HU. Employing Computers for the Recruitment into Clinical Trials: A Comprehensive Systematic Review. J Med Internet Res. 2014;16(7). Köpcke F, Prokosch HU. Employing Computers for the Recruitment into Clinical Trials: A Comprehensive Systematic Review. J Med Internet Res. 2014;16(7).
9.
go back to reference Frankovich J, Longhurst CA, Sutherland Evidence-Based SM. Medicine in the EMR era. N Engl J Med. 2011;365(19):1758–9.CrossRefPubMed Frankovich J, Longhurst CA, Sutherland Evidence-Based SM. Medicine in the EMR era. N Engl J Med. 2011;365(19):1758–9.CrossRefPubMed
10.
go back to reference Xu H, et al. Validating drug repurposing signals using electronic health records: a case study of metformin associated with reduced cancer mortality. J Am Med Inform Assoc. 2015;22:179–91. Xu H, et al. Validating drug repurposing signals using electronic health records: a case study of metformin associated with reduced cancer mortality. J Am Med Inform Assoc. 2015;22:179–91.
11.
go back to reference Sherman RE, et al. Real-world Evidence — what is it and what can it tell us? N Engl J Med. 2016;375(23):2293–7.CrossRefPubMed Sherman RE, et al. Real-world Evidence — what is it and what can it tell us? N Engl J Med. 2016;375(23):2293–7.CrossRefPubMed
13.
go back to reference Agniel D, Kohane IS, Weber GM. Biases in electronic health record data due to processes within the healthcare system: retrospective observational study. BMJ. 2018;361:k1479. Agniel D, Kohane IS, Weber GM. Biases in electronic health record data due to processes within the healthcare system: retrospective observational study. BMJ. 2018;361:k1479.
14.
go back to reference Botsis T, et al. Secondary use of EHR: data quality issues and informatics opportunities. Summit Transl Bioinform. 2010;2010:1–5.PubMedPubMedCentral Botsis T, et al. Secondary use of EHR: data quality issues and informatics opportunities. Summit Transl Bioinform. 2010;2010:1–5.PubMedPubMedCentral
15.
go back to reference Hersh WR, et al. Caveats for the Use of Operational Electronic Health Record Data in Comparative Effectiveness Research. Med Care. 2013;51(8 0 3):S30–7.CrossRefPubMedPubMedCentral Hersh WR, et al. Caveats for the Use of Operational Electronic Health Record Data in Comparative Effectiveness Research. Med Care. 2013;51(8 0 3):S30–7.CrossRefPubMedPubMedCentral
16.
go back to reference Hripcsak G, Albers DJ. Next-generation phenotyping of electronic health records. J Am Med Inform Assoc. 2013;20(1):117–21.CrossRefPubMed Hripcsak G, Albers DJ. Next-generation phenotyping of electronic health records. J Am Med Inform Assoc. 2013;20(1):117–21.CrossRefPubMed
18.
go back to reference Brunner TB, Sauer R, Fietkau R. Gemcitabine/cisplatin versus 5-fluorouracil/mitomycin C chemoradiotherapy in locally advanced pancreatic cancer: a retrospective analysis of 93 patients. Radiat Oncol. 2011;6:88.CrossRefPubMedPubMedCentral Brunner TB, Sauer R, Fietkau R. Gemcitabine/cisplatin versus 5-fluorouracil/mitomycin C chemoradiotherapy in locally advanced pancreatic cancer: a retrospective analysis of 93 patients. Radiat Oncol. 2011;6:88.CrossRefPubMedPubMedCentral
19.
go back to reference Hagel AF, et al. Perforation during esophageal dilatation: a 10-year experience. J Gastrointestin Liver Dis. 2013;22(4):385–9.PubMed Hagel AF, et al. Perforation during esophageal dilatation: a 10-year experience. J Gastrointestin Liver Dis. 2013;22(4):385–9.PubMed
20.
go back to reference Loos B, Puschkin V, Horch RE. 50 years experience with Dupuytren's contracture in the Erlangen University Hospital--a retrospective analysis of 2919 operated hands from 1956 to 2006. BMC Musculoskelet Disord. 2007;8:60.CrossRefPubMedPubMedCentral Loos B, Puschkin V, Horch RE. 50 years experience with Dupuytren's contracture in the Erlangen University Hospital--a retrospective analysis of 2919 operated hands from 1956 to 2006. BMC Musculoskelet Disord. 2007;8:60.CrossRefPubMedPubMedCentral
22.
go back to reference Kahn MG, Eliason BB, Bathurst J. Quantifying clinical data quality using relative gold standards. AMIA Annu Symp Proc. 2010;2010:356–60.PubMedPubMedCentral Kahn MG, Eliason BB, Bathurst J. Quantifying clinical data quality using relative gold standards. AMIA Annu Symp Proc. 2010;2010:356–60.PubMedPubMedCentral
23.
go back to reference Romano PS, et al. A comparison of administrative versus clinical data: coronary artery bypass surgery as an example. Ischemic heart disease patient outcomes research Team. J Clin Epidemiol. 1994;47(3):249–60.CrossRefPubMed Romano PS, et al. A comparison of administrative versus clinical data: coronary artery bypass surgery as an example. Ischemic heart disease patient outcomes research Team. J Clin Epidemiol. 1994;47(3):249–60.CrossRefPubMed
24.
go back to reference Weintraub WS, et al. Can cardiovascular clinical characteristics be identified and outcome models be developed from an in-patient claims database? Am J Cardiol. 2004;84(2):166–9. Weintraub WS, et al. Can cardiovascular clinical characteristics be identified and outcome models be developed from an in-patient claims database? Am J Cardiol. 2004;84(2):166–9.
26.
go back to reference Lawson EH, et al. A comparison of clinical registry versus administrative claims data for reporting of 30-day surgical complications. Ann Surg. 2012;256(6):973–81.CrossRefPubMed Lawson EH, et al. A comparison of clinical registry versus administrative claims data for reporting of 30-day surgical complications. Ann Surg. 2012;256(6):973–81.CrossRefPubMed
27.
go back to reference Rosenbloom ST, et al. Data from clinical notes: a perspective on the tension between structure and flexible documentation. J Am Med Inform Assoc. 2011;18(2):181–6.CrossRefPubMedPubMedCentral Rosenbloom ST, et al. Data from clinical notes: a perspective on the tension between structure and flexible documentation. J Am Med Inform Assoc. 2011;18(2):181–6.CrossRefPubMedPubMedCentral
28.
go back to reference Gologorsky Y, et al. Improving discharge data fidelity for use in large administrative databases. Neurosurg Focus. 2014;36(6):E2.CrossRefPubMed Gologorsky Y, et al. Improving discharge data fidelity for use in large administrative databases. Neurosurg Focus. 2014;36(6):E2.CrossRefPubMed
29.
go back to reference Dixon J, et al. Assessment of the reproducibility of clinical coding in routinely collected hospital activity data: a study in two hospitals. J Public Health Med. 1998;20(1):63–9.CrossRefPubMed Dixon J, et al. Assessment of the reproducibility of clinical coding in routinely collected hospital activity data: a study in two hospitals. J Public Health Med. 1998;20(1):63–9.CrossRefPubMed
31.
go back to reference DP Ryan. Initial chemotherapy and radiation for nonmetastatic, locally advanced, unresectable and borderline resectable, exocrine pancreatic cancer, in UpToDate, P. TW, Editor.: UpToDate, Waltham, MA (Accessed on August 26, 2017.) DP Ryan. Initial chemotherapy and radiation for nonmetastatic, locally advanced, unresectable and borderline resectable, exocrine pancreatic cancer, in UpToDate, P. TW, Editor.: UpToDate, Waltham, MA (Accessed on August 26, 2017.)
32.
go back to reference Callery MP, et al. Pretreatment assessment of Resectable and borderline Resectable pancreatic Cancer: expert consensus statement. Ann Surg Oncol. 2009;16(7):1727–33.CrossRefPubMed Callery MP, et al. Pretreatment assessment of Resectable and borderline Resectable pancreatic Cancer: expert consensus statement. Ann Surg Oncol. 2009;16(7):1727–33.CrossRefPubMed
33.
go back to reference Keltie K, et al. Identifying complications of interventional procedures from UK routine healthcare databases: a systematic search for methods using clinical codes. BMC Med Res Methodol. 2014;14(1):126.CrossRefPubMedPubMedCentral Keltie K, et al. Identifying complications of interventional procedures from UK routine healthcare databases: a systematic search for methods using clinical codes. BMC Med Res Methodol. 2014;14(1):126.CrossRefPubMedPubMedCentral
36.
go back to reference Weiskopf NG, et al. A Data Quality Assessment Guideline for Electronic Health Record Data Reuse. eGEMs. 2017;5(1). Weiskopf NG, et al. A Data Quality Assessment Guideline for Electronic Health Record Data Reuse. eGEMs. 2017;5(1).
37.
go back to reference Weiskopf NG, et al. Defining and measuring completeness of electronic health records for secondary use. J Biomed Inform. 2013;46(5):830–6.CrossRefPubMed Weiskopf NG, et al. Defining and measuring completeness of electronic health records for secondary use. J Biomed Inform. 2013;46(5):830–6.CrossRefPubMed
38.
go back to reference Weiskopf NG, Weng C. Methods and dimensions of electronic health record data quality assessment: enabling reuse for clinical research. J Am Med Inform Assoc. 2013;20(1):144–51.CrossRefPubMedPubMedCentral Weiskopf NG, Weng C. Methods and dimensions of electronic health record data quality assessment: enabling reuse for clinical research. J Am Med Inform Assoc. 2013;20(1):144–51.CrossRefPubMedPubMedCentral
39.
go back to reference Iezzoni LI. Assessing quality using administrative data. Ann Intern Med. 1997;127(8_Part_2):666–74.CrossRefPubMed Iezzoni LI. Assessing quality using administrative data. Ann Intern Med. 1997;127(8_Part_2):666–74.CrossRefPubMed
40.
go back to reference Jollis JG, et al. Discordance of databases designed for claims payment versus clinical information systems. Implications for outcomes research. Ann Intern Med. 1993;119(8):844–50.CrossRefPubMed Jollis JG, et al. Discordance of databases designed for claims payment versus clinical information systems. Implications for outcomes research. Ann Intern Med. 1993;119(8):844–50.CrossRefPubMed
41.
go back to reference Parker JP, et al. Administrative versus clinical data for coronary artery bypass graft surgery report cards: the view from California. Med Care. 2006;44(7):687–95.CrossRefPubMed Parker JP, et al. Administrative versus clinical data for coronary artery bypass graft surgery report cards: the view from California. Med Care. 2006;44(7):687–95.CrossRefPubMed
44.
go back to reference Steinwald B, Dummit LA. Hospital case-mix change: sicker patients or DRG creep? Health Aff (Millwood). 1989;8(2):35–47.CrossRef Steinwald B, Dummit LA. Hospital case-mix change: sicker patients or DRG creep? Health Aff (Millwood). 1989;8(2):35–47.CrossRef
45.
go back to reference Derby CA, et al. Possible effect of DRGs on the classification of stroke: implications for epidemiological surveillance. Stroke. 2001;32(7):1487–91.CrossRefPubMed Derby CA, et al. Possible effect of DRGs on the classification of stroke: implications for epidemiological surveillance. Stroke. 2001;32(7):1487–91.CrossRefPubMed
46.
go back to reference Assaf AR, et al. Possible influence of the prospective payment system on the assignment of discharge diagnoses for coronary heart disease. N Engl J Med. 1993;329(13):931–5.CrossRefPubMed Assaf AR, et al. Possible influence of the prospective payment system on the assignment of discharge diagnoses for coronary heart disease. N Engl J Med. 1993;329(13):931–5.CrossRefPubMed
47.
go back to reference Serden L, Lindqvist R, Rosen M. Have DRG-based prospective payment systems influenced the number of secondary diagnoses in health care administrative data? Health Policy. 2003;65(2):101–7.CrossRefPubMed Serden L, Lindqvist R, Rosen M. Have DRG-based prospective payment systems influenced the number of secondary diagnoses in health care administrative data? Health Policy. 2003;65(2):101–7.CrossRefPubMed
48.
go back to reference Januel JM, et al. Improved accuracy of co-morbidity coding over time after the introduction of ICD-10 administrative data. BMC Health Serv Res. 2011;11:194.CrossRefPubMedPubMedCentral Januel JM, et al. Improved accuracy of co-morbidity coding over time after the introduction of ICD-10 administrative data. BMC Health Serv Res. 2011;11:194.CrossRefPubMedPubMedCentral
49.
go back to reference Rothman KJ, Greenland S. Causation and causal inference in epidemiology. Am J Public Health. 2005;95(S1):S144–50.CrossRefPubMed Rothman KJ, Greenland S. Causation and causal inference in epidemiology. Am J Public Health. 2005;95(S1):S144–50.CrossRefPubMed
50.
go back to reference Raghavan P, et al. How essential are unstructured clinical narratives and information fusion to clinical trial recruitment? AMIA Jt Summits Transl Sci Proc. 2014;2014:218–23.PubMedPubMedCentral Raghavan P, et al. How essential are unstructured clinical narratives and information fusion to clinical trial recruitment? AMIA Jt Summits Transl Sci Proc. 2014;2014:218–23.PubMedPubMedCentral
51.
go back to reference Carrell DS, et al. Using natural language processing to improve efficiency of manual chart abstraction in research: the case of breast cancer recurrence. Am J Epidemiol. 2014;179(6):749–58.CrossRefPubMedPubMedCentral Carrell DS, et al. Using natural language processing to improve efficiency of manual chart abstraction in research: the case of breast cancer recurrence. Am J Epidemiol. 2014;179(6):749–58.CrossRefPubMedPubMedCentral
52.
go back to reference Luepker RV, et al. Case definitions for acute coronary heart disease in epidemiology and clinical research studies: a statement from the AHA Council on epidemiology and prevention; AHA statistics committee; world heart federation council on epidemiology and prevention; the European Society of Cardiology Working Group on epidemiology and prevention; Centers for Disease Control and Prevention; and the National Heart, Lung, and Blood Institute. Circulation. 2003;108(20):2543–9.CrossRefPubMed Luepker RV, et al. Case definitions for acute coronary heart disease in epidemiology and clinical research studies: a statement from the AHA Council on epidemiology and prevention; AHA statistics committee; world heart federation council on epidemiology and prevention; the European Society of Cardiology Working Group on epidemiology and prevention; Centers for Disease Control and Prevention; and the National Heart, Lung, and Blood Institute. Circulation. 2003;108(20):2543–9.CrossRefPubMed
53.
go back to reference Baqui AH, et al. Methodological issues in diarrhoeal diseases epidemiology: definition of diarrhoeal episodes. Int J Epidemiol. 1991;20(4):1057–63.CrossRefPubMed Baqui AH, et al. Methodological issues in diarrhoeal diseases epidemiology: definition of diarrhoeal episodes. Int J Epidemiol. 1991;20(4):1057–63.CrossRefPubMed
Metadata
Title
Feasibility analysis of conducting observational studies with the electronic health record
Authors
Marcel von Lucadou
Thomas Ganslandt
Hans-Ulrich Prokosch
Dennis Toddenroth
Publication date
01-12-2019
Publisher
BioMed Central
Published in
BMC Medical Informatics and Decision Making / Issue 1/2019
Electronic ISSN: 1472-6947
DOI
https://doi.org/10.1186/s12911-019-0939-0

Other articles of this Issue 1/2019

BMC Medical Informatics and Decision Making 1/2019 Go to the issue