Skip to main content
Top
Published in: BMC Pediatrics 1/2016

Open Access 01-12-2016 | Research article

Quality of EHR data extractions for studies of preterm birth in a tertiary care center: guidelines for obtaining reliable data

Authors: Lindsey A. Knake, Monika Ahuja, Erin L. McDonald, Kelli K. Ryckman, Nancy Weathers, Todd Burstain, John M. Dagle, Jeffrey C. Murray, Prakash Nadkarni

Published in: BMC Pediatrics | Issue 1/2016

Login to get access

Abstract

Background

The use of Electronic Health Records (EHR) has increased significantly in the past 15 years. This study compares electronic vs. manual data abstractions from an EHR for accuracy. While the dataset is limited to preterm birth data, our work is generally applicable. We enumerate challenges to reliable extraction, and state guidelines to maximize reliability.

Methods

An Epic™ EHR data extraction of structured data values from 1,772 neonatal records born between the years 2001–2011 was performed. The data were directly compared to a manually-abstracted database. Specific data values important to studies of perinatology were chosen to compare discrepancies between the two databases.

Results

Discrepancy rates between the EHR extraction and the manual database were calculated for gestational age in weeks (2.6 %), birthweight (9.7 %), first white blood cell count (3.2 %), initial hemoglobin (11.9 %), peak total and direct bilirubin (11.4 % and 4.9 %), and patent ductus arteriosus (PDA) diagnosis (12.8 %). Using the discrepancies, errors were quantified in both datasets using chart review. The EHR extraction errors were significantly fewer than manual abstraction errors for PDA and laboratory values excluding neonates transferred from outside hospitals, but significantly greater for birth weight. Reasons for the observed errors are discussed.

Conclusions

We show that an EHR not modified specifically for research purposes had discrepancy ranges comparable to a manually created database. We offer guidelines to minimize EHR extraction errors in future study designs. As EHRs become more research-friendly, electronic chart extractions should be more efficient and have lower error rates compared to manual abstractions.
Appendix
Available only for authorised users
Literature
2.
go back to reference Melton J, Simon AR, Gray J. SQL 1999: Understanding Relational Language Components. San Mateo: Morgan Kaufman; 2001. Melton J, Simon AR, Gray J. SQL 1999: Understanding Relational Language Components. San Mateo: Morgan Kaufman; 2001.
3.
go back to reference Spong CY. Defining “term” pregnancy: recommendations from the Defining “Term“ Pregnancy Workgroup. JAMA. 2013;309(23):2445–6.CrossRefPubMed Spong CY. Defining “term” pregnancy: recommendations from the Defining “Term“ Pregnancy Workgroup. JAMA. 2013;309(23):2445–6.CrossRefPubMed
4.
go back to reference Centers for Medicare & Medicaid Services. Medicare and Medicaid Programs: Electronic Health Record Incentive Program. Federal Register. 2010;75(1):1844. Centers for Medicare & Medicaid Services. Medicare and Medicaid Programs: Electronic Health Record Incentive Program. Federal Register. 2010;75(1):1844.
5.
go back to reference Wasserman RC. Electronic medical records (EMRs), epidemiology, and epistemology: reflections on EMRs and future pediatric clinical research. Acad Pediatr. 2011;11(4):280–7.CrossRefPubMedPubMedCentral Wasserman RC. Electronic medical records (EMRs), epidemiology, and epistemology: reflections on EMRs and future pediatric clinical research. Acad Pediatr. 2011;11(4):280–7.CrossRefPubMedPubMedCentral
6.
go back to reference Pathak J, Bailey KR, Beebe CE, Bethard S, Carrell DC, Chen PJ, Dligach D, Endle CM, Hart LA, Haug PJ, et al. Normalization and standardization of electronic health records for high-throughput phenotyping: the SHARPn consortium. J Am Med Inform Assoc. 2013;20(e2):e341–348.CrossRefPubMedPubMedCentral Pathak J, Bailey KR, Beebe CE, Bethard S, Carrell DC, Chen PJ, Dligach D, Endle CM, Hart LA, Haug PJ, et al. Normalization and standardization of electronic health records for high-throughput phenotyping: the SHARPn consortium. J Am Med Inform Assoc. 2013;20(e2):e341–348.CrossRefPubMedPubMedCentral
8.
go back to reference McCoy AB, Wright A, Eysenbach G, Malin BA, Patterson ES, Xu H, Sittig DF. State of the art in clinical informatics: evidence and examples. Yearb Med Inform. 2013;8(1):13–9.PubMed McCoy AB, Wright A, Eysenbach G, Malin BA, Patterson ES, Xu H, Sittig DF. State of the art in clinical informatics: evidence and examples. Yearb Med Inform. 2013;8(1):13–9.PubMed
9.
go back to reference Freund J, Meiman J, Kraus C. Using electroinic medial record data to characterize the level of medication use by age-groups in a network of primary care clinics. J Prim Care Community Health. 2013;4(4):286–93.CrossRefPubMed Freund J, Meiman J, Kraus C. Using electroinic medial record data to characterize the level of medication use by age-groups in a network of primary care clinics. J Prim Care Community Health. 2013;4(4):286–93.CrossRefPubMed
10.
go back to reference Wu ST, Sohn S, Ravikumar KE, Wagholikar K, Jonnalagadda SR, Liu H, Juhn YJ. Automated chart review for asthma cohort identification using natural language processing: an exploratory study. Ann Allergy Asthma Immunol. 2013;111(5):364–9.CrossRefPubMed Wu ST, Sohn S, Ravikumar KE, Wagholikar K, Jonnalagadda SR, Liu H, Juhn YJ. Automated chart review for asthma cohort identification using natural language processing: an exploratory study. Ann Allergy Asthma Immunol. 2013;111(5):364–9.CrossRefPubMed
11.
go back to reference Steidl M, Zimmern P. Data for free--can an electronic medical record provide outcome data for incontinence/prolapse repair procedures? J Urol. 2013;189(1):194–9.CrossRefPubMed Steidl M, Zimmern P. Data for free--can an electronic medical record provide outcome data for incontinence/prolapse repair procedures? J Urol. 2013;189(1):194–9.CrossRefPubMed
12.
go back to reference Newgard CD, Zive D, Jui J, Weathers C, Daya M. Electronic versus manual data processing: evaluating the use of electronic health records in out-of-hospital clinical research. Acad Emerg Med. 2012;19(2):217–27.CrossRefPubMedPubMedCentral Newgard CD, Zive D, Jui J, Weathers C, Daya M. Electronic versus manual data processing: evaluating the use of electronic health records in out-of-hospital clinical research. Acad Emerg Med. 2012;19(2):217–27.CrossRefPubMedPubMedCentral
14.
go back to reference Hartung J, Knapp G, Sinha B. Statistical Meta-Analysis with Applications. 2008.CrossRef Hartung J, Knapp G, Sinha B. Statistical Meta-Analysis with Applications. 2008.CrossRef
Metadata
Title
Quality of EHR data extractions for studies of preterm birth in a tertiary care center: guidelines for obtaining reliable data
Authors
Lindsey A. Knake
Monika Ahuja
Erin L. McDonald
Kelli K. Ryckman
Nancy Weathers
Todd Burstain
John M. Dagle
Jeffrey C. Murray
Prakash Nadkarni
Publication date
01-12-2016
Publisher
BioMed Central
Published in
BMC Pediatrics / Issue 1/2016
Electronic ISSN: 1471-2431
DOI
https://doi.org/10.1186/s12887-016-0592-z

Other articles of this Issue 1/2016

BMC Pediatrics 1/2016 Go to the issue