Published in: BMC Medical Informatics and Decision Making 1/2017

Open Access 01-12-2017 | Research article

Validation of multisource electronic health record data: an application to blood transfusion data

Authors: Loan R. van Hoeven, Martine C. de Bruijne, Peter F. Kemper, Maria M.W. Koopman, Jan M.M. Rondeel, Anja Leyte, Hendrik Koffijberg, Mart P. Janssen, Kit C.B. Roes


Abstract

Background

Although data from electronic health records (EHR) are often used for research purposes, systematic validation of these data prior to their use is not standard practice. Existing validation frameworks discuss validity concepts without translating these into practical implementation steps or addressing the potential influence of linking multiple sources. We therefore developed a practical approach for validating routinely collected data from multiple sources and applied it to a blood transfusion data warehouse to evaluate its usability in practice.

Methods

The approach consists of identifying existing validation frameworks for EHR data or linked data, selecting validity concepts from these frameworks and establishing quantifiable validity outcomes for each concept. The approach distinguishes between external validation concepts (e.g. concordance with external reports, previous literature and expert feedback) and internal consistency concepts, which use expected associations within the dataset itself (e.g. completeness, uniformity and plausibility). In an example case, the selected concepts were applied to a transfusion dataset and specified in more detail.
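To make the idea of a quantifiable validity outcome concrete, here is a minimal Python sketch of two internal consistency checks, completeness and plausibility. It is an illustration only and not the authors' implementation: the records, the field names (`issue_date`, `transfusion_date`, `patient_age`) and the plausibility rule (a product should not be transfused before it is issued) are all assumptions made for this example.

```python
from datetime import date

# Hypothetical transfusion records; all field names and values are illustrative.
records = [
    {"product_id": "P1", "issue_date": date(2015, 3, 1),
     "transfusion_date": date(2015, 3, 2), "patient_age": 64},
    {"product_id": "P2", "issue_date": date(2015, 3, 5),
     "transfusion_date": date(2015, 3, 4), "patient_age": 71},
    {"product_id": "P3", "issue_date": date(2015, 3, 7),
     "transfusion_date": None, "patient_age": 58},
    {"product_id": "P4", "issue_date": date(2015, 3, 9),
     "transfusion_date": date(2015, 3, 9), "patient_age": None},
]


def completeness(rows, field):
    """Proportion of rows in which `field` is not missing."""
    if not rows:
        return 0.0
    return sum(r[field] is not None for r in rows) / len(rows)


def plausible_date_order(rows):
    """Among rows with both dates present, the proportion in which
    the transfusion date is on or after the issue date."""
    checkable = [
        r for r in rows
        if r["issue_date"] is not None and r["transfusion_date"] is not None
    ]
    if not checkable:
        return 0.0
    ok = sum(r["transfusion_date"] >= r["issue_date"] for r in checkable)
    return ok / len(checkable)


completeness_transfusion_date = completeness(records, "transfusion_date")
plausibility_dates = plausible_date_order(records)
print(f"completeness: {completeness_transfusion_date:.2f}")
print(f"plausibility: {plausibility_dates:.2f}")
```

Expressing each concept as a proportion makes it straightforward to compare values across sources or before and after a change in data processing.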

Results

Application of the approach to a transfusion dataset resulted in a structured overview of data validity aspects. This allowed improvement of these aspects through further processing of the data and, in some cases, adjustment of the data extraction. For example, the proportion of transfused products that could not be linked to the corresponding issued products was initially 2.2% but was reduced to 0.17% by adjusting the data extraction criteria.
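The linkage outcome reported above is a simple proportion: the number of transfused products with no matching issued product, divided by all transfused products. The Python sketch below shows one way such a figure could be computed. It assumes linkage on a shared product identifier, which the abstract does not specify, and it uses toy identifiers rather than data from the study.

```python
def unlinked_proportion(transfused_ids, issued_ids):
    """Proportion of transfused products with no matching issued product.

    Assumes both sources share a product identifier (a hypothetical
    linkage key chosen for this illustration).
    """
    issued = set(issued_ids)
    transfused = list(transfused_ids)
    if not transfused:
        return 0.0
    unlinked = sum(1 for pid in transfused if pid not in issued)
    return unlinked / len(transfused)


# Toy identifiers, not study data.
issued_ids = ["A", "B", "C", "D"]
transfused_ids = ["A", "B", "E", "D", "F"]

proportion = unlinked_proportion(transfused_ids, issued_ids)
print(f"unlinked: {proportion:.1%}")
```

Recomputing the same proportion after each change to the extraction criteria gives a direct measure of whether the change improved linkage.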

Conclusions

This stepwise approach for validating linked multisource data provides a basis for evaluating data quality and enhancing interpretation. Broader adoption of data validation would contribute to increased transparency and greater reliability of research based on routinely collected electronic health records.
Metadata
Title
Validation of multisource electronic health record data: an application to blood transfusion data
Authors
Loan R. van Hoeven
Martine C. de Bruijne
Peter F. Kemper
Maria M.W. Koopman
Jan M.M. Rondeel
Anja Leyte
Hendrik Koffijberg
Mart P. Janssen
Kit C.B. Roes
Publication date
01-12-2017
Publisher
BioMed Central
Published in
BMC Medical Informatics and Decision Making / Issue 1/2017
Electronic ISSN: 1472-6947
DOI
https://doi.org/10.1186/s12911-017-0504-7
