Skip to main content
Top
Published in: BMC Medical Informatics and Decision Making 1/2018

Open Access 01-03-2018 | Research

Causal risk factor discovery for severe acute kidney injury using electronic health records

Authors: Weiqi Chen, Yong Hu, Xiangzhou Zhang, Lijuan Wu, Kang Liu, Jianqin He, Zilin Tang, Xing Song, Lemuel R. Waitman, Mei Liu

Published in: BMC Medical Informatics and Decision Making | Special Issue 1/2018

Login to get access

Abstract

Background

Acute kidney injury (AKI), characterized by abrupt deterioration of renal function, is a common clinical event among hospitalized patients and it is associated with high morbidity and mortality. AKI is defined in three stages with stage-3 being the most severe phase which is irreversible. It is important to effectively discover the true risk factors in order to identify high-risk AKI patients and allow better targeting of tailored interventions. However, Stage-3 AKI patients are very rare (only 0.2% of AKI patients) with a large scale of features available in EHR (1917 potential risk features), yielding a scenario unfeasible for any correlation-based feature selection or modeling method. This study aims to discover the key factors and improve the detection of Stage-3 AKI.

Methods

A causal discovery method (McDSL) is adopted for causal discovery to infer true causal relationship between information buried in EHR (such as medication, diagnosis, laboratory tests, comorbidities and etc.) and Stage-3 AKI risk. The research approach comprised two major phases: data collection, and causal discovery. The first phase is propose to collect the data from HER (includes 358 encounters and 891 risk factors). Finally, McDSL is employed to discover the causal risk factors of Stage-3 AKI, and five well-known machine learning models are built for predicting Stage-3 AKI with 10-fold cross-validation (predictive accuracy were measured by AUC, precision, recall and F-score).

Results

McDSL is useful for further research of EHR. It is able to discover four causal features, all selected features are medications that are modifiable. The latest research of machine learning is employed to compare the performance of prediction, and the experimental result has verified the selected features are pivotal.

Conclusions

The features selected by McDSL, which enable us to achieve significant dimension reduction without sacrificing prediction accuracy, suggesting potential clinical use such as helping physicians develop better prevention and treatment strategies.
Literature
1.
go back to reference Waikar SS, Curhan GC, Ayanian JZ, et al. Race and mortality after acute renal failure. J Am Soc Nephrol. 2007;18:2740–48. Waikar SS, Curhan GC, Ayanian JZ, et al. Race and mortality after acute renal failure. J Am Soc Nephrol. 2007;18:2740–48.
2.
go back to reference Chertow GM, Burdick E, Honour M, et al. Acute kidney injury, mortality, length of stay, and costs in hospitalized patients. J Am Soc Nephrol. 2005;16:3365–70.CrossRefPubMed Chertow GM, Burdick E, Honour M, et al. Acute kidney injury, mortality, length of stay, and costs in hospitalized patients. J Am Soc Nephrol. 2005;16:3365–70.CrossRefPubMed
3.
go back to reference KDIGO: Kidney Disease: Improving Global Outcomes (KDIGO) Acute Kidney Injury Work Group. KDIGO clinical practice guideline for acute kidney injury. Kidney Int Suppl 2 2012, 1–138. KDIGO: Kidney Disease: Improving Global Outcomes (KDIGO) Acute Kidney Injury Work Group. KDIGO clinical practice guideline for acute kidney injury. Kidney Int Suppl 2 2012, 1–138.
4.
go back to reference Kate RJ, Perez RM, Mazumdar D, et al. Prediction and detection models for acute kidney injury in hospitalized older adults. BMC Med Inform Decis Mak. 2016;16:1–11.CrossRef Kate RJ, Perez RM, Mazumdar D, et al. Prediction and detection models for acute kidney injury in hospitalized older adults. BMC Med Inform Decis Mak. 2016;16:1–11.CrossRef
5.
go back to reference Thomas M, Sitch A, Dowswell G. The initial development and assessment of an automatic alert warning of acute kidney injury. Nephrology Dialysis Transplantation. 2011;26:2161–8.CrossRef Thomas M, Sitch A, Dowswell G. The initial development and assessment of an automatic alert warning of acute kidney injury. Nephrology Dialysis Transplantation. 2011;26:2161–8.CrossRef
6.
go back to reference Chen W, Hao Z, Cai R, et al. Multiple-causes discovery combined with structure learning for high dimensional discrete data and application to stock prediction. Soft Comput. 2016;20:4575–88.CrossRef Chen W, Hao Z, Cai R, et al. Multiple-causes discovery combined with structure learning for high dimensional discrete data and application to stock prediction. Soft Comput. 2016;20:4575–88.CrossRef
7.
go back to reference Waitman LR, Warren JJ, Manos EL, Connolly DW. Expressing observations from electronic medical record flowsheets in an i2b2 based clinical data repository to support research and quality improvement. In: AMIA Annu Symp proc; 2011. p. 1454–63. Waitman LR, Warren JJ, Manos EL, Connolly DW. Expressing observations from electronic medical record flowsheets in an i2b2 based clinical data repository to support research and quality improvement. In: AMIA Annu Symp proc; 2011. p. 1454–63.
8.
go back to reference Matheny ME, Miller RA, Ikizler TA, et al. Development of inpatient risk stratification models of acute kidney injury for use in electronic health records. Med Decis Making. 2010;30:639–50.CrossRefPubMedPubMedCentral Matheny ME, Miller RA, Ikizler TA, et al. Development of inpatient risk stratification models of acute kidney injury for use in electronic health records. Med Decis Making. 2010;30:639–50.CrossRefPubMedPubMedCentral
9.
go back to reference Peters J, Janzing D, Schölkopf B. Causal inference on discrete data using additive noise models. IEEE Trans Pattern Anal Mach Intell. 2011;33:2436–50. Peters J, Janzing D, Schölkopf B. Causal inference on discrete data using additive noise models. IEEE Trans Pattern Anal Mach Intell. 2011;33:2436–50.
10.
go back to reference Weiqi Chen, Kang Liu, Lijun Su, Mei Liu: Discovering many-to-one causality in software project risk analysis. In: Ninth International Conference on P2p, Parallel, Grid, Cloud and Internet Computing 2014, 316–323. Weiqi Chen, Kang Liu, Lijun Su, Mei Liu: Discovering many-to-one causality in software project risk analysis. In: Ninth International Conference on P2p, Parallel, Grid, Cloud and Internet Computing 2014, 316–323.
11.
go back to reference Denœux T. A k-nearest neighbor classification rule based on Dempster-Shafer theory. IEEE Transactions on Systems Man & Cybernetics. 1995;25:804–13. Denœux T. A k-nearest neighbor classification rule based on Dempster-Shafer theory. IEEE Transactions on Systems Man & Cybernetics. 1995;25:804–13.
12.
go back to reference Quinlan JR. Simplifying decision trees. Int J Man-Machine Studies. 1987;27:221–34.CrossRef Quinlan JR. Simplifying decision trees. Int J Man-Machine Studies. 1987;27:221–34.CrossRef
13.
go back to reference Sadeghi BHM. A BP-neural network predictor model for plastic injection molding process. J Mater Process Technol. 2000;103:411–6.CrossRef Sadeghi BHM. A BP-neural network predictor model for plastic injection molding process. J Mater Process Technol. 2000;103:411–6.CrossRef
14.
go back to reference Pal M. Random forest classifier for remote sensing classification. Int J Remote Sens. 2005;26:217–22.CrossRef Pal M. Random forest classifier for remote sensing classification. Int J Remote Sens. 2005;26:217–22.CrossRef
15.
go back to reference Wang H, Fan W, Yu P S, et al: Mining concept-drifting data streams using ensemble classifiers, In: ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 2003, 226–235. Wang H, Fan W, Yu P S, et al: Mining concept-drifting data streams using ensemble classifiers, In: ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 2003, 226–235.
16.
go back to reference Dietterich TG. Ensemble methods in machine learning. In: Multiple classifier systems, MCS 2000, Lecture Notes in Computer Science, Springer-Verlag, Berlin, Heidelberg. 2000;1857:1–15. Dietterich TG. Ensemble methods in machine learning. In: Multiple classifier systems, MCS 2000, Lecture Notes in Computer Science, Springer-Verlag, Berlin, Heidelberg. 2000;1857:1–15.
17.
go back to reference Ohnuma T, Uchino S. Prediction models and their external validation studies for mortality of patients with acute kidney injury: a systematic review. PLoS One. 2017;12:e0169341.CrossRefPubMedPubMedCentral Ohnuma T, Uchino S. Prediction models and their external validation studies for mortality of patients with acute kidney injury: a systematic review. PLoS One. 2017;12:e0169341.CrossRefPubMedPubMedCentral
18.
go back to reference Wu J, Roy J, Stewart WF. Prediction modeling using EHR data: challenges, strategies, and a comparison of machine learning approaches. Med Care. 2010;48:S106.CrossRefPubMed Wu J, Roy J, Stewart WF. Prediction modeling using EHR data: challenges, strategies, and a comparison of machine learning approaches. Med Care. 2010;48:S106.CrossRefPubMed
19.
go back to reference Sugihara G, May R, Ye H, et al. Detecting causality in complex ecosystems. Science. 2012;338:496–500. Sugihara G, May R, Ye H, et al. Detecting causality in complex ecosystems. Science. 2012;338:496–500.
20.
go back to reference Liu M, Cai R, Hu Y, et al. Determining molecular predictors of adverse drug reactions with causality analysis based on structure learning. J Am Med Inform Assoc. 2014;21:245–51.CrossRefPubMed Liu M, Cai R, Hu Y, et al. Determining molecular predictors of adverse drug reactions with causality analysis based on structure learning. J Am Med Inform Assoc. 2014;21:245–51.CrossRefPubMed
21.
go back to reference Zheng T, Xie W, Xu L, et al. A machine learning-based framework to identify type 2 diabetes through electronic health records. Int J Med Inform. 2017;97:120–127.CrossRefPubMed Zheng T, Xie W, Xu L, et al. A machine learning-based framework to identify type 2 diabetes through electronic health records. Int J Med Inform. 2017;97:120–127.CrossRefPubMed
Metadata
Title
Causal risk factor discovery for severe acute kidney injury using electronic health records
Authors
Weiqi Chen
Yong Hu
Xiangzhou Zhang
Lijuan Wu
Kang Liu
Jianqin He
Zilin Tang
Xing Song
Lemuel R. Waitman
Mei Liu
Publication date
01-03-2018
Publisher
BioMed Central
DOI
https://doi.org/10.1186/s12911-018-0597-7

Other articles of this Special Issue 1/2018

BMC Medical Informatics and Decision Making 1/2018 Go to the issue