Skip to main content
Top
Published in: BMC Medical Informatics and Decision Making 1/2017

Open Access 01-12-2017 | Research article

Using multiclass classification to automate the identification of patient safety incident reports by type and severity

Authors: Ying Wang, Enrico Coiera, William Runciman, Farah Magrabi

Published in: BMC Medical Informatics and Decision Making | Issue 1/2017

Login to get access

Abstract

Background

Approximately 10% of admissions to acute-care hospitals are associated with an adverse event. Analysis of incident reports helps to understand how and why incidents occur and can inform policy and practice for safer care. Unfortunately our capacity to monitor and respond to incident reports in a timely manner is limited by the sheer volumes of data collected. In this study, we aim to evaluate the feasibility of using multiclass classification to automate the identification of patient safety incidents in hospitals.

Methods

Text based classifiers were applied to identify 10 incident types and 4 severity levels. Using the one-versus-one (OvsO) and one-versus-all (OvsA) ensemble strategies, we evaluated regularized logistic regression, linear support vector machine (SVM) and SVM with a radial-basis function (RBF) kernel. Classifiers were trained and tested with “balanced” datasets (n_ Type  = 2860, n_ SeverityLevel  = 1160) from a state-wide incident reporting system. Testing was also undertaken with imbalanced “stratified” datasets (n_ Type  = 6000, n_ SeverityLevel =5950) from the state-wide system and an independent hospital reporting system. Classifier performance was evaluated using a confusion matrix, as well as F-score, precision and recall.

Results

The most effective combination was a OvsO ensemble of binary SVM RBF classifiers with binary count feature extraction. For incident type, classifiers performed well on balanced and stratified datasets (F-score: 78.3, 73.9%), but were worse on independent datasets (68.5%). Reports about falls, medications, pressure injury, aggression and blood products were identified with high recall and precision. “Documentation” was the hardest type to identify. For severity level, F-score for severity assessment code (SAC) 1 (extreme risk) was 87.3 and 64% for SAC4 (low risk) on balanced data. With stratified data, high recall was achieved for SAC1 (82.8–84%) but precision was poor (6.8–11.2%). High risk incidents (SAC2) were confused with medium risk incidents (SAC3).

Conclusions

Binary classifier ensembles appear to be a feasible method for identifying incidents by type and severity level. Automated identification should enable safety problems to be detected and addressed in a more timely manner. Multi-label classifiers may be necessary for reports that relate to more than one incident type.
Appendix
Available only for authorised users
Literature
1.
go back to reference Rafter N, Hickey A, Condell S, et al. Adverse events in healthcare: learning from mistakes. QJM. 2015;108(4):273–7.CrossRefPubMed Rafter N, Hickey A, Condell S, et al. Adverse events in healthcare: learning from mistakes. QJM. 2015;108(4):273–7.CrossRefPubMed
2.
go back to reference Runciman B, Walton M. Safety and ethics in healthcare: a guide to getting it right. London: Ashgate Publishing Ltd; 2007. Runciman B, Walton M. Safety and ethics in healthcare: a guide to getting it right. London: Ashgate Publishing Ltd; 2007.
3.
go back to reference Mitchell I, Schuster A, Smith K, et al. Patient safety incident reporting: a qualitative study of thoughts and perceptions of experts 15 years after ‘To Err is Human’. BMJ Qual Saf. 2015;25(2):92–9.CrossRefPubMed Mitchell I, Schuster A, Smith K, et al. Patient safety incident reporting: a qualitative study of thoughts and perceptions of experts 15 years after ‘To Err is Human’. BMJ Qual Saf. 2015;25(2):92–9.CrossRefPubMed
5.
go back to reference Magrabi F, Ong MS, Runciman W, et al. Using FDA reports to inform a classification for health information technology safety problems. J Am Med Inform Assoc. 2012;19(1):45–53.CrossRefPubMed Magrabi F, Ong MS, Runciman W, et al. Using FDA reports to inform a classification for health information technology safety problems. J Am Med Inform Assoc. 2012;19(1):45–53.CrossRefPubMed
6.
go back to reference Magrabi F, Liaw ST, Arachi D, et al. Identifying patient safety problems associated with information technology in general practice: an analysis of incident reports. BMJ Qual Saf. 2015;25:870–80.CrossRefPubMed Magrabi F, Liaw ST, Arachi D, et al. Identifying patient safety problems associated with information technology in general practice: an analysis of incident reports. BMJ Qual Saf. 2015;25:870–80.CrossRefPubMed
7.
go back to reference Magrabi F, Baker M, Sinha I, et al. Clinical safety of England’s national programme for IT: a retrospective analysis of all reported safety events 2005 to 2011. Int J Med Inform. 2015;84(3):198–206.CrossRefPubMed Magrabi F, Baker M, Sinha I, et al. Clinical safety of England’s national programme for IT: a retrospective analysis of all reported safety events 2005 to 2011. Int J Med Inform. 2015;84(3):198–206.CrossRefPubMed
8.
go back to reference Runciman W, Hibbert P, Thomson R, et al. Towards an International Classification for Patient Safety: key concepts and terms. Int J Qual Health Care. 2009;21(1):18–26.CrossRefPubMedPubMedCentral Runciman W, Hibbert P, Thomson R, et al. Towards an International Classification for Patient Safety: key concepts and terms. Int J Qual Health Care. 2009;21(1):18–26.CrossRefPubMedPubMedCentral
9.
go back to reference Pronovost PJ, Morlock LL, Sexton JB, et al. Improving the value of patient safety reporting systems. In: Henriksen K, Battles JB, Keyes MA, Grady ML, editors. Advances in patient safety: New directions and alternative approaches (Vol 1: assessment). Rockville: Advances in Patient Safety; 2008. Pronovost PJ, Morlock LL, Sexton JB, et al. Improving the value of patient safety reporting systems. In: Henriksen K, Battles JB, Keyes MA, Grady ML, editors. Advances in patient safety: New directions and alternative approaches (Vol 1: assessment). Rockville: Advances in Patient Safety; 2008.
10.
go back to reference Travaglia JF, Westbrook MT, Braithwaite J. Implementation of a patient safety incident management system as viewed by doctors, nurses and allied health professionals. Health (London). 2009;13(3):277–96.CrossRef Travaglia JF, Westbrook MT, Braithwaite J. Implementation of a patient safety incident management system as viewed by doctors, nurses and allied health professionals. Health (London). 2009;13(3):277–96.CrossRef
11.
go back to reference Westbrook JI, Li L, Lehnbom EC, et al. What are incident reports telling us? a comparative study at two Australian hospitals of medication errors identified at audit, detected by staff and reported to an incident system. Int J Qual Health Care. 2015;27(1):1–9.CrossRefPubMedPubMedCentral Westbrook JI, Li L, Lehnbom EC, et al. What are incident reports telling us? a comparative study at two Australian hospitals of medication errors identified at audit, detected by staff and reported to an incident system. Int J Qual Health Care. 2015;27(1):1–9.CrossRefPubMedPubMedCentral
12.
13.
go back to reference Williams SD, Ashcroft DM. Medication errors: how reliable are the severity ratings reported to the national reporting and learning system? Int J Qual Health Care. 2009;21(5):316–20.CrossRefPubMed Williams SD, Ashcroft DM. Medication errors: how reliable are the severity ratings reported to the national reporting and learning system? Int J Qual Health Care. 2009;21(5):316–20.CrossRefPubMed
14.
go back to reference Haines TP, Massey B, Varghese P, et al. Inconsistency in classification and reporting of in-hospital falls. J Am Geriatr Soc. 2009;57(3):517–23.CrossRefPubMed Haines TP, Massey B, Varghese P, et al. Inconsistency in classification and reporting of in-hospital falls. J Am Geriatr Soc. 2009;57(3):517–23.CrossRefPubMed
15.
go back to reference Centre for Clinical Governance Research in Health. Evaluation of the safety improvement program in New South Wales: study no 6 report on program outcomes. Sydney: University of New South Wales, Centre for Clinical Governance Research in Health; 2005. Centre for Clinical Governance Research in Health. Evaluation of the safety improvement program in New South Wales: study no 6 report on program outcomes. Sydney: University of New South Wales, Centre for Clinical Governance Research in Health; 2005.
16.
go back to reference Gong Y. Data consistency in a voluntary medical incident reporting system. J Med Syst. 2011;35(4):609–15.CrossRefPubMed Gong Y. Data consistency in a voluntary medical incident reporting system. J Med Syst. 2011;35(4):609–15.CrossRefPubMed
17.
go back to reference Warrer P, Hansen EH, Juhl-Jensen L, et al. Using text-mining techniques in electronic patient records to identify ADRs from medicine use. Brit J Clin Pharmacol. 2012;73(5):674–84.CrossRef Warrer P, Hansen EH, Juhl-Jensen L, et al. Using text-mining techniques in electronic patient records to identify ADRs from medicine use. Brit J Clin Pharmacol. 2012;73(5):674–84.CrossRef
18.
go back to reference D’Avolio LW, Litwin MS, Rogers Jr SO, et al. Automatic identification and classification of surgical margin status from pathology reports following prostate cancer surgery. AMIA Annu Symp Proc. 2007;11:160–4. D’Avolio LW, Litwin MS, Rogers Jr SO, et al. Automatic identification and classification of surgical margin status from pathology reports following prostate cancer surgery. AMIA Annu Symp Proc. 2007;11:160–4.
19.
go back to reference Liu V, Clark MP, Mendoza M, et al. Automated identification of pneumonia in chest radiograph reports in critically ill patients. BMC Med Inform Decis Mak. 2013;13:90.CrossRefPubMedPubMedCentral Liu V, Clark MP, Mendoza M, et al. Automated identification of pneumonia in chest radiograph reports in critically ill patients. BMC Med Inform Decis Mak. 2013;13:90.CrossRefPubMedPubMedCentral
20.
go back to reference Fong A, Hettinger AZ, Ratwani RM. Exploring methods for identifying related patient safety events using structured and unstructured data. J Biomed Inform. 2015;58:89–95.CrossRefPubMed Fong A, Hettinger AZ, Ratwani RM. Exploring methods for identifying related patient safety events using structured and unstructured data. J Biomed Inform. 2015;58:89–95.CrossRefPubMed
21.
go back to reference Fong A, Ratwani R. An evaluation of patient safety event report categories using unsupervised topic modeling. Methods Inf Med. 2015;54(4):338–45.CrossRefPubMed Fong A, Ratwani R. An evaluation of patient safety event report categories using unsupervised topic modeling. Methods Inf Med. 2015;54(4):338–45.CrossRefPubMed
22.
go back to reference Ratwani RM, Fong A. ‘Connecting the dots’: leveraging visual analytics to make sense of patient safety event reports. J Am Med Inform Assoc. 2015;22(2):312–7.PubMed Ratwani RM, Fong A. ‘Connecting the dots’: leveraging visual analytics to make sense of patient safety event reports. J Am Med Inform Assoc. 2015;22(2):312–7.PubMed
23.
go back to reference Ong MS, Magrabi F, Coiera E. Automated categorisation of clinical incident reports using statistical text classification. Qual Saf Health Care. 2010;19(6):e55.PubMed Ong MS, Magrabi F, Coiera E. Automated categorisation of clinical incident reports using statistical text classification. Qual Saf Health Care. 2010;19(6):e55.PubMed
24.
go back to reference Chai KEK, Anthony S, Coiera E, et al. Using statistical text classification to identify health information technology incidents. J Am Med Inform Assoc. 2013;20(5):980–5.CrossRefPubMedPubMedCentral Chai KEK, Anthony S, Coiera E, et al. Using statistical text classification to identify health information technology incidents. J Am Med Inform Assoc. 2013;20(5):980–5.CrossRefPubMedPubMedCentral
25.
go back to reference Ong MS, Magrabi F, Coiera E. Automated identification of extreme-risk events in clinical incident reports. J Am Med Inform Assoc. 2012;19(e1):e110–8.CrossRefPubMedPubMedCentral Ong MS, Magrabi F, Coiera E. Automated identification of extreme-risk events in clinical incident reports. J Am Med Inform Assoc. 2012;19(e1):e110–8.CrossRefPubMedPubMedCentral
26.
go back to reference Marella WM, Sparnon E, Finley E. Screening Electronic Health Record-Related Patient Safety Reports Using Machine Learning. J Patient Saf. 2017;13(1):31-36. Marella WM, Sparnon E, Finley E. Screening Electronic Health Record-Related Patient Safety Reports Using Machine Learning. J Patient Saf. 2017;13(1):31-36.
27.
go back to reference Japkowicz N. The class imbalance problem: Significance and strategies. Ic-Ai’2000: Proceedings of the International Conference on Artificial Intelligence, Vol 1-Iii. 2000:111–7. Japkowicz N. The class imbalance problem: Significance and strategies. Ic-Ai’2000: Proceedings of the International Conference on Artificial Intelligence, Vol 1-Iii. 2000:111–7.
28.
go back to reference Lin HY. Efficient classifiers for multi-class classification problems. Decis Support Syst. 2012;53(3):473–81.CrossRef Lin HY. Efficient classifiers for multi-class classification problems. Decis Support Syst. 2012;53(3):473–81.CrossRef
30.
go back to reference Towards national indicators of safety and quality in health care. Australian Institute of Health and Welfare. 2009. Towards national indicators of safety and quality in health care. Australian Institute of Health and Welfare. 2009.
32.
go back to reference Galar M, Fernandez A, Barrenechea E, et al. An overview of ensemble methods for binary classifiers in multi-class problems: experimental study on one-vs-one and one-vs-all schemes. Pattern Recogn. 2011;44(8):1761–76.CrossRef Galar M, Fernandez A, Barrenechea E, et al. An overview of ensemble methods for binary classifiers in multi-class problems: experimental study on one-vs-one and one-vs-all schemes. Pattern Recogn. 2011;44(8):1761–76.CrossRef
33.
go back to reference Runciman WB, Williamson JA, Deakin A, et al. An integrated framework for safety, quality and risk management: an information and incident management system based on a universal patient safety classification. Qual Saf Health Care. 2006;15 Suppl 1:i82–90.CrossRefPubMedPubMedCentral Runciman WB, Williamson JA, Deakin A, et al. An integrated framework for safety, quality and risk management: an information and incident management system based on a universal patient safety classification. Qual Saf Health Care. 2006;15 Suppl 1:i82–90.CrossRefPubMedPubMedCentral
35.
go back to reference Bagian JP, Lee C, Gosbee J, et al. Developing and deploying a patient safety program in a large health care delivery system: you can’t fix what you don’t know about. Jt Comm J Qual Improv. 2001;27(10):522–32.PubMed Bagian JP, Lee C, Gosbee J, et al. Developing and deploying a patient safety program in a large health care delivery system: you can’t fix what you don’t know about. Jt Comm J Qual Improv. 2001;27(10):522–32.PubMed
36.
go back to reference Platt JC, Cristianini N, Shawe-Taylor J. Large margin DAGs for multiclass classification. Adv Neural Inf Proces Syst. 2000;12:547–53. Platt JC, Cristianini N, Shawe-Taylor J. Large margin DAGs for multiclass classification. Adv Neural Inf Proces Syst. 2000;12:547–53.
37.
go back to reference Korenius T, Laurikkala J, Järvelin K, et al., editors. Stemming and lemmatization in the clustering of finnish text documents. Proceedings of the thirteenth ACM international conference on Information and knowledge management; 2004: ACM. Korenius T, Laurikkala J, Järvelin K, et al., editors. Stemming and lemmatization in the clustering of finnish text documents. Proceedings of the thirteenth ACM international conference on Information and knowledge management; 2004: ACM.
38.
go back to reference Sivic J, Zisserman A. Efficient visual search of videos cast as text retrieval. EEE Trans Pattern Anal Mach Intell. 2009;31(4):591–606.CrossRef Sivic J, Zisserman A. Efficient visual search of videos cast as text retrieval. EEE Trans Pattern Anal Mach Intell. 2009;31(4):591–606.CrossRef
39.
go back to reference Robertson S. Understanding inverse document frequency: on theoretical arguments for IDF. J Doc. 2004;60(5):503–20.CrossRef Robertson S. Understanding inverse document frequency: on theoretical arguments for IDF. J Doc. 2004;60(5):503–20.CrossRef
40.
go back to reference Black D. On the rationale of group decision-making. J Polit Econ. 1948;56:23–34.CrossRef Black D. On the rationale of group decision-making. J Polit Econ. 1948;56:23–34.CrossRef
41.
go back to reference Sokolova M, Lapalme G. A systematic analysis of performance measures for classification tasks. Inf Process Manag. 2009;45(4):427–37.CrossRef Sokolova M, Lapalme G. A systematic analysis of performance measures for classification tasks. Inf Process Manag. 2009;45(4):427–37.CrossRef
43.
go back to reference Webb RK, Currie M, Morgan CA, et al. The Australian incident monitoring study: an analysis of 2000 incident reports. Anaesth Intensive Care. 1993;21(5):520–8.PubMed Webb RK, Currie M, Morgan CA, et al. The Australian incident monitoring study: an analysis of 2000 incident reports. Anaesth Intensive Care. 1993;21(5):520–8.PubMed
44.
45.
go back to reference Chen Y, Mani S, Xu H. Applying active learning to assertion classification of concepts in clinical text. J Biomed Inform. 2012;45(2):265–72.CrossRefPubMed Chen Y, Mani S, Xu H. Applying active learning to assertion classification of concepts in clinical text. J Biomed Inform. 2012;45(2):265–72.CrossRefPubMed
46.
go back to reference Doyle S, Monaco J, Feldman M, et al. An active learning based classification strategy for the minority class problem: application to histopathology annotation. BMC Bioinformatics. 2011;12:424.CrossRefPubMedPubMedCentral Doyle S, Monaco J, Feldman M, et al. An active learning based classification strategy for the minority class problem: application to histopathology annotation. BMC Bioinformatics. 2011;12:424.CrossRefPubMedPubMedCentral
47.
go back to reference Branco P, Torgo L, Ribeiro RP. A survey of predictive modeling on imbalanced domains. ACM Comput Surv. 2016;49(2):1–50.CrossRef Branco P, Torgo L, Ribeiro RP. A survey of predictive modeling on imbalanced domains. ACM Comput Surv. 2016;49(2):1–50.CrossRef
50.
go back to reference Sarker A, Gonzalez G. Portable automatic text classification for adverse drug reaction detection via multi-corpus training. J Biomed Inform. 2015;53:196–207.CrossRefPubMed Sarker A, Gonzalez G. Portable automatic text classification for adverse drug reaction detection via multi-corpus training. J Biomed Inform. 2015;53:196–207.CrossRefPubMed
51.
go back to reference Wang SF, Wang J, Wang ZY, et al. Enhancing multi-label classification by modeling dependencies among labels. Pattern Recogn. 2014;47(10):3405–13.CrossRef Wang SF, Wang J, Wang ZY, et al. Enhancing multi-label classification by modeling dependencies among labels. Pattern Recogn. 2014;47(10):3405–13.CrossRef
Metadata
Title
Using multiclass classification to automate the identification of patient safety incident reports by type and severity
Authors
Ying Wang
Enrico Coiera
William Runciman
Farah Magrabi
Publication date
01-12-2017
Publisher
BioMed Central
Published in
BMC Medical Informatics and Decision Making / Issue 1/2017
Electronic ISSN: 1472-6947
DOI
https://doi.org/10.1186/s12911-017-0483-8

Other articles of this Issue 1/2017

BMC Medical Informatics and Decision Making 1/2017 Go to the issue