Skip to main content
Top
Published in: BMC Medical Informatics and Decision Making 3/2020

Open Access 01-07-2020 | Ultrasound | Research

Identifying diagnosis evidence of cardiogenic stroke from Chinese echocardiograph reports

Published in: BMC Medical Informatics and Decision Making | Special Issue 3/2020

Login to get access

Abstract

Background

Cardiogenic stroke has increasing morbidity in China and brought economic burden to patient families. In cardiogenic stroke diagnosis, echocardiograph examination is one of the most important examinations. Sonographers will investigate patients’ heart via echocardiograph, and describe them in the echocardiograph reports. In this study, we developed a machine learning model to automatically identify diagnosis evidences of cardiogenic stroke providing to neurologist for clinical decision making.

Methods

We collected 4188 Chinese echocardiograph reports of 4018 patients, with average length 177 Chinese characters in free-text style. Collaborating with neurologists and sonographers, we summarized 149 phrases on diagnosis evidence of cardiogenic stroke such as “二尖瓣重度狭窄” (severe mitral stenosis), “主动脉瓣退行性变” (aortic valve degeneration) and so on. Furthermore, we developed an annotated corpus via mapping 149 phrases to the 4188 reports. We selected 11 most frequent diagnosis evidence types such as “二尖瓣狭窄” (mitral stenosis) for further identifying. The generated corpus is divided into training set and testing set in the ratio of 8:2, which is used to train and validate a machine learning model to identify the evidence of cardiogenic stroke using BiLSTM-CRF algorithm.

Results

Our machine learning method achieved the average performance on the diagnosis evidence identification is 98.03, 90.17 and 93.94% respectively. In addition, our method is capable to identify the novel diagnosis evidence of cardiogenic stroke description such as “二尖瓣中-重度狭窄” (mitral stenosis), “主动脉瓣退行性病变” (aortic valve calcification) et al.

Conclusions

In this study, we analyze the structure of the echocardiograph reports and summarized 149 phrases on diagnosis evidence of cardiogenic stroke. We use the phrases to generate an annotated corpus automatically, which greatly reduces the cost of manual annotation. The model trained based on the corpus also has a good performance on the testing set. The method of automatically identifying diagnosis evidence of cardiogenic stroke proposed in this study will be further refined in the practice.
Literature
1.
go back to reference Wang D, Liu J, Liu M, Lu C, Brainin M, Zhang J. Patterns of stroke between University hospitals and nonuniversity hospitals in mainland China: prospective multicenter hospital-based registry study. World Neurosurg. 2017;98(98):258–65.CrossRef Wang D, Liu J, Liu M, Lu C, Brainin M, Zhang J. Patterns of stroke between University hospitals and nonuniversity hospitals in mainland China: prospective multicenter hospital-based registry study. World Neurosurg. 2017;98(98):258–65.CrossRef
2.
go back to reference Wang W, Jiang B, Sun H, Ru X, Sun D, Wang L, et al. Prevalence, incidence, and mortality of stroke in China. Circulation. 2017;135(8):759. Wang W, Jiang B, Sun H, Ru X, Sun D, Wang L, et al. Prevalence, incidence, and mortality of stroke in China. Circulation. 2017;135(8):759.
3.
go back to reference Kim AS, Cahill EA, Cheng NT. Global Stroke Belt: geographic variation in stroke burden worldwide. Stroke. 2015;46(12):3564–70.CrossRef Kim AS, Cahill EA, Cheng NT. Global Stroke Belt: geographic variation in stroke burden worldwide. Stroke. 2015;46(12):3564–70.CrossRef
4.
go back to reference Moran AE, Gu D, Zhao D, Coxson PG, Wang YC, Chen C, et al. Future cardiovascular disease in China Markov model and risk factor scenario projections from the coronary heart disease policy model–China. Circ-Cardiovasc Qual. 2010;3(3):243–52.CrossRef Moran AE, Gu D, Zhao D, Coxson PG, Wang YC, Chen C, et al. Future cardiovascular disease in China Markov model and risk factor scenario projections from the coronary heart disease policy model–China. Circ-Cardiovasc Qual. 2010;3(3):243–52.CrossRef
7.
go back to reference Chen N, Zhou M, Wang Y, Wang H, Yang M, Guo J, et al. Inter-rater reliability of the A-S-C-O classification system for ischemic stroke. J Clin Neurosci. 2013;20(3):410–2.CrossRef Chen N, Zhou M, Wang Y, Wang H, Yang M, Guo J, et al. Inter-rater reliability of the A-S-C-O classification system for ischemic stroke. J Clin Neurosci. 2013;20(3):410–2.CrossRef
8.
go back to reference Ay H, Furie KL, Singhal AB, Smith WS, Sorensen AG, Koroshetz WJ. An evidence-based causative classification system for acute ischemic stroke. Ann Neurol. 2005;58(5):688–97.CrossRef Ay H, Furie KL, Singhal AB, Smith WS, Sorensen AG, Koroshetz WJ. An evidence-based causative classification system for acute ischemic stroke. Ann Neurol. 2005;58(5):688–97.CrossRef
9.
go back to reference Amarenco P, Bogousslavsky J, Caplan LR, Donnan GA, Wolf ME, Hennerici MG. The ASCOD Phenotyping of ischemic stroke (updated ASCO Phenotyping). Cerebrovasc Dis. 2013;36(1):1–5.CrossRef Amarenco P, Bogousslavsky J, Caplan LR, Donnan GA, Wolf ME, Hennerici MG. The ASCOD Phenotyping of ischemic stroke (updated ASCO Phenotyping). Cerebrovasc Dis. 2013;36(1):1–5.CrossRef
10.
go back to reference Amarenco P, Bogousslavsky J, Caplan LR, Donnan GA, Hennerici MG. New approach to stroke subtyping: the A-S-C-O (phenotypic) classification of stroke. Cerebrovasc Dis. 2009;27(5):502–8.CrossRef Amarenco P, Bogousslavsky J, Caplan LR, Donnan GA, Hennerici MG. New approach to stroke subtyping: the A-S-C-O (phenotypic) classification of stroke. Cerebrovasc Dis. 2009;27(5):502–8.CrossRef
11.
go back to reference Gao S, Wang Y, Xu A, Li Y, Wang D. Chinese ischemic stroke subclassification. Front Neurol. 2011;2:6.CrossRef Gao S, Wang Y, Xu A, Li Y, Wang D. Chinese ischemic stroke subclassification. Front Neurol. 2011;2:6.CrossRef
12.
go back to reference Zhao J, Yao Y, Sang M. Research Progress in diagnosis, prevention and treatment of cardiogenic stroke. Chin J Geriatr Heart Brain Vessel Viseases. 2017;19(1):94–6. Zhao J, Yao Y, Sang M. Research Progress in diagnosis, prevention and treatment of cardiogenic stroke. Chin J Geriatr Heart Brain Vessel Viseases. 2017;19(1):94–6.
13.
go back to reference Association CS. Guidelines for clinical Management of Cerebrovascular Diseases in China. Beijing: People’s Medical Publishing House; 2019. Association CS. Guidelines for clinical Management of Cerebrovascular Diseases in China. Beijing: People’s Medical Publishing House; 2019.
14.
go back to reference He J, Baxter SL, Xu J, Xu J, Zhou X, Zhang K. The practical implementation of artificial intelligence technologies in medicine. Nat Med. 2019;25(1):30–6.CrossRef He J, Baxter SL, Xu J, Xu J, Zhou X, Zhang K. The practical implementation of artificial intelligence technologies in medicine. Nat Med. 2019;25(1):30–6.CrossRef
15.
go back to reference Edwards H, Smith J, Weston MJ. What makes a good ultrasound report. Ultrasound. 2014;22(1):57–60.CrossRef Edwards H, Smith J, Weston MJ. What makes a good ultrasound report. Ultrasound. 2014;22(1):57–60.CrossRef
16.
go back to reference Cao Z, Duan L, Yang G, Yue T, Chen Q, Fu H, et al. Breast tumor detection in ultrasound images using deep learning. In: International Workshop on Patch-based Techniques in Medical Imaging. Berlin: Springer; 2017. Cao Z, Duan L, Yang G, Yue T, Chen Q, Fu H, et al. Breast tumor detection in ultrasound images using deep learning. In: International Workshop on Patch-based Techniques in Medical Imaging. Berlin: Springer; 2017.
17.
go back to reference Yap MH, Pons G, Marti J, Ganau S, Sentis M, Zwiggelaar R, et al. Automated breast ultrasound lesions detection using convolutional neural networks. IEEE J Biomed Health Informatics. 2018;22(4):1218–26.CrossRef Yap MH, Pons G, Marti J, Ganau S, Sentis M, Zwiggelaar R, et al. Automated breast ultrasound lesions detection using convolutional neural networks. IEEE J Biomed Health Informatics. 2018;22(4):1218–26.CrossRef
18.
go back to reference Carrodeguas E, Lacson R, Swanson W, Khorasani R. Use of machine learning to identify follow-up recommendations in radiology reports. J Am Coll Radiol. 2019;16(3):336–43.CrossRef Carrodeguas E, Lacson R, Swanson W, Khorasani R. Use of machine learning to identify follow-up recommendations in radiology reports. J Am Coll Radiol. 2019;16(3):336–43.CrossRef
19.
go back to reference Zulkarnain NZ. A medical ultrasound reporting system based on domain ontology: University of Salford; 2017. Zulkarnain NZ. A medical ultrasound reporting system based on domain ontology: University of Salford; 2017.
20.
go back to reference Zeng X-H, Liu B-G, Zhou M. Understanding and generating ultrasound image description. J Comput Sci Technol. 2018;33(5):1086–100.CrossRef Zeng X-H, Liu B-G, Zhou M. Understanding and generating ultrasound image description. J Comput Sci Technol. 2018;33(5):1086–100.CrossRef
21.
go back to reference Kluegl P, Toepfer M, Beck P-D, Fette G, Puppe F. UIMA Ruta: rapid development of rule-based information extraction applications. Nat Lang Eng. 2016;22(1):1–40.CrossRef Kluegl P, Toepfer M, Beck P-D, Fette G, Puppe F. UIMA Ruta: rapid development of rule-based information extraction applications. Nat Lang Eng. 2016;22(1):1–40.CrossRef
22.
go back to reference Miao S, Xu T, Wu Y, Xie H, Wang J, Jing S, et al. Extraction of BI-RADS findings from breast ultrasound reports in Chinese using deep learning approaches. Int J Med Inform. 2018;119:17–21.CrossRef Miao S, Xu T, Wu Y, Xie H, Wang J, Jing S, et al. Extraction of BI-RADS findings from breast ultrasound reports in Chinese using deep learning approaches. Int J Med Inform. 2018;119:17–21.CrossRef
23.
go back to reference Chen P, Liu Q, Wei L, Zhao B, Jia Y, Lv H, et al. Automatically structuring on Chinese ultrasound report of cerebrovascular diseases via natural language processing. IEEE Access. 2019;7:89043–50.CrossRef Chen P, Liu Q, Wei L, Zhao B, Jia Y, Lv H, et al. Automatically structuring on Chinese ultrasound report of cerebrovascular diseases via natural language processing. IEEE Access. 2019;7:89043–50.CrossRef
24.
go back to reference Komiya K, Suzuki M, Iwakura T, Sasaki M, Shinnou H. Comparison of methods to annotate named entity corpora. ACM Trans Asian Low-Resource Language Inf Process. 2018;17(4):34. Komiya K, Suzuki M, Iwakura T, Sasaki M, Shinnou H. Comparison of methods to annotate named entity corpora. ACM Trans Asian Low-Resource Language Inf Process. 2018;17(4):34.
25.
go back to reference Adams HP Jr, Bendixen BH, Kappelle LJ, Biller J, Love BB, Gordon DL, et al. Classification of subtype of acute ischemic stroke. Definitions for use in a multicenter clinical trial. TOAST. Trial of org 10172 in acute stroke treatment. Stroke. 1993;24(1):35–41.CrossRef Adams HP Jr, Bendixen BH, Kappelle LJ, Biller J, Love BB, Gordon DL, et al. Classification of subtype of acute ischemic stroke. Definitions for use in a multicenter clinical trial. TOAST. Trial of org 10172 in acute stroke treatment. Stroke. 1993;24(1):35–41.CrossRef
26.
go back to reference Saric M, Armour AC, Arnaout MS, Chaudhry FA, Grimm RA, Kronzon I, et al. Guidelines for the use of echocardiography in the evaluation of a cardiac source of embolism. J Am Soc Echocardiogr. 2016;29(1):1–42.CrossRef Saric M, Armour AC, Arnaout MS, Chaudhry FA, Grimm RA, Kronzon I, et al. Guidelines for the use of echocardiography in the evaluation of a cardiac source of embolism. J Am Soc Echocardiogr. 2016;29(1):1–42.CrossRef
27.
go back to reference Du Y. Research on word-vector-representation-based new word discovery and name entity identification: University of Electronic Science and Technology of China; 2017. Du Y. Research on word-vector-representation-based new word discovery and name entity identification: University of Electronic Science and Technology of China; 2017.
Metadata
Title
Identifying diagnosis evidence of cardiogenic stroke from Chinese echocardiograph reports
Publication date
01-07-2020

Other articles of this Special Issue 3/2020

BMC Medical Informatics and Decision Making 3/2020 Go to the issue