Skip to main content
Top
Published in: European Journal of Medical Research 1/2023

Open Access 01-12-2023 | Stroke | Research

Common clinical blood and urine biomarkers for ischemic stroke: an Estonian Electronic Health Records database study

Authors: Siim Kurvits, Ainika Harro, Anu Reigo, Anne Ott, Sven Laur, Dage Särg, Ardi Tampuu, Kaur Alasoo, Jaak Vilo, Lili Milani, Toomas Haller, the Estonian Biobank Research Team, the PRECISE4Q consortium

Published in: European Journal of Medical Research | Issue 1/2023

Login to get access

Abstract

Background

Ischemic stroke (IS) is a major health risk without generally usable effective measures of primary prevention. Early warning signals that are easy to detect and widely available can save lives. Estonia has one nation-wide Electronic Health Record (EHR) database for the storage of medical information of patients from hospitals and primary care providers.

Methods

We extracted structured and unstructured data from the EHRs of participants of the Estonian Biobank (EstBB) and evaluated different formats of input data to understand how this continuously growing dataset should be prepared for best prediction. The utility of the EHR database for finding blood- and urine-based biomarkers for IS was demonstrated by applying different analytical and machine learning (ML) methods.

Results

Several early trends in common clinical laboratory parameter changes (set of red blood indices, lymphocyte/neutrophil ratio, etc.) were established for IS prediction. The developed ML models predicted the future occurrence of IS with very high accuracy and Random Forests was proved as the most applicable method to EHR data.

Conclusions

We conclude that the EHR database and the risk factors uncovered are valuable resources in screening the population for risk of IS as well as constructing disease risk scores and refining prediction models for IS by ML.
Appendix
Available only for authorised users
Literature
1.
go back to reference GBD 2019 Stroke Collaborators. Global, regional, and national burden of stroke and its risk factors, 1990–2019: a systematic analysis for the Global Burden of Disease Study 2019. Lancet Neurol. 2021;20:795–820.CrossRef GBD 2019 Stroke Collaborators. Global, regional, and national burden of stroke and its risk factors, 1990–2019: a systematic analysis for the Global Burden of Disease Study 2019. Lancet Neurol. 2021;20:795–820.CrossRef
2.
go back to reference Adams HP, et al. Classification of subtype of acute ischemic stroke. Definitions for use in a multicenter clinical trial. TOAST. Trial of Org 10172 in Acute Stroke Treatment. Stroke. 1993;24:35–41.PubMedCrossRef Adams HP, et al. Classification of subtype of acute ischemic stroke. Definitions for use in a multicenter clinical trial. TOAST. Trial of Org 10172 in Acute Stroke Treatment. Stroke. 1993;24:35–41.PubMedCrossRef
3.
go back to reference Dichgans M, Pulit SL, Rosand J. Stroke genetics: discovery, biology, and clinical applications. Lancet Neurol. 2019;18:587–99.PubMedCrossRef Dichgans M, Pulit SL, Rosand J. Stroke genetics: discovery, biology, and clinical applications. Lancet Neurol. 2019;18:587–99.PubMedCrossRef
4.
go back to reference Malik R, et al. Multiancestry genome-wide association study of 520,000 subjects identifies 32 loci associated with stroke and stroke subtypes. Nat Genet. 2019;51:1192–3.PubMedCrossRef Malik R, et al. Multiancestry genome-wide association study of 520,000 subjects identifies 32 loci associated with stroke and stroke subtypes. Nat Genet. 2019;51:1192–3.PubMedCrossRef
5.
go back to reference Keene KL, et al. Genome-wide association study meta-analysis of stroke in 22 000 individuals of African descent identifies novel associations with stroke. Stroke. 2020;51:2454–63.PubMedPubMedCentralCrossRef Keene KL, et al. Genome-wide association study meta-analysis of stroke in 22 000 individuals of African descent identifies novel associations with stroke. Stroke. 2020;51:2454–63.PubMedPubMedCentralCrossRef
7.
go back to reference Ren H, Liu X, Wang L, Gao Y. Lymphocyte-to-monocyte ratio: a novel predictor of the prognosis of acute ischemic stroke. J Stroke Cerebrovasc Dis. 2017;26:2595–602.PubMedCrossRef Ren H, Liu X, Wang L, Gao Y. Lymphocyte-to-monocyte ratio: a novel predictor of the prognosis of acute ischemic stroke. J Stroke Cerebrovasc Dis. 2017;26:2595–602.PubMedCrossRef
8.
go back to reference Kim H, et al. Elevated blood urea nitrogen/creatinine ratio is associated with venous thromboembolism in patients with acute ischemic stroke. J Korean Neurosurg Soc. 2017;60:620–6.PubMedPubMedCentralCrossRef Kim H, et al. Elevated blood urea nitrogen/creatinine ratio is associated with venous thromboembolism in patients with acute ischemic stroke. J Korean Neurosurg Soc. 2017;60:620–6.PubMedPubMedCentralCrossRef
10.
go back to reference Sadeghi F, et al. Platelet count and mean volume in acute stroke: a systematic review and meta-analysis. Platelets. 2019;31:731–9.PubMedCrossRef Sadeghi F, et al. Platelet count and mean volume in acute stroke: a systematic review and meta-analysis. Platelets. 2019;31:731–9.PubMedCrossRef
11.
go back to reference Marini S, Georgakis MK, Anderson CD. Interactions between kidney function and cerebrovascular disease: vessel pathology that fires together wires together. Front Neurol. 2021;12:1.CrossRef Marini S, Georgakis MK, Anderson CD. Interactions between kidney function and cerebrovascular disease: vessel pathology that fires together wires together. Front Neurol. 2021;12:1.CrossRef
12.
13.
go back to reference Ravioli S, et al. Risk of electrolyte disorders, syncope, and falls in patients taking thiazide diuretics: results of a cross-sectional study. Am J Med. 2021;134:1148–54.PubMedCrossRef Ravioli S, et al. Risk of electrolyte disorders, syncope, and falls in patients taking thiazide diuretics: results of a cross-sectional study. Am J Med. 2021;134:1148–54.PubMedCrossRef
14.
go back to reference Tóth OM, et al. Tissue acidosis associated with ischemic stroke to guide neuroprotective drug delivery. Biology. 2020;9:460.CrossRef Tóth OM, et al. Tissue acidosis associated with ischemic stroke to guide neuroprotective drug delivery. Biology. 2020;9:460.CrossRef
15.
go back to reference Diener HC, Hankey GJ. Primary and secondary prevention of ischemic stroke and cerebral hemorrhage: JACC focus seminar. J Am Coll Cardiol. 2020;75:1804–18.PubMedCrossRef Diener HC, Hankey GJ. Primary and secondary prevention of ischemic stroke and cerebral hemorrhage: JACC focus seminar. J Am Coll Cardiol. 2020;75:1804–18.PubMedCrossRef
16.
go back to reference Prins BP, et al. Advances in genomic discovery and implications for personalized prevention and medicine: Estonia as example. J Personal Med. 2021;11:358.CrossRef Prins BP, et al. Advances in genomic discovery and implications for personalized prevention and medicine: Estonia as example. J Personal Med. 2021;11:358.CrossRef
19.
go back to reference Leitsalu L, et al. Cohort profile: Estonian Biobank of the Estonian Genome Center, University of Tartu. Int J Epidemiol. 2014;44:1137–47.PubMedCrossRef Leitsalu L, et al. Cohort profile: Estonian Biobank of the Estonian Genome Center, University of Tartu. Int J Epidemiol. 2014;44:1137–47.PubMedCrossRef
21.
go back to reference Yang Z, et al. Assessment of natural language processing methods for ascertaining the expanded disability status scale score from the electronic health records of patients with multiple sclerosis: algorithm development and validation study. JMIR Med Inform. 2022;10:1.CrossRef Yang Z, et al. Assessment of natural language processing methods for ascertaining the expanded disability status scale score from the electronic health records of patients with multiple sclerosis: algorithm development and validation study. JMIR Med Inform. 2022;10:1.CrossRef
25.
go back to reference McDonald CJ, Schadow G, Suico J, Overhage JM. Data standards in health care. Ann Emerg Med. 2001;38:303–11.PubMedCrossRef McDonald CJ, Schadow G, Suico J, Overhage JM. Data standards in health care. Ann Emerg Med. 2001;38:303–11.PubMedCrossRef
26.
go back to reference Borgelt C. An implementation of the FP-growth algorithm. In OSDM’05 Proceedings of the 1st International Workshop on Open Source Data Mining: Frequent Pattern Mining Implementations 1–5. ACM, New York (2005). Borgelt C. An implementation of the FP-growth algorithm. In OSDM’05 Proceedings of the 1st International Workshop on Open Source Data Mining: Frequent Pattern Mining Implementations 1–5. ACM, New York (2005).
30.
go back to reference Pedregosa F, et al. Scikit-learn: machine learning in Python. J Mach Learn Res. 2011;12:2825–30. Pedregosa F, et al. Scikit-learn: machine learning in Python. J Mach Learn Res. 2011;12:2825–30.
31.
33.
go back to reference Renna R, et al. Risk factor and etiology analysis of ischemic stroke in young adult patients. J Stroke Cerebrovasc Dis. 2014;23:1.CrossRef Renna R, et al. Risk factor and etiology analysis of ischemic stroke in young adult patients. J Stroke Cerebrovasc Dis. 2014;23:1.CrossRef
34.
go back to reference Lee H, et al. Machine learning approach to identify stroke within 4.5 hours. Stroke. 2020;51:860–6.PubMedCrossRef Lee H, et al. Machine learning approach to identify stroke within 4.5 hours. Stroke. 2020;51:860–6.PubMedCrossRef
35.
go back to reference Cunningham P, Carney J. Diversity versus quality in classification ensembles based on feature selection. In: López de Mántaras R, Plaza E, editors. Machine learning: ECML 2000. Lecture notes in computer science (Lecture notes in artificial intelligence), vol. 1810. New York: Springer; 2000. Cunningham P, Carney J. Diversity versus quality in classification ensembles based on feature selection. In: López de Mántaras R, Plaza E, editors. Machine learning: ECML 2000. Lecture notes in computer science (Lecture notes in artificial intelligence), vol. 1810. New York: Springer; 2000.
36.
go back to reference Gawali M, et al. Comparison of privacy-preserving distributed deep learning methods in healthcare. Med Image Understand Anal. 2021;2021:457–71.CrossRef Gawali M, et al. Comparison of privacy-preserving distributed deep learning methods in healthcare. Med Image Understand Anal. 2021;2021:457–71.CrossRef
37.
go back to reference Kong M, Zhang H, Cao X, Mao X, Lu Z. Higher level of neutrophil-to-lymphocyte is associated with severe COVID-19. Epidemiol Infect. 2020;148:1.CrossRef Kong M, Zhang H, Cao X, Mao X, Lu Z. Higher level of neutrophil-to-lymphocyte is associated with severe COVID-19. Epidemiol Infect. 2020;148:1.CrossRef
38.
go back to reference Forget, et al. What is the normal value of the neutrophil-to-lymphocyte ratio? BMC Res Notes. 2017;10:1.CrossRef Forget, et al. What is the normal value of the neutrophil-to-lymphocyte ratio? BMC Res Notes. 2017;10:1.CrossRef
39.
go back to reference Luo Y, et al. Early neutrophil-to-lymphocyte ratio is a prognostic marker in acute minor stroke or transient ischemic attack. Acta Neurol Belg. 2020;121:1415–21.PubMedCrossRef Luo Y, et al. Early neutrophil-to-lymphocyte ratio is a prognostic marker in acute minor stroke or transient ischemic attack. Acta Neurol Belg. 2020;121:1415–21.PubMedCrossRef
40.
go back to reference Zhang J, et al. Prognostic role of neutrophil-lyphocyte ratio in patients with acute ischemic stroke. Medicine (Baltimore). 2017;96:1. Zhang J, et al. Prognostic role of neutrophil-lyphocyte ratio in patients with acute ischemic stroke. Medicine (Baltimore). 2017;96:1.
41.
go back to reference Barlas RS, et al. Impact of hemoglobin levels and anemia on mortality in acute stroke: analysis of UK regional registry data, systematic review, and meta-analysis. J Am Heart Assoc. 2016;5:1.CrossRef Barlas RS, et al. Impact of hemoglobin levels and anemia on mortality in acute stroke: analysis of UK regional registry data, systematic review, and meta-analysis. J Am Heart Assoc. 2016;5:1.CrossRef
42.
43.
go back to reference Ciancarelli I, Amicis DD, Massimo CD, Pistarini C, Ciancarelli MGT. Mean platelet volume during ischemic stroke is a potential pro-inflammatory biomarker in the acute phase and during neurorehabilitation not directly linked to clinical outcome. Curr Neurovasc Res. 2016;13:177–83.PubMedCrossRef Ciancarelli I, Amicis DD, Massimo CD, Pistarini C, Ciancarelli MGT. Mean platelet volume during ischemic stroke is a potential pro-inflammatory biomarker in the acute phase and during neurorehabilitation not directly linked to clinical outcome. Curr Neurovasc Res. 2016;13:177–83.PubMedCrossRef
44.
go back to reference Oji S, et al. Mean platelet volume is associated with early neurological deterioration in patients with branch atheromatous disease: involvement of platelet activation. J Stroke Cerebrovasc Dis. 2018;27:1624–31.PubMedCrossRef Oji S, et al. Mean platelet volume is associated with early neurological deterioration in patients with branch atheromatous disease: involvement of platelet activation. J Stroke Cerebrovasc Dis. 2018;27:1624–31.PubMedCrossRef
46.
go back to reference Cinar BP, et al. Assessment of the relation between the neutrophil to lymphocyte ratio and severity of ischemic stroke in a large cohort. Int J Clin Pract. 2021;75:1. Cinar BP, et al. Assessment of the relation between the neutrophil to lymphocyte ratio and severity of ischemic stroke in a large cohort. Int J Clin Pract. 2021;75:1.
47.
go back to reference Huang Z, Fu Z, Huang W, Huang K. Prognostic value of neutrophil-to-lymphocyte ratio in sepsis: a meta-analysis. Am J Emerg Med. 2020;38:641–7.PubMedCrossRef Huang Z, Fu Z, Huang W, Huang K. Prognostic value of neutrophil-to-lymphocyte ratio in sepsis: a meta-analysis. Am J Emerg Med. 2020;38:641–7.PubMedCrossRef
48.
go back to reference Prozan L, et al. Prognostic value of Neutrophil-to-lymphocyte ratio in COVID-19 compared with Influenza and respiratory syncytial virus infection. Sci Rep. 2021;11:21519.PubMedPubMedCentralCrossRef Prozan L, et al. Prognostic value of Neutrophil-to-lymphocyte ratio in COVID-19 compared with Influenza and respiratory syncytial virus infection. Sci Rep. 2021;11:21519.PubMedPubMedCentralCrossRef
49.
go back to reference Kloska A, Malinowska M, Gabig-Cimińska M, Jakóbkiewicz-Banecka J. Lipids and lipid mediators associated with the risk and pathology of ischemic stroke. Int J Mol Sci. 2020;21:3618.PubMedPubMedCentralCrossRef Kloska A, Malinowska M, Gabig-Cimińska M, Jakóbkiewicz-Banecka J. Lipids and lipid mediators associated with the risk and pathology of ischemic stroke. Int J Mol Sci. 2020;21:3618.PubMedPubMedCentralCrossRef
51.
go back to reference Dagonnier M, Donnan GA, Davis SM, Dewey HM, Howells DW. Acute stroke biomarkers: Are we there yet? Front Neurol. 2021;12:1.CrossRef Dagonnier M, Donnan GA, Davis SM, Dewey HM, Howells DW. Acute stroke biomarkers: Are we there yet? Front Neurol. 2021;12:1.CrossRef
52.
go back to reference Li J, et al. Imputation of missing values for electronic health record laboratory data. NPJ Dig Med. 2021;4:1.CrossRef Li J, et al. Imputation of missing values for electronic health record laboratory data. NPJ Dig Med. 2021;4:1.CrossRef
53.
go back to reference Heo J, et al. Machine learning-based model for prediction of outcomes in acute stroke. Stroke. 2019;50:1263–5.PubMedCrossRef Heo J, et al. Machine learning-based model for prediction of outcomes in acute stroke. Stroke. 2019;50:1263–5.PubMedCrossRef
Metadata
Title
Common clinical blood and urine biomarkers for ischemic stroke: an Estonian Electronic Health Records database study
Authors
Siim Kurvits
Ainika Harro
Anu Reigo
Anne Ott
Sven Laur
Dage Särg
Ardi Tampuu
Kaur Alasoo
Jaak Vilo
Lili Milani
Toomas Haller
the Estonian Biobank Research Team
the PRECISE4Q consortium
Publication date
01-12-2023
Publisher
BioMed Central
Keywords
Stroke
Biomarkers
Published in
European Journal of Medical Research / Issue 1/2023
Electronic ISSN: 2047-783X
DOI
https://doi.org/10.1186/s40001-023-01087-6

Other articles of this Issue 1/2023

European Journal of Medical Research 1/2023 Go to the issue