Skip to main content
Top
Published in: BMC Medical Research Methodology 1/2023

Open Access 01-12-2023 | Care | Research article

Analytical methods for identifying sequences of utilization in health data: a scoping review

Authors: Amelie Flothow, Anna Novelli, Leonie Sundmacher

Published in: BMC Medical Research Methodology | Issue 1/2023

Login to get access

Abstract

Background

Healthcare, as with other sectors, has undergone progressive digitalization, generating an ever-increasing wealth of data that enables research and the analysis of patient movement. This can help to evaluate treatment processes and outcomes, and in turn improve the quality of care. This scoping review provides an overview of the algorithms and methods that have been used to identify care pathways from healthcare utilization data.

Method

This review was conducted according to the methodology of the Joanna Briggs Institute and the Preferred Reporting Items for Systematic Reviews Extension for Scoping Reviews (PRISMA-ScR) Checklist. The PubMed, Web of Science, Scopus, and EconLit databases were searched and studies published in English between 2000 and 2021 considered. The search strategy used keywords divided into three categories: the method of data analysis, the requirement profile for the data, and the intended presentation of results. Criteria for inclusion were that health data were analyzed, the methodology used was described and that the chronology of care events was considered. In a two-stage review process, records were reviewed by two researchers independently for inclusion. Results were synthesized narratively.

Results

The literature search yielded 2,865 entries; 51 studies met the inclusion criteria. Health data from different countries (\(n=12\)) and of different types of disease (\(n=26\)) were analyzed with respect to different care events. Applied methods can be divided into those identifying subsequences of care and those describing full care trajectories. Variants of pattern mining or Markov models were mostly used to extract subsequences, with clustering often applied to find care trajectories. Statistical algorithms such as rule mining, probability-based machine learning algorithms or a combination of methods were also applied. Clustering methods were sometimes used for data preparation or result compression. Further characteristics of the included studies are presented.

Conclusion

Various data mining methods are already being applied to gain insight from health data. The great heterogeneity of the methods used shows the need for a scoping review. We performed a narrative review and found that clustering methods currently dominate the literature for identifying complete care trajectories, while variants of pattern mining dominate for identifying subsequences of limited length.
Appendix
Available only for authorised users
Literature
1.
go back to reference Rydning DRJGJ, Reinsel J, Gantz J. The digitization of the world from edge to core, vol. 16. Framingham: International Data Corporation; 2018. p. 1–28. Rydning DRJGJ, Reinsel J, Gantz J. The digitization of the world from edge to core, vol. 16. Framingham: International Data Corporation; 2018. p. 1–28.
5.
go back to reference Vanasse A, Courteau J, Courteau M, Benigeri M., Chiu YM, Dufour I, Couillard S., Larivee, P., Hudon, C.: Healthcare utilization after a first hospitalization for copd: a new approach of state sequence analysis based on the ‘6w’ multidimensional model of care trajectories. BMC Health Serv Res 2020;20(1). https://doi.org/10.1186/s12913-020-5030-0 Vanasse A, Courteau J, Courteau M, Benigeri M., Chiu YM, Dufour I, Couillard S., Larivee, P., Hudon, C.: Healthcare utilization after a first hospitalization for copd: a new approach of state sequence analysis based on the ‘6w’ multidimensional model of care trajectories. BMC Health Serv Res 2020;20(1). https://​doi.​org/​10.​1186/​s12913-020-5030-0
7.
go back to reference Lambert-Cote L, Bouhnik AD, Bendiane MK, Berenger C, Mondor M, Huiart L, Lauzier S. Adherence trajectories of adjuvant endocrine therapy in the five years after its initiation among women with non-metastatic breast cancer: a cohort study using administrative databases. Breast Cancer Res Treat. 2020;180:777–90. https://doi.org/10.1007/s10549-020-05549-xCrossRefPubMed Lambert-Cote L, Bouhnik AD, Bendiane MK, Berenger C, Mondor M, Huiart L, Lauzier S. Adherence trajectories of adjuvant endocrine therapy in the five years after its initiation among women with non-metastatic breast cancer: a cohort study using administrative databases. Breast Cancer Res Treat. 2020;180:777–90. https://​doi.​org/​10.​1007/​s10549-020-05549-xCrossRefPubMed
8.
go back to reference Yan C, Chen Y, Li B, Liebovitz D, Malin B. Learning clinical workflows to identify subgroups of heart failure patients. AMIA Annu Symp Proc. 2017;2016:1248–57. Yan C, Chen Y, Li B, Liebovitz D, Malin B. Learning clinical workflows to identify subgroups of heart failure patients. AMIA Annu Symp Proc. 2017;2016:1248–57.
10.
12.
go back to reference Williams R, Rojas E, Peek N, Johnson OA. Process mining in primary care: a literaturereview. Stud Health Technol Inform. 2018;247:376–80. Williams R, Rojas E, Peek N, Johnson OA. Process mining in primary care: a literaturereview. Stud Health Technol Inform. 2018;247:376–80.
15.
go back to reference Peters M, Godfrey C, Mcinerney P, Soares C, Khalil H, Parker D. Methodology for jbi scoping reviews, 1st ed. p. 1–24. Joanna Briggs Institute; 2015 Peters M, Godfrey C, Mcinerney P, Soares C, Khalil H, Parker D. Methodology for jbi scoping reviews, 1st ed. p. 1–24. Joanna Briggs Institute; 2015
19.
go back to reference Alharbi A, Bulpitt A, Johnson OA, Klein GO, Karlsson D, Moen A, Ugon A. Towards unsupervised detection of process models in healthcare. Stud Health Technol Inform. 2018;247:381–5PubMed Alharbi A, Bulpitt A, Johnson OA, Klein GO, Karlsson D, Moen A, Ugon A. Towards unsupervised detection of process models in healthcare. Stud Health Technol Inform. 2018;247:381–5PubMed
27.
go back to reference Cherrie M, Curtis S, Baranyi G, McTaggart S, Cunningham N, Licence K, Dibben C, Bambra C, Pearce J. Use of sequence analysis for classifying individual antidepressant trajectories to monitor population mental health. BMC Psychiatr 2020;20(1). https://doi.org/10.1186/s12888-020-02952-y Cherrie M, Curtis S, Baranyi G, McTaggart S, Cunningham N, Licence K, Dibben C, Bambra C, Pearce J. Use of sequence analysis for classifying individual antidepressant trajectories to monitor population mental health. BMC Psychiatr 2020;20(1). https://​doi.​org/​10.​1186/​s12888-020-02952-y
33.
go back to reference Esmaili N, Buchlak QD, Piccardi M, Kruger B, Girosi F. Multichannel mixture models for time-series analysis and classification of engagement with multiple health services: An application to psychology and physiotherapy utilization patterns after traffic accidents. Artif Intell Med 2021;111. https://doi.org/10.1016/j.artmed.2020.101997 Esmaili N, Buchlak QD, Piccardi M, Kruger B, Girosi F. Multichannel mixture models for time-series analysis and classification of engagement with multiple health services: An application to psychology and physiotherapy utilization patterns after traffic accidents. Artif Intell Med 2021;111. https://​doi.​org/​10.​1016/​j.​artmed.​2020.​101997
35.
37.
go back to reference Honda Y, Kushima M, Yamazaki T, Araki K, Yokota H, Begoli E, Luo G, Wang F. Detection and visualization of variants in typical medical treatment sequences. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2017;10494:89–101. https://doi.org/10.1007/978-3-319-67186-4_8 Honda Y, Kushima M, Yamazaki T, Araki K, Yokota H, Begoli E, Luo G, Wang F. Detection and visualization of variants in typical medical treatment sequences. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2017;10494:89–101. https://​doi.​org/​10.​1007/​978-3-319-67186-4_​8
41.
go back to reference Le H.H, Yamada T, Honda Y, Kayahara M, Kushima M, Araki K, Yokota H, Hartmann S, Kung J, Anderst-Kotsis G, Khalil I, Chakravarthy S, Tjoa AM. Analyzing sequence pattern variants in sequential pattern mining and its application to electronic medical record systems. Lect Notes Comput Sci. 2019;11707:393–408. https://doi.org/10.1007/978-3-030-27618-8_29. Le H.H, Yamada T, Honda Y, Kayahara M, Kushima M, Araki K, Yokota H, Hartmann S, Kung J, Anderst-Kotsis G, Khalil I, Chakravarthy S, Tjoa AM. Analyzing sequence pattern variants in sequential pattern mining and its application to electronic medical record systems. Lect Notes Comput Sci. 2019;11707:393–408. https://​doi.​org/​10.​1007/​978-3-030-27618-8_​29.
46.
go back to reference Nuemi G, Afonso F, Roussot A, Billard L, Cottenet J, Combier E, Diday E, Quantin C. Classification of hospital pathways in the management of cancer: Application to lung cancer in the region of burgundy. Cancer Epidemiol. 2013;37(5):688–996CrossRefPubMed Nuemi G, Afonso F, Roussot A, Billard L, Cottenet J, Combier E, Diday E, Quantin C. Classification of hospital pathways in the management of cancer: Application to lung cancer in the region of burgundy. Cancer Epidemiol. 2013;37(5):688–996CrossRefPubMed
48.
50.
go back to reference Pokharel S, Zuccon G, Li Y. Representing EHRs with Temporal Tree and Sequential Pattern Mining for Similarity Computing. In: Yang, X., Wang, CD., Islam, M.S., Zhang, Z. (eds) Advanced Data Mining and Applications. ADMA 2020. Lecture Notes in Computer Science, vol 12447. Cham: Springer; 2020. https://doi.org/10.1007/978-3-030-65390-3_18. Pokharel S, Zuccon G, Li Y. Representing EHRs with Temporal Tree and Sequential Pattern Mining for Similarity Computing. In: Yang, X., Wang, CD., Islam, M.S., Zhang, Z. (eds) Advanced Data Mining and Applications. ADMA 2020. Lecture Notes in Computer Science, vol 12447. Cham: Springer; 2020. https://​doi.​org/​10.​1007/​978-3-030-65390-3_​18.
57.
go back to reference Sun W, Shen W, Li X, Cao F, Ni Y, Liu H. Mining information dependency in outpatient encounters for chronic disease care. 40th Medical Informatics in Europe Conference, MIE 2018, vol 192, 2013. p. 278–282. Sun W, Shen W, Li X, Cao F, Ni Y, Liu H. Mining information dependency in outpatient encounters for chronic disease care. 40th Medical Informatics in Europe Conference, MIE 2018, vol 192, 2013. p. 278–282.
61.
go back to reference Zhang Y, Padman R, Wasserman L. On learning and visualizing practice-based clinical pathways for chronic kidney disease. AMIA Symposium: AMIA. Annual Symposium proceedings; 2014 Zhang Y, Padman R, Wasserman L. On learning and visualizing practice-based clinical pathways for chronic kidney disease. AMIA Symposium: AMIA. Annual Symposium proceedings; 2014
62.
go back to reference Zhang YY, Padman R. Innovations in chronic care delivery using data-driven clinical pathways. Am J Manage Care. 2015;21(12):661–8. PMID: 26760429 Zhang YY, Padman R. Innovations in chronic care delivery using data-driven clinical pathways. Am J Manage Care. 2015;21(12):661–8. PMID: 26760429
63.
go back to reference Wilson J, Bock A. The benefit of using both claims data and electronic medical record data in health care analysis. Optum Insight. 2012;1:1–4. Wilson J, Bock A. The benefit of using both claims data and electronic medical record data in health care analysis. Optum Insight. 2012;1:1–4.
69.
go back to reference Hastie T, Tibshirani R, Friedman J. The Elements of Statistical Learning, 2nd ed. Springer Hastie T, Tibshirani R, Friedman J. The Elements of Statistical Learning, 2nd ed. Springer
77.
go back to reference Leemans SJJ, Fahland D, Aalst van der WMP. Exploring processes and deviations. In: Fournier F, Mendling J, editors. Business Process Management Workshops. Springer; 2015. p. 304–16 Leemans SJJ, Fahland D, Aalst van der WMP. Exploring processes and deviations. In: Fournier F, Mendling J, editors. Business Process Management Workshops. Springer; 2015. p. 304–16
Metadata
Title
Analytical methods for identifying sequences of utilization in health data: a scoping review
Authors
Amelie Flothow
Anna Novelli
Leonie Sundmacher
Publication date
01-12-2023
Publisher
BioMed Central
Keyword
Care
Published in
BMC Medical Research Methodology / Issue 1/2023
Electronic ISSN: 1471-2288
DOI
https://doi.org/10.1186/s12874-023-02019-y

Other articles of this Issue 1/2023

BMC Medical Research Methodology 1/2023 Go to the issue