Skip to main content
Top
Published in: BMC Primary Care 1/2022

Open Access 01-12-2022 | Care | Research

Detection of primary Sjögren’s syndrome in primary care: developing a classification model with the use of routine healthcare data and machine learning

Authors: Jesper T. Dros, Isabelle Bos, Frank C. Bennis, Sytske Wiegersma, John Paget, Chiara Seghieri, Jaime Barrio Cortés, Robert A. Verheij

Published in: BMC Primary Care | Issue 1/2022

Login to get access

Abstract

Background

Primary Sjögren’s Syndrome (pSS) is a rare autoimmune disease that is difficult to diagnose due to a variety of clinical presentations, resulting in misdiagnosis and late referral to specialists. To improve early-stage disease recognition, this study aimed to develop an algorithm to identify possible pSS patients in primary care. We built a machine learning algorithm which was based on combined healthcare data as a first step towards a clinical decision support system.

Method

Routine healthcare data, consisting of primary care electronic health records (EHRs) data and hospital claims data (HCD), were linked on patient level and consisted of 1411 pSS and 929,179 non-pSS patients. Logistic regression (LR) and random forest (RF) models were used to classify patients using age, gender, diseases and symptoms, prescriptions and GP visits.

Results

The LR and RF models had an AUC of 0.82 and 0.84, respectively. Many actual pSS patients were found (sensitivity LR = 72.3%, RF = 70.1%), specificity was 74.0% (LR) and 77.9% (RF) and the negative predictive value was 99.9% for both models. However, most patients classified as pSS patients did not have a diagnosis of pSS in secondary care (positive predictive value LR = 0.4%, RF = 0.5%).

Conclusion

This is the first study to use machine learning to classify patients with pSS in primary care using GP EHR data. Our algorithm has the potential to support the early recognition of pSS in primary care and should be validated and optimized in clinical practice. To further enhance the algorithm in detecting pSS in primary care, we suggest it is improved by working with experienced clinicians.
Appendix
Available only for authorised users
Literature
1.
go back to reference Daniels T, Fox. Salivary and oral components of Sjögren’s syndrome. Rheum Dis Clin North Am. 1992:571–589. Daniels T, Fox. Salivary and oral components of Sjögren’s syndrome. Rheum Dis Clin North Am. 1992:571–589.
4.
6.
go back to reference Baldini C, Ferro F, Luciano N, Bombardieri S, Grossi E. Artificial neural networks help to identify disease subsets and to predict lymphoma in primary Sjögren's syndrome. Clin Exp Rheumatol. 2018;36 Suppl 112(3):137-44. Baldini C, Ferro F, Luciano N, Bombardieri S, Grossi E. Artificial neural networks help to identify disease subsets and to predict lymphoma in primary Sjögren's syndrome. Clin Exp Rheumatol. 2018;36 Suppl 112(3):137-44.
8.
go back to reference Acar-Denizli N, Kostov B, Ramos-Casals M; Sjögren Big Data Consortium. The Big Data Sjögren Consortium: a project for a new data science era. Clin Exp Rheumatol. 2019;37 Suppl 118(3):19-23. Acar-Denizli N, Kostov B, Ramos-Casals M; Sjögren Big Data Consortium. The Big Data Sjögren Consortium: a project for a new data science era. Clin Exp Rheumatol. 2019;37 Suppl 118(3):19-23.
9.
go back to reference Ronicke S. Can a decision support system accelerate rare disease diagnosis? Evaluating the potential impact of Ada DX in a retrospective study. 2019;12. Ronicke S. Can a decision support system accelerate rare disease diagnosis? Evaluating the potential impact of Ada DX in a retrospective study. 2019;12.
11.
go back to reference Hasaart F. Incentives in the diagnosis treatment combination payment system for specialist medical care: a study about behavioral responses of medical specialists and hospitals in the Netherlands. Maastricht University; 2011. Hasaart F. Incentives in the diagnosis treatment combination payment system for specialist medical care: a study about behavioral responses of medical specialists and hospitals in the Netherlands. Maastricht University; 2011.
20.
go back to reference Aikake H. A new look at the statistical model identification. IEEE Trans Autom Control. 1974;19(6):716–23. 10.1109. Aikake H. A new look at the statistical model identification. IEEE Trans Autom Control. 1974;19(6):716–23. 10.1109.
22.
go back to reference Lutgendorf MA, Stoll KA. Why 99% may not be as good as you think it is: limitations of screening for rare diseases. 4. Lutgendorf MA, Stoll KA. Why 99% may not be as good as you think it is: limitations of screening for rare diseases. 4.
24.
go back to reference Shiboski CH, Shiboski SC, Seror R, et al. 2016 American College of Rheumatology/European league against rheumatism classification criteria for primary Sjögren’s syndrome: a consensus and data-driven methodology involving three international patient cohorts. Arthritis Rheumatol. 2017;69(1):35–45. https://doi.org/10.1002/art.39859.CrossRefPubMed Shiboski CH, Shiboski SC, Seror R, et al. 2016 American College of Rheumatology/European league against rheumatism classification criteria for primary Sjögren’s syndrome: a consensus and data-driven methodology involving three international patient cohorts. Arthritis Rheumatol. 2017;69(1):35–45. https://​doi.​org/​10.​1002/​art.​39859.CrossRefPubMed
25.
go back to reference Boulesteix AL, Janitza S, Kruppa J, König IR. Overview of random forest methodology and practical guidance with emphasis on computational biology and bioinformatics: random forests in bioinformatics. Wiley Interdiscip Rev Data Min Knowl Discov. 2012;2(6):493–507. https://doi.org/10.1002/widm.1072.CrossRef Boulesteix AL, Janitza S, Kruppa J, König IR. Overview of random forest methodology and practical guidance with emphasis on computational biology and bioinformatics: random forests in bioinformatics. Wiley Interdiscip Rev Data Min Knowl Discov. 2012;2(6):493–507. https://​doi.​org/​10.​1002/​widm.​1072.CrossRef
27.
go back to reference Argyropoulou OD, Valentini E, Ferro F, Leone MC, Cafaro G, Bartoloni E, Baldini C. One year in review 2018: Sjögren's syndrome. Clin Exp Rheumatol. 2018;36 Suppl 112(3):14-26. Argyropoulou OD, Valentini E, Ferro F, Leone MC, Cafaro G, Bartoloni E, Baldini C. One year in review 2018: Sjögren's syndrome. Clin Exp Rheumatol. 2018;36 Suppl 112(3):14-26.
29.
go back to reference Foulquier N, Redou P, Le Gal C, Rouvière B, Pers JO, Saraux A. Pathogenesis-based treatments in primary Sjogren’s syndrome using artificial intelligence and advanced machine learning techniques: a systematic literature review. Hum Vaccines Immunother. , 2018:1–6. doi:https://doi.org/10.1080/21645515.2018.1475872. Foulquier N, Redou P, Le Gal C, Rouvière B, Pers JO, Saraux A. Pathogenesis-based treatments in primary Sjogren’s syndrome using artificial intelligence and advanced machine learning techniques: a systematic literature review. Hum Vaccines Immunother. , 2018:1–6. doi:https://​doi.​org/​10.​1080/​21645515.​2018.​1475872.
Metadata
Title
Detection of primary Sjögren’s syndrome in primary care: developing a classification model with the use of routine healthcare data and machine learning
Authors
Jesper T. Dros
Isabelle Bos
Frank C. Bennis
Sytske Wiegersma
John Paget
Chiara Seghieri
Jaime Barrio Cortés
Robert A. Verheij
Publication date
01-12-2022
Publisher
BioMed Central
Keyword
Care
Published in
BMC Primary Care / Issue 1/2022
Electronic ISSN: 2731-4553
DOI
https://doi.org/10.1186/s12875-022-01804-w

Other articles of this Issue 1/2022

BMC Primary Care 1/2022 Go to the issue