Skip to main content
Top
Published in: BMC Medical Informatics and Decision Making 1/2017

Open Access 01-12-2017 | Research article

A novel data-driven workflow combining literature and electronic health records to estimate comorbidities burden for a specific disease: a case study on autoimmune comorbidities in patients with celiac disease

Authors: Jean-Baptiste Escudié, Bastien Rance, Georgia Malamut, Sherine Khater, Anita Burgun, Christophe Cellier, Anne-Sophie Jannot

Published in: BMC Medical Informatics and Decision Making | Issue 1/2017

Login to get access

Abstract

Background

Data collected in EHRs have been widely used to identifying specific conditions; however there is still a need for methods to define comorbidities and sources to identify comorbidities burden. We propose an approach to assess comorbidities burden for a specific disease using the literature and EHR data sources in the case of autoimmune diseases in celiac disease (CD).

Methods

We generated a restricted set of comorbidities using the literature (via the MeSH® co-occurrence file). We extracted the 15 most co-occurring autoimmune diseases of the CD. We used mappings of the comorbidities to EHR terminologies: ICD-10 (billing codes), ATC (drugs) and UMLS (clinical reports). Finally, we extracted the concepts from the different data sources. We evaluated our approach using the correlation between prevalence estimates in our cohort and co-occurrence ranking in the literature.

Results

We retrieved the comorbidities for 741 patients with CD. 18.1% of patients had at least one of the 15 studied autoimmune disorders. Overall, 79.3% of the mapped concepts were detected only in text, 5.3% only in ICD codes and/or drugs prescriptions, and 15.4% could be found in both sources. Prevalence in our cohort were correlated with literature (Spearman’s coefficient 0.789, p = 0.0005). The three most prevalent comorbidities were thyroiditis 12.6% (95% CI 10.1–14.9), type 1 diabetes 2.3% (95% CI 1.2–3.4) and dermatitis herpetiformis 2.0% (95% CI 1.0–3.0).

Conclusion

We introduced a process that leveraged the MeSH terminology to identify relevant autoimmune comorbidities of the CD and several data sources from EHRs to phenotype a large population of CD patients. We achieved prevalence estimates comparable to the literature.
Appendix
Available only for authorised users
Literature
1.
go back to reference Jannot AS, Zapletal E, Avillach P, Mamzer MF, Burgun A, Degoulet P. The Georges Pompidou University Hospital Clinical Data Warehouse: a 8-years follow-up experience. Int J Med Inform. 2017;102:21–8.CrossRefPubMed Jannot AS, Zapletal E, Avillach P, Mamzer MF, Burgun A, Degoulet P. The Georges Pompidou University Hospital Clinical Data Warehouse: a 8-years follow-up experience. Int J Med Inform. 2017;102:21–8.CrossRefPubMed
2.
go back to reference Shivade C, Raghavan P, Fosler-Lussier E, Embi PJ, Elhadad N, Johnson SB, et al. A review of approaches to identifying patient phenotype cohorts using electronic health records. J Am Med Inform Assoc. 2014;21:221–30.CrossRefPubMed Shivade C, Raghavan P, Fosler-Lussier E, Embi PJ, Elhadad N, Johnson SB, et al. A review of approaches to identifying patient phenotype cohorts using electronic health records. J Am Med Inform Assoc. 2014;21:221–30.CrossRefPubMed
3.
go back to reference Conway M, Berg RL, Carrell D, Denny JC, Kho AN, Kullo IJ, et al. Analyzing the heterogeneity and complexity of electronic health record oriented phenotyping algorithms. AMIA Annu Symp Proc. 2011;2011:274–83.PubMedPubMedCentral Conway M, Berg RL, Carrell D, Denny JC, Kho AN, Kullo IJ, et al. Analyzing the heterogeneity and complexity of electronic health record oriented phenotyping algorithms. AMIA Annu Symp Proc. 2011;2011:274–83.PubMedPubMedCentral
4.
go back to reference Benchimol EI, Guttmann A, Mack DR, Nguyen GC, Marshall JK, Gregor JC, et al. Validation of international algorithms to identify adults with inflammatory bowel disease in health administrative data from Ontario, Canada. J Clin Epidemiol. 2014;67:887–96.CrossRefPubMed Benchimol EI, Guttmann A, Mack DR, Nguyen GC, Marshall JK, Gregor JC, et al. Validation of international algorithms to identify adults with inflammatory bowel disease in health administrative data from Ontario, Canada. J Clin Epidemiol. 2014;67:887–96.CrossRefPubMed
5.
go back to reference Bertaud V, Lasbleiz J, Mougin F, Burgun A, Duvauferrier R. A unified representation of findings in clinical radiology using the UMLS and DICOM. Int J Med Inf. 2008;77:621–9.CrossRef Bertaud V, Lasbleiz J, Mougin F, Burgun A, Duvauferrier R. A unified representation of findings in clinical radiology using the UMLS and DICOM. Int J Med Inf. 2008;77:621–9.CrossRef
6.
go back to reference Fiszman M, Chapman WW, Aronsky D, Evans RS, Haug PJ. Automatic detection of acute bacterial pneumonia from chest X-ray reports. J Am Med Inform Assoc. 2000;7:593–604.CrossRefPubMedPubMedCentral Fiszman M, Chapman WW, Aronsky D, Evans RS, Haug PJ. Automatic detection of acute bacterial pneumonia from chest X-ray reports. J Am Med Inform Assoc. 2000;7:593–604.CrossRefPubMedPubMedCentral
7.
go back to reference Hahn U, Romacker M, Schulz S. MEDSYNDIKATE--a natural language system for the extraction of medical information from findings reports. Int J Med Inf. 2002;67:63–74.CrossRef Hahn U, Romacker M, Schulz S. MEDSYNDIKATE--a natural language system for the extraction of medical information from findings reports. Int J Med Inf. 2002;67:63–74.CrossRef
8.
go back to reference Friedman C, Shagina L, Lussier Y, Hripcsak G. Automated encoding of clinical documents based on natural language processing. J Am Med Inform Assoc. 2004;11:392–402.CrossRefPubMedPubMedCentral Friedman C, Shagina L, Lussier Y, Hripcsak G. Automated encoding of clinical documents based on natural language processing. J Am Med Inform Assoc. 2004;11:392–402.CrossRefPubMedPubMedCentral
9.
go back to reference Bakken S, Hyun S, Friedman C, Johnson SB. ISO reference terminology models for nursing: applicability for natural language processing of nursing narratives. Int J Med Inf. 2005;74:615–22.CrossRef Bakken S, Hyun S, Friedman C, Johnson SB. ISO reference terminology models for nursing: applicability for natural language processing of nursing narratives. Int J Med Inf. 2005;74:615–22.CrossRef
10.
go back to reference Li L, Chase HS, Patel CO, Friedman C, Weng C. Comparing ICD9-encoded diagnoses and NLP-processed discharge summaries for clinical trials pre-screening: a case study. AMIA Annu Symp Proc. 2008;2008:404–8. Li L, Chase HS, Patel CO, Friedman C, Weng C. Comparing ICD9-encoded diagnoses and NLP-processed discharge summaries for clinical trials pre-screening: a case study. AMIA Annu Symp Proc. 2008;2008:404–8.
11.
go back to reference Xu H, Fu Z, Shah A, Chen Y, Peterson NB, Chen Q, et al. Extracting and integrating data from entire electronic health records for detecting colorectal cancer cases. AMIA Annu Symp Proc. 2011;2011:1564–72.PubMedPubMedCentral Xu H, Fu Z, Shah A, Chen Y, Peterson NB, Chen Q, et al. Extracting and integrating data from entire electronic health records for detecting colorectal cancer cases. AMIA Annu Symp Proc. 2011;2011:1564–72.PubMedPubMedCentral
12.
go back to reference Wei W-Q, Teixeira PL, Mo H, Cronin RM, Warner JL, Denny JC. Combining billing codes, clinical notes, and medications from electronic health records provides superior phenotyping performance. J Am Med Inform Assoc. 2016;23:e20–7.CrossRefPubMed Wei W-Q, Teixeira PL, Mo H, Cronin RM, Warner JL, Denny JC. Combining billing codes, clinical notes, and medications from electronic health records provides superior phenotyping performance. J Am Med Inform Assoc. 2016;23:e20–7.CrossRefPubMed
13.
go back to reference Kirby JC, Speltz P, Rasmussen LV, Basford M, Gottesman O, Peissig PL, et al. PheKB: a catalog and workflow for creating electronic phenotype algorithms for transportability. J Am Med Inform Assoc. 2016;23(6):ocv202.CrossRef Kirby JC, Speltz P, Rasmussen LV, Basford M, Gottesman O, Peissig PL, et al. PheKB: a catalog and workflow for creating electronic phenotype algorithms for transportability. J Am Med Inform Assoc. 2016;23(6):ocv202.CrossRef
14.
go back to reference Cosnes J, Cellier C, Viola S, Colombel J, Michaud L, Sarles J, et al. Incidence of autoimmune diseases in celiac disease: protective effect of the gluten-free diet. Clin Gastroenterol Hepatol. 2008;6:753–8.CrossRefPubMed Cosnes J, Cellier C, Viola S, Colombel J, Michaud L, Sarles J, et al. Incidence of autoimmune diseases in celiac disease: protective effect of the gluten-free diet. Clin Gastroenterol Hepatol. 2008;6:753–8.CrossRefPubMed
15.
go back to reference Iqbal T, Zaidi MA, Wells GA, Karsh J. Celiac disease arthropathy and autoimmunity study. J Gastroenterol Hepatol. 2013;28:99–105.CrossRefPubMed Iqbal T, Zaidi MA, Wells GA, Karsh J. Celiac disease arthropathy and autoimmunity study. J Gastroenterol Hepatol. 2013;28:99–105.CrossRefPubMed
16.
go back to reference Collin P, Salmi J, Hällström O, Reunala T, Pasternack A. Autoimmune thyroid disorders and coeliac disease. Eur J Endocrinol Eur Fed Endocr Soc. 1994;130:137–40.CrossRef Collin P, Salmi J, Hällström O, Reunala T, Pasternack A. Autoimmune thyroid disorders and coeliac disease. Eur J Endocrinol Eur Fed Endocr Soc. 1994;130:137–40.CrossRef
17.
go back to reference Diamanti A, Ferretti F, Guglielmi R, Panetta F, Colistro F, Cappa M, et al. Thyroid autoimmunity in children with coeliac disease: a prospective survey. Arch Dis Child. 2011;96:1038–41.CrossRefPubMed Diamanti A, Ferretti F, Guglielmi R, Panetta F, Colistro F, Cappa M, et al. Thyroid autoimmunity in children with coeliac disease: a prospective survey. Arch Dis Child. 2011;96:1038–41.CrossRefPubMed
18.
go back to reference van der Pals M, Ivarsson A, Norström F, Högberg L, Svensson J, Carlsson A. Prevalence of thyroid autoimmunity in children with celiac disease compared to healthy 12-year olds. Autoimmune Dis. 2014;2014:417356.PubMedPubMedCentral van der Pals M, Ivarsson A, Norström F, Högberg L, Svensson J, Carlsson A. Prevalence of thyroid autoimmunity in children with celiac disease compared to healthy 12-year olds. Autoimmune Dis. 2014;2014:417356.PubMedPubMedCentral
19.
go back to reference Sategna-Guidetti C, Volta U, Ciacci C, Usai P, Carlino A, De Franceschi L, et al. Prevalence of thyroid disorders in untreated adult celiac disease patients and effect of gluten withdrawal: an Italian multicenter study. Am J Gastroenterol. 2001;96:751–7.CrossRefPubMed Sategna-Guidetti C, Volta U, Ciacci C, Usai P, Carlino A, De Franceschi L, et al. Prevalence of thyroid disorders in untreated adult celiac disease patients and effect of gluten withdrawal: an Italian multicenter study. Am J Gastroenterol. 2001;96:751–7.CrossRefPubMed
21.
go back to reference Lubrano E, Ciacci C, Ames PR, Mazzacca G, Oriente P, Scarpa R. The arthritis of coeliac disease: prevalence and pattern in 200 adult patients. Br J Rheumatol. 1996;35:1314–8.CrossRefPubMed Lubrano E, Ciacci C, Ames PR, Mazzacca G, Oriente P, Scarpa R. The arthritis of coeliac disease: prevalence and pattern in 200 adult patients. Br J Rheumatol. 1996;35:1314–8.CrossRefPubMed
22.
go back to reference Volta U, Caio G, Stanghellini V, De Giorgio R. The changing clinical profile of celiac disease: a 15-year experience (1998-2012) in an Italian referral center. BMC Gastroenterol. 2014;14:194.CrossRefPubMedPubMedCentral Volta U, Caio G, Stanghellini V, De Giorgio R. The changing clinical profile of celiac disease: a 15-year experience (1998-2012) in an Italian referral center. BMC Gastroenterol. 2014;14:194.CrossRefPubMedPubMedCentral
23.
go back to reference Størdal K, Bakken IJ, Surén P, Stene LC. Epidemiology of Coeliac Disease and Comorbidity in Norwegian Children: J. Pediatr Gastroenterol Nutr. 2013;57:467–71.CrossRef Størdal K, Bakken IJ, Surén P, Stene LC. Epidemiology of Coeliac Disease and Comorbidity in Norwegian Children: J. Pediatr Gastroenterol Nutr. 2013;57:467–71.CrossRef
24.
go back to reference Bybrant MC, Örtqvist E, Lantz S, Grahnquist L. High prevalence of celiac disease in Swedish children and adolescents with type 1 diabetes and the relation to the Swedish epidemic of celiac disease: a cohort study. Scand J Gastroenterol. 2014;49:52–8.CrossRefPubMed Bybrant MC, Örtqvist E, Lantz S, Grahnquist L. High prevalence of celiac disease in Swedish children and adolescents with type 1 diabetes and the relation to the Swedish epidemic of celiac disease: a cohort study. Scand J Gastroenterol. 2014;49:52–8.CrossRefPubMed
25.
go back to reference Zapletal E, Rodon N, Grabar N, Degoulet P. Methodology of integration of a clinical data warehouse with a clinical information system: the HEGP case. Stud Health Technol Inform. 2010;160:193–7.PubMed Zapletal E, Rodon N, Grabar N, Degoulet P. Methodology of integration of a clinical data warehouse with a clinical information system: the HEGP case. Stud Health Technol Inform. 2010;160:193–7.PubMed
26.
go back to reference Al-Hussaini A, Sulaiman N, Al-Zahrani M, Alenizi A, El Haj I. High prevalence of celiac disease among Saudi children with type 1 diabetes: a prospective cross-sectional study. BMC Gastroenterol. 2012;12:180.CrossRefPubMedPubMedCentral Al-Hussaini A, Sulaiman N, Al-Zahrani M, Alenizi A, El Haj I. High prevalence of celiac disease among Saudi children with type 1 diabetes: a prospective cross-sectional study. BMC Gastroenterol. 2012;12:180.CrossRefPubMedPubMedCentral
27.
go back to reference Gonzalez GH, Tahsin T, Goodale BC, Greene AC, Greene CS. Recent advances and emerging applications in text and data mining for biomedical discovery. Brief Bioinform. 2016;17:33–42.CrossRefPubMed Gonzalez GH, Tahsin T, Goodale BC, Greene AC, Greene CS. Recent advances and emerging applications in text and data mining for biomedical discovery. Brief Bioinform. 2016;17:33–42.CrossRefPubMed
28.
go back to reference Abdelali B, Caruba T, Zapletal E, Sabatier B, Durieux P, Degoulet P. A Clinical Data Warehouse-Based Process for Refining Medication Orders Alerts. J Am Med Informat Assoc: JAMIA. 2012;19(5):782–85. doi:10.1136/amiajnl-2012-000850. Abdelali B, Caruba T, Zapletal E, Sabatier B, Durieux P, Degoulet P. A Clinical Data Warehouse-Based Process for Refining Medication Orders Alerts. J Am Med Informat Assoc: JAMIA. 2012;19(5):782–85. doi:10.​1136/​amiajnl-2012-000850.
29.
go back to reference Escudié J-B, Jannot A-S, Zapletal E, Cohen S, Malamut G, Burgun A, et al. Reviewing 741 patients records in two hours with FASTVISU. AMIA Annu Symp Proc. 2015;2015:553–9.PubMedPubMedCentral Escudié J-B, Jannot A-S, Zapletal E, Cohen S, Malamut G, Burgun A, et al. Reviewing 741 patients records in two hours with FASTVISU. AMIA Annu Symp Proc. 2015;2015:553–9.PubMedPubMedCentral
30.
go back to reference Sperrin M, Thew S, Weatherall J, Dixon W, Buchan I. Quantifying the longitudinal value of healthcare record collections for pharmacoepidemiology. AMIA Annu Symp Proc. 2011;2011:1318–25.PubMedPubMedCentral Sperrin M, Thew S, Weatherall J, Dixon W, Buchan I. Quantifying the longitudinal value of healthcare record collections for pharmacoepidemiology. AMIA Annu Symp Proc. 2011;2011:1318–25.PubMedPubMedCentral
31.
go back to reference Casez P, Labarère J, Sevestre M-A, Haddouche M, Courtois X, Mercier S, et al. ICD-10 hospital discharge diagnosis codes were sensitive for identifying pulmonary embolism but not deep vein thrombosis. J Clin Epidemiol. 2010;63:790–7.CrossRefPubMed Casez P, Labarère J, Sevestre M-A, Haddouche M, Courtois X, Mercier S, et al. ICD-10 hospital discharge diagnosis codes were sensitive for identifying pulmonary embolism but not deep vein thrombosis. J Clin Epidemiol. 2010;63:790–7.CrossRefPubMed
33.
go back to reference Hruby GW, Matsoukas K, Cimino JJ, Weng C. Facilitating biomedical researchers’ interrogation of electronic health record data: Ideas from outside of biomedical informatics. J Biomed Inform. 2016;60:376–84.CrossRefPubMedPubMedCentral Hruby GW, Matsoukas K, Cimino JJ, Weng C. Facilitating biomedical researchers’ interrogation of electronic health record data: Ideas from outside of biomedical informatics. J Biomed Inform. 2016;60:376–84.CrossRefPubMedPubMedCentral
34.
go back to reference Adler-Milstein J, DesRoches CM, Kralovec P, Foster G, Worzala C, Charles D, et al. Electronic health record adoption in US hospitals: progress continues, but challenges persist. Health Aff Proj Hope. 2015;34:2174–80.CrossRef Adler-Milstein J, DesRoches CM, Kralovec P, Foster G, Worzala C, Charles D, et al. Electronic health record adoption in US hospitals: progress continues, but challenges persist. Health Aff Proj Hope. 2015;34:2174–80.CrossRef
Metadata
Title
A novel data-driven workflow combining literature and electronic health records to estimate comorbidities burden for a specific disease: a case study on autoimmune comorbidities in patients with celiac disease
Authors
Jean-Baptiste Escudié
Bastien Rance
Georgia Malamut
Sherine Khater
Anita Burgun
Christophe Cellier
Anne-Sophie Jannot
Publication date
01-12-2017
Publisher
BioMed Central
Published in
BMC Medical Informatics and Decision Making / Issue 1/2017
Electronic ISSN: 1472-6947
DOI
https://doi.org/10.1186/s12911-017-0537-y

Other articles of this Issue 1/2017

BMC Medical Informatics and Decision Making 1/2017 Go to the issue