Published in: Digestive Diseases and Sciences 4/2013

01-04-2013 | Original Article

Automated Identification of Surveillance Colonoscopy in Inflammatory Bowel Disease Using Natural Language Processing

Authors: Jason K. Hou, Mimi Chang, Thien Nguyen, Jennifer R. Kramer, Peter Richardson, Shubhada Sansgiry, Leonard W. D’Avolio, Hashem B. El-Serag

Abstract

Background

Differentiating surveillance from non-surveillance colonoscopy for colorectal cancer in patients with inflammatory bowel disease (IBD) using electronic medical records (EMR) is important for practice improvement and research, but diagnosis-code algorithms for this purpose are lacking. The automated retrieval console (ARC) is natural language processing (NLP)-based software that performs text-based, document-level classification.

Aims

The purpose of this study was to test the feasibility and accuracy of ARC in identifying surveillance and non-surveillance colonoscopy in IBD using EMR.

Methods

We performed a split validation study of electronic colonoscopy pathology reports for patients with IBD from the Michael E. DeBakey VA Medical Center. A gastroenterologist manually classified each pathology report as derived from either surveillance or non-surveillance colonoscopy. Pathology reports were randomly split into two sets: 70% for algorithm derivation and 30% for validation. An ARC-generated classification model was applied to the validation set of pathology reports. The performance of the model was compared with manual classification for surveillance and non-surveillance colonoscopy.
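The random split described above can be illustrated with a minimal sketch. The abstract does not describe how the authors performed the split, so the function name, the fixed seed, and the placeholder report IDs below are all assumptions made for illustration; only the 70/30 proportion and the total of 575 reports come from the abstract.

```python
import random


def split_reports(report_ids, train_fraction=0.7, seed=0):
    """Randomly partition report IDs into derivation and validation sets.

    Hypothetical helper: the abstract does not say how the split was drawn.
    """
    shuffled = list(report_ids)
    # A seeded generator keeps the illustrative split reproducible.
    random.Random(seed).shuffle(shuffled)
    cut = int(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]


# Placeholder IDs standing in for the 575 pathology reports.
report_ids = [f"report-{i:03d}" for i in range(575)]
train_ids, validation_ids = split_reports(report_ids)
print(len(train_ids), len(validation_ids))
```

An exact 70% cut of 575 reports does not reproduce the reported set sizes of 400 and 175, so the study's split was presumably approximate or rounded.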

Results

A total of 575 colonoscopy pathology reports were available for 195 patients with IBD; 400 reports were assigned to the training set and 175 to the testing set. Within the testing set, 69 pathology reports were classified as surveillance by manual review, whereas the ARC model classified 66 reports as surveillance, for a recall of 0.77, precision of 0.80, and specificity of 0.88.
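The abstract reports the marginal counts (175 testing reports, 69 manually classified as surveillance, 66 classified as surveillance by the model) but not the number of true positives. As a consistency check, the sketch below searches for the true-positive count that reproduces all three reported metrics after rounding to two decimals. The resulting confusion matrix is inferred here, not reported by the authors, and the function names are illustrative.

```python
def confusion_from_counts(total, actual_pos, predicted_pos, true_pos):
    """Derive (tp, fp, fn, tn) from marginal counts and a candidate tp."""
    fp = predicted_pos - true_pos
    fn = actual_pos - true_pos
    tn = total - actual_pos - fp
    return true_pos, fp, fn, tn


def metrics(tp, fp, fn, tn):
    """Standard definitions of recall, precision, and specificity."""
    return {
        "recall": tp / (tp + fn),
        "precision": tp / (tp + fp),
        "specificity": tn / (tn + fp),
    }


# Counts and metrics as stated in the abstract.
TOTAL, MANUAL_POS, MODEL_POS = 175, 69, 66
REPORTED = {"recall": 0.77, "precision": 0.80, "specificity": 0.88}

# Try every feasible true-positive count and keep those that match.
matches = []
for tp in range(min(MANUAL_POS, MODEL_POS) + 1):
    m = metrics(*confusion_from_counts(TOTAL, MANUAL_POS, MODEL_POS, tp))
    if all(round(m[k], 2) == REPORTED[k] for k in REPORTED):
        matches.append(tp)

confusion = confusion_from_counts(TOTAL, MANUAL_POS, MODEL_POS, matches[0])
print(matches, confusion)
```

Only one true-positive count (53) is consistent with the reported figures, which implies 13 false positives, 16 false negatives, and 93 true negatives in the testing set.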

Conclusions

ARC was able to identify surveillance colonoscopy for IBD without customized software programming. NLP-based document-level classification may be used to differentiate surveillance from non-surveillance colonoscopy in IBD.
Metadata
Title
Automated Identification of Surveillance Colonoscopy in Inflammatory Bowel Disease Using Natural Language Processing
Authors
Jason K. Hou
Mimi Chang
Thien Nguyen
Jennifer R. Kramer
Peter Richardson
Shubhada Sansgiry
Leonard W. D’Avolio
Hashem B. El-Serag
Publication date
01-04-2013
Publisher
Springer US
Published in
Digestive Diseases and Sciences / Issue 4/2013
Print ISSN: 0163-2116
Electronic ISSN: 1573-2568
DOI
https://doi.org/10.1007/s10620-012-2433-8
