Skip to main content
Top
Published in: BMC Medical Informatics and Decision Making 1/2015

Open Access 01-12-2015 | Research article

Identifying influenza-like illness presentation from unstructured general practice clinical narrative using a text classifier rule-based expert system versus a clinical expert

Authors: Jayden MacRae, Tom Love, Michael G. Baker, Anthony Dowell, Matthew Carnachan, Maria Stubbe, Lynn McBain

Published in: BMC Medical Informatics and Decision Making | Issue 1/2015

Login to get access

Abstract

Background

We designed and validated a rule-based expert system to identify influenza like illness (ILI) from routinely recorded general practice clinical narrative to aid a larger retrospective research study into the impact of the 2009 influenza pandemic in New Zealand.

Methods

Rules were assessed using pattern matching heuristics on routine clinical narrative. The system was trained using data from 623 clinical encounters and validated using a clinical expert as a gold standard against a mutually exclusive set of 901 records.

Results

We calculated a 98.2 % specificity and 90.2 % sensitivity across an ILI incidence of 12.4 % measured against clinical expert classification. Peak problem list identification of ILI by clinical coding in any month was 9.2 % of all detected ILI presentations. Our system addressed an unusual problem domain for clinical narrative classification; using notational, unstructured, clinician entered information in a community care setting. It performed well compared with other approaches and domains. It has potential applications in real-time surveillance of disease, and in assisted problem list coding for clinicians.

Conclusions

Our system identified ILI presentation with sufficient accuracy for use at a population level in the wider research study. The peak coding of 9.2 % illustrated the need for automated coding of unstructured narrative in our study.
Literature
1.
go back to reference Lekana-Douki SE, Nkoghe D, Drosten C, Ngoungou EB, Drexler JF, Leroy EM. Viral etiology and seasonality of influenza-like illness in Gabon, March 2010 to June 2011. BMC Infect Dis. 2014;14:373.CrossRefPubMedPubMedCentral Lekana-Douki SE, Nkoghe D, Drosten C, Ngoungou EB, Drexler JF, Leroy EM. Viral etiology and seasonality of influenza-like illness in Gabon, March 2010 to June 2011. BMC Infect Dis. 2014;14:373.CrossRefPubMedPubMedCentral
2.
go back to reference Maman I, Badziklou K, Landoh ED, Halatoko AW, Nzussouo TN, Defang GN, et al. Implementation of influenza-like illness sentinel surveillance in Togo. BMC Public Health. 2014;14:981.CrossRefPubMedPubMedCentral Maman I, Badziklou K, Landoh ED, Halatoko AW, Nzussouo TN, Defang GN, et al. Implementation of influenza-like illness sentinel surveillance in Togo. BMC Public Health. 2014;14:981.CrossRefPubMedPubMedCentral
3.
go back to reference Baker MG, Wilson N, Huang QS, Paine S, Lopez L, Bandaranayake D, et al. Pandemic influenza A(H1N1)v in New Zealand: the experience from April to August 2009. Eurosurveillance. 2009;14:1–6. Baker MG, Wilson N, Huang QS, Paine S, Lopez L, Bandaranayake D, et al. Pandemic influenza A(H1N1)v in New Zealand: the experience from April to August 2009. Eurosurveillance. 2009;14:1–6.
4.
go back to reference Kelly H, Grant K. Interim Analysis of Pandemic Influenza (H1N1) 2009 In Australia: Surveillance trends, age of infection and effectiveness of seasonal vaccination. Euro Surveill. 2009;14 Kelly H, Grant K. Interim Analysis of Pandemic Influenza (H1N1) 2009 In Australia: Surveillance trends, age of infection and effectiveness of seasonal vaccination. Euro Surveill. 2009;14
5.
go back to reference Baker MG, Wilson N, Huang QS, Paine S, Lopez L, Bandaranayake D, et al. Pandemic Influenza A(H1N1) in New Zealand: the experience from April to August 2009. Euro Surveill. 2009;14:19319.PubMed Baker MG, Wilson N, Huang QS, Paine S, Lopez L, Bandaranayake D, et al. Pandemic Influenza A(H1N1) in New Zealand: the experience from April to August 2009. Euro Surveill. 2009;14:19319.PubMed
6.
go back to reference Khan NF, Harrison SE, Rose PW. Validity of diagnostic coding within the General Practice Research Database: a systematic review. Br J Gen Pract. 2010;60:199–206. Khan NF, Harrison SE, Rose PW. Validity of diagnostic coding within the General Practice Research Database: a systematic review. Br J Gen Pract. 2010;60:199–206.
7.
go back to reference Lawrenson R, Williams T, Farmer R. Clinical information for research; the use of general practice databases. J Public Health Med. 1999;21:299–304.CrossRefPubMed Lawrenson R, Williams T, Farmer R. Clinical information for research; the use of general practice databases. J Public Health Med. 1999;21:299–304.CrossRefPubMed
8.
go back to reference Jordan K, Porcheret M, Croft P. Quality of morbidity coding in general practice computerized medical records: a systematic review. Fam Pract. 2004;21:396–412.CrossRefPubMed Jordan K, Porcheret M, Croft P. Quality of morbidity coding in general practice computerized medical records: a systematic review. Fam Pract. 2004;21:396–412.CrossRefPubMed
9.
go back to reference Friedman C, Hripcsak G. Natural language processing and its future in medicine. AAMC Acad Med J Assoc Am Med Coll. 1999;74:890–5.CrossRef Friedman C, Hripcsak G. Natural language processing and its future in medicine. AAMC Acad Med J Assoc Am Med Coll. 1999;74:890–5.CrossRef
10.
go back to reference Haas SW, Travers D, Waller A, Mahalingam D, Crouch J, Schwartz TA, et al. Emergency medical text classifier: new system improves processing and classification of triage notes. Online J Public Health Inform. 2014;6:e178.CrossRefPubMedPubMedCentral Haas SW, Travers D, Waller A, Mahalingam D, Crouch J, Schwartz TA, et al. Emergency medical text classifier: new system improves processing and classification of triage notes. Online J Public Health Inform. 2014;6:e178.CrossRefPubMedPubMedCentral
11.
go back to reference Mendonca EA, Haas J, Shagina L, Larson E, Friedman C. Extracting information on pneumonia in infants using natrual language processing of radiology reports. J Biomed Inform. 2005;38:314–21.CrossRefPubMed Mendonca EA, Haas J, Shagina L, Larson E, Friedman C. Extracting information on pneumonia in infants using natrual language processing of radiology reports. J Biomed Inform. 2005;38:314–21.CrossRefPubMed
12.
go back to reference Koopman B, Karimi S, Nguyen A, McGuire R, Muscatello D, Kemp M, et al. Automatic classification of diseases from free-text death certificates for real-time surveillance. BMC Med Inform Decis Mak. 2015;15:53.CrossRefPubMedPubMedCentral Koopman B, Karimi S, Nguyen A, McGuire R, Muscatello D, Kemp M, et al. Automatic classification of diseases from free-text death certificates for real-time surveillance. BMC Med Inform Decis Mak. 2015;15:53.CrossRefPubMedPubMedCentral
13.
go back to reference Chapman WW, Christensen LM, Wagner MM, Haug PJ, Ivanov O, Dowling JN, et al. Classifying free-text triage chief complaints into syndromic categories with natural language processing. Artif Intell Med. 2005;33:1–10.CrossRef Chapman WW, Christensen LM, Wagner MM, Haug PJ, Ivanov O, Dowling JN, et al. Classifying free-text triage chief complaints into syndromic categories with natural language processing. Artif Intell Med. 2005;33:1–10.CrossRef
14.
go back to reference Chapman WW, Fiszma M, Dowling J, Chapman B, Rindflesh T. Identifying Respiratory Findings in Emergency Department Reports for Bio surveillance using MetaMap. Stud Health Technol Inform. 2004;107:487–91.PubMed Chapman WW, Fiszma M, Dowling J, Chapman B, Rindflesh T. Identifying Respiratory Findings in Emergency Department Reports for Bio surveillance using MetaMap. Stud Health Technol Inform. 2004;107:487–91.PubMed
15.
go back to reference Chapman WW, Fizman M, Chapman B, Haug P. A comparison of classification algorithms to automatically identify chest X-ray reports that support pneumonia. J Biomed Inform. 2001;34:4–14.CrossRefPubMed Chapman WW, Fizman M, Chapman B, Haug P. A comparison of classification algorithms to automatically identify chest X-ray reports that support pneumonia. J Biomed Inform. 2001;34:4–14.CrossRefPubMed
16.
go back to reference Chapman WW, Chu D, Dowling JN. ConText: an algorithm for identifying contextual features from clinical text. In: Proceedings of the Workshop on BioNLP 2007: Biological, Translational, and Clinical Language Processing. Stroudsburg, PA, USA: Association for Computational Linguistics; 2007. p. 81–8 [BioNLP’07].CrossRef Chapman WW, Chu D, Dowling JN. ConText: an algorithm for identifying contextual features from clinical text. In: Proceedings of the Workshop on BioNLP 2007: Biological, Translational, and Clinical Language Processing. Stroudsburg, PA, USA: Association for Computational Linguistics; 2007. p. 81–8 [BioNLP’07].CrossRef
17.
go back to reference Elkin PL, Brown SH, Bauer BA, Husser CS, Carruth W, Bergstrom LR, et al. A controlled trial of automated classification or negation from clinical notes. BMC Med Inform Decis Mak. 2005;5. Elkin PL, Brown SH, Bauer BA, Husser CS, Carruth W, Bergstrom LR, et al. A controlled trial of automated classification or negation from clinical notes. BMC Med Inform Decis Mak. 2005;5.
18.
go back to reference Hripcsak G, Friedman C, Alderson PO, DuMouchel W, Johnson SB, Layton PD. Unlocking clinical data from narrative reports: a study of natural language processing. Ann Intern Med. 1995;122:681–8.CrossRefPubMed Hripcsak G, Friedman C, Alderson PO, DuMouchel W, Johnson SB, Layton PD. Unlocking clinical data from narrative reports: a study of natural language processing. Ann Intern Med. 1995;122:681–8.CrossRefPubMed
19.
go back to reference Xu H, Stetson PD, Friedman C. A Study of Abbreviations in Clinical Notes. AMIA Annu Symp Proc. 2007:821–825 Xu H, Stetson PD, Friedman C. A Study of Abbreviations in Clinical Notes. AMIA Annu Symp Proc. 2007:821–825
20.
go back to reference Travers D, Lich KH, Lippmann SJ, Yeatts KB, Liao W, Waller A. Defining Emergency Department Asthma Visits for Public Health Surveillance, North Carolina, 2008–2009. Prev Chronic Dis. 2014;11:2008–9.CrossRef Travers D, Lich KH, Lippmann SJ, Yeatts KB, Liao W, Waller A. Defining Emergency Department Asthma Visits for Public Health Surveillance, North Carolina, 2008–2009. Prev Chronic Dis. 2014;11:2008–9.CrossRef
21.
go back to reference South BR, Chapman WW, Delisle S, Shen S, Kalp E, Perl T, et al. Optimizing A syndromic surveillance text classifier for influenza-like illness: Does document source matter? AMIA Annu Symp Proc 2008:692–696 South BR, Chapman WW, Delisle S, Shen S, Kalp E, Perl T, et al. Optimizing A syndromic surveillance text classifier for influenza-like illness: Does document source matter? AMIA Annu Symp Proc 2008:692–696
22.
go back to reference Barrows RC, Busuioc M, Friedman C. Limited Parsing of Notational Text Visit Notes: Ad-hoc vs. NLP Approaches. Proc AMIA Symp. 2000:51–55 Barrows RC, Busuioc M, Friedman C. Limited Parsing of Notational Text Visit Notes: Ad-hoc vs. NLP Approaches. Proc AMIA Symp. 2000:51–55
23.
go back to reference Payne L, Kühlmann-Berenzon S, Ekdahl K, Giesecke J, Högberg L, Penttinen P. “Did you have flu last week?” A telephone survey to estimate a point prevalence of influenza in the Swedish population. Euro Surveill. 2005;10:241–4.PubMed Payne L, Kühlmann-Berenzon S, Ekdahl K, Giesecke J, Högberg L, Penttinen P. “Did you have flu last week?” A telephone survey to estimate a point prevalence of influenza in the Swedish population. Euro Surveill. 2005;10:241–4.PubMed
24.
go back to reference Miller E, Hoschler K, Hardelid P, Stanford E, Andrews N, Zmabon M, et al. Incidence of 2009 pandemic influenza A H1N1 infection in England: a cross-sectional serological study. Lancet. 2010;375:1100–8.CrossRefPubMed Miller E, Hoschler K, Hardelid P, Stanford E, Andrews N, Zmabon M, et al. Incidence of 2009 pandemic influenza A H1N1 infection in England: a cross-sectional serological study. Lancet. 2010;375:1100–8.CrossRefPubMed
25.
go back to reference Collett D. Modelling Binary Data. Boca Raton Florida: Chapman & Hall/CRC; 1999. Collett D. Modelling Binary Data. Boca Raton Florida: Chapman & Hall/CRC; 1999.
26.
go back to reference Wilcox A, Hripcsak G. Classification Algorithms Applied to Narrative Reports. Proc AMIA Symp. 1999:455–459 Wilcox A, Hripcsak G. Classification Algorithms Applied to Narrative Reports. Proc AMIA Symp. 1999:455–459
27.
go back to reference Chapman WW, Bridewell W, Hanbury P, Cooper GF, Buchanan BG. A simple algorithm for identifying negated finds and disease in discharge summaries. J Biomed Inform. 2001;34:301–10.CrossRefPubMed Chapman WW, Bridewell W, Hanbury P, Cooper GF, Buchanan BG. A simple algorithm for identifying negated finds and disease in discharge summaries. J Biomed Inform. 2001;34:301–10.CrossRefPubMed
28.
go back to reference Chapman WW, Bridewell W, Handbury P, Cooper GF, Buchanan BG. Evaluation of Negation Phrases in Narrative Clinical Reports. Proc AMIA Symp. 2001:105–109 Chapman WW, Bridewell W, Handbury P, Cooper GF, Buchanan BG. Evaluation of Negation Phrases in Narrative Clinical Reports. Proc AMIA Symp. 2001:105–109
29.
go back to reference WHO-Europe. WHO Regional Office for Europe Guidance for Influenza Surveillance in Humans. 2009. WHO-Europe. WHO Regional Office for Europe Guidance for Influenza Surveillance in Humans. 2009.
30.
go back to reference Ruch P, Baud R, Geissbuhler A. Evaluating and reducing the effect of data corruption when applying bag of words approaches to medical records. Int J Med Inform. 2002;67:75–83.CrossRefPubMed Ruch P, Baud R, Geissbuhler A. Evaluating and reducing the effect of data corruption when applying bag of words approaches to medical records. Int J Med Inform. 2002;67:75–83.CrossRefPubMed
31.
32.
go back to reference Wagner MM, Tsui F-CC, Espino JU, Dato VM, Sittig DF, Caruana RA, et al. The emerging science of very early detection of disease outbreaks. J Public Heal Manag Pract. 2001;7:51–9.CrossRef Wagner MM, Tsui F-CC, Espino JU, Dato VM, Sittig DF, Caruana RA, et al. The emerging science of very early detection of disease outbreaks. J Public Heal Manag Pract. 2001;7:51–9.CrossRef
33.
go back to reference Hashimoto S, Murakami Y, Taniguchi K, Nagai M. Detection of epidemics in their early stage through inectious disease surveillance. Int J Epidemiol. 2000;29:905–10.CrossRefPubMed Hashimoto S, Murakami Y, Taniguchi K, Nagai M. Detection of epidemics in their early stage through inectious disease surveillance. Int J Epidemiol. 2000;29:905–10.CrossRefPubMed
34.
go back to reference De Lusignan S, van Weel C. The use of routinely collected computer data for research in primary care: opportunities and challenges. Fam Pract. 2006;23:253–63.CrossRefPubMed De Lusignan S, van Weel C. The use of routinely collected computer data for research in primary care: opportunities and challenges. Fam Pract. 2006;23:253–63.CrossRefPubMed
Metadata
Title
Identifying influenza-like illness presentation from unstructured general practice clinical narrative using a text classifier rule-based expert system versus a clinical expert
Authors
Jayden MacRae
Tom Love
Michael G. Baker
Anthony Dowell
Matthew Carnachan
Maria Stubbe
Lynn McBain
Publication date
01-12-2015
Publisher
BioMed Central
Published in
BMC Medical Informatics and Decision Making / Issue 1/2015
Electronic ISSN: 1472-6947
DOI
https://doi.org/10.1186/s12911-015-0201-3

Other articles of this Issue 1/2015

BMC Medical Informatics and Decision Making 1/2015 Go to the issue