Published in: BMC Medical Informatics and Decision Making 1/2012

Open Access 01-12-2012 | Research article

Identification of methicillin-resistant Staphylococcus aureus within the Nation’s Veterans Affairs Medical Centers using natural language processing

Authors: Makoto Jones, Scott L DuVall, Joshua Spuhl, Matthew H Samore, Christopher Nielson, Michael Rubin


Abstract

Background

Accurate information is needed to direct healthcare systems’ efforts to control methicillin-resistant Staphylococcus aureus (MRSA). Assembling complete and correct microbiology data is vital to understanding and addressing the multidrug-resistant organisms in our hospitals.

Methods

We describe a system that securely gathers microbiology data from the Department of Veterans Affairs (VA) network of databases. Using natural language processing methods, we extracted organisms and susceptibilities from the free-text data. We then validated the extraction against independently derived electronic data and expert annotation.
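
To make the extraction step concrete, the following is a minimal sketch of pulling an organism and a susceptibility result out of a free-text microbiology report. It is not the authors' system: the report layout, the regular expressions, and the function names `extract_findings` and `is_mrsa` are all assumptions made for illustration, and a production pipeline would need to handle far more variation in wording and layout.

```python
import re

# Hypothetical patterns for illustration; real VA reports vary widely.
# Matches "Staphylococcus aureus", "Staph aureus", "Staph. aureus".
ORGANISM_RE = re.compile(r"\bstaph(?:ylococcus)?\.?\s+aureus\b", re.IGNORECASE)

# Matches a drug name followed by a result, e.g. "OXACILLIN   R"
# or "oxacillin: resistant".
DRUG_RESULT_RE = re.compile(
    r"\b(oxacillin|methicillin)\b\s*[:=-]?\s*"
    r"(resistant|susceptible|intermediate|[RSI])\b",
    re.IGNORECASE,
)


def extract_findings(report_text):
    """Return (organism_found, {drug: 'R' | 'S' | 'I'}) for one report."""
    organism_found = ORGANISM_RE.search(report_text) is not None
    results = {}
    for drug, result in DRUG_RESULT_RE.findall(report_text):
        # Normalise "resistant"/"R" to "R", "susceptible"/"S" to "S", etc.
        results[drug.lower()] = result[0].upper()
    return organism_found, results


def is_mrsa(report_text):
    """Flag a report as MRSA: S. aureus named and a resistant result present."""
    organism_found, results = extract_findings(report_text)
    return organism_found and "R" in results.values()


if __name__ == "__main__":
    sample = "CULTURE: STAPHYLOCOCCUS AUREUS\nOXACILLIN    R\nVANCOMYCIN   S\n"
    print(extract_findings(sample))
    print(is_mrsa(sample))
```

A rule-based approach like this is easy to audit against expert annotation, which is one reason it is a common starting point; it would miss reports that express resistance in ways the patterns do not anticipate.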

Results

We estimate that the collected microbiology data are 98.5% complete and that methicillin-resistant Staphylococcus aureus was extracted accurately 99.7% of the time.

Conclusions

Applying natural language processing methods to microbiology records appears to be a promising way to extract accurate and useful nosocomial pathogen surveillance data. Both scientific inquiry and the data’s reliability will depend on the surveillance system’s ability to compare data from multiple sources and circumvent systematic error. The dataset constructed and methods used for this investigation could contribute to a comprehensive infectious disease surveillance system or other pressing needs.
Metadata
Title
Identification of methicillin-resistant Staphylococcus aureus within the Nation’s Veterans Affairs Medical Centers using natural language processing
Authors
Makoto Jones
Scott L DuVall
Joshua Spuhl
Matthew H Samore
Christopher Nielson
Michael Rubin
Publication date
01-12-2012
Publisher
BioMed Central
Published in
BMC Medical Informatics and Decision Making / Issue 1/2012
Electronic ISSN: 1472-6947
DOI
https://doi.org/10.1186/1472-6947-12-34
