Skip to main content
Top
Published in: Drug Safety 7/2013

01-07-2013 | Original Research Article

Application of Information Retrieval Approaches to Case Classification in the Vaccine Adverse Event Reporting System

Authors: Taxiarchis Botsis, Emily Jane Woo, Robert Ball

Published in: Drug Safety | Issue 7/2013

Login to get access

Abstract

Background

Automating the classification of adverse event reports is an important step to improve the efficiency of vaccine safety surveillance. Previously we showed it was possible to classify reports using features extracted from the text of the reports.

Objective

The aim of this study was to use the information encoded in the Medical Dictionary for Regulatory Activities (MedDRA®) in the US Vaccine Adverse Event Reporting System (VAERS) to support and evaluate two classification approaches: a multiple information retrieval strategy and a rule-based approach. To evaluate the performance of these approaches, we selected the conditions of anaphylaxis and Guillain–Barré syndrome (GBS).

Methods

We used MedDRA® Preferred Terms stored in the VAERS, and two standardized medical terminologies: the Brighton Collaboration (BC) case definitions and Standardized MedDRA® Queries (SMQ) to classify two sets of reports for GBS and anaphylaxis. Two approaches were used: (i) the rule-based instruments that are available by the two terminologies (the Automatic Brighton Classification [ABC] tool and the SMQ algorithms); and (ii) the vector space model.

Results

We found that the rule-based instruments, particularly the SMQ algorithms, achieved a high degree of specificity; however, there was a cost in terms of sensitivity in all but the narrow GBS SMQ algorithm that outperformed the remaining approaches (sensitivity in the testing set was equal to 99.06 % for this algorithm vs. 93.40 % for the vector space model). In the case of anaphylaxis, the vector space model achieved higher sensitivity compared with the best values of both the ABC tool and the SMQ algorithms in the testing set (86.44 % vs. 64.11 % and 52.54 %, respectively).

Conclusions

Our results showed the superiority of the vector space model over the existing rule-based approaches irrespective of the standardized medical knowledge represented by either the SMQ or the BC case definition. The vector space model might make automation of case definitions for spontaneous report review more efficient than current rule-based approaches, allowing more time for critical assessment and decision making by pharmacovigilance experts.
Appendix
Available only for authorised users
Footnotes
1
MedDRA® terminology is the international medical terminology developed under the auspices of the International Conference on Harmonization of Technical Requirements for Registration of Pharmaceuticals for Human Use (ICH).
 
Literature
1.
go back to reference Varricchio F, Iskander J, Destefano F, Ball R, Pless R, Braun MM, et al. Understanding vaccine safety information from the vaccine adverse event reporting system. Pediatr Infect Dis J. 2004;23(4):287–94.PubMedCrossRef Varricchio F, Iskander J, Destefano F, Ball R, Pless R, Braun MM, et al. Understanding vaccine safety information from the vaccine adverse event reporting system. Pediatr Infect Dis J. 2004;23(4):287–94.PubMedCrossRef
2.
go back to reference Manning CD, Raghavan P, Schutze H. Introduction to information retrieval. 1st ed. Cambridge: Cambridge University Press; 2008.CrossRef Manning CD, Raghavan P, Schutze H. Introduction to information retrieval. 1st ed. Cambridge: Cambridge University Press; 2008.CrossRef
3.
go back to reference Manning CD, Schutze H. Foundations of statistical natural language processing. 1st ed. Cambridge: MIT Press; 1999. Manning CD, Schutze H. Foundations of statistical natural language processing. 1st ed. Cambridge: MIT Press; 1999.
4.
go back to reference Brown EG, Wood L, Wood S. The medical dictionary for regulatory activities (MedDRA). Drug Saf. 1999;20(2):109–17.PubMedCrossRef Brown EG, Wood L, Wood S. The medical dictionary for regulatory activities (MedDRA). Drug Saf. 1999;20(2):109–17.PubMedCrossRef
5.
go back to reference Bonhoeffer J, Kohl K, Chen R, Duclos P, Heijbel H, Heininger U, et al. The Brighton Collaboration: addressing the need for standardized case definitions of adverse events following immunization (AEFI). Vaccine. 2002;21(3–4):298–302.PubMedCrossRef Bonhoeffer J, Kohl K, Chen R, Duclos P, Heijbel H, Heininger U, et al. The Brighton Collaboration: addressing the need for standardized case definitions of adverse events following immunization (AEFI). Vaccine. 2002;21(3–4):298–302.PubMedCrossRef
6.
go back to reference Humphreys BL, Lindberg DAB, Schoolman HM, Barnett GO. The unified medical language system. J Am Med Inform Assoc. 1998;5(1):1–11.PubMedCrossRef Humphreys BL, Lindberg DAB, Schoolman HM, Barnett GO. The unified medical language system. J Am Med Inform Assoc. 1998;5(1):1–11.PubMedCrossRef
7.
go back to reference Liu H, Hu ZZ, Zhang J, Wu C. BioThesaurus: a web-based thesaurus of protein and gene names. Bioinformatics. 2006;22(1):103–5.PubMedCrossRef Liu H, Hu ZZ, Zhang J, Wu C. BioThesaurus: a web-based thesaurus of protein and gene names. Bioinformatics. 2006;22(1):103–5.PubMedCrossRef
8.
go back to reference Thompson P, McNaught J, Montemagni S, Calzolari N, Del Gratta R, Lee V, et al. The BioLexicon: a large-scale terminological resource for biomedical text mining. BMC Bioinformatics. 2011;12(1):397.PubMedCrossRef Thompson P, McNaught J, Montemagni S, Calzolari N, Del Gratta R, Lee V, et al. The BioLexicon: a large-scale terminological resource for biomedical text mining. BMC Bioinformatics. 2011;12(1):397.PubMedCrossRef
9.
go back to reference Botsis T, Nguyen MD, Woo EJ, Markatou M, Ball R. Text mining for the Vaccine Adverse Event Reporting System: medical text classification using informative feature selection. J Am Med Inform Assoc. 2011;18(5):631–8.PubMedCrossRef Botsis T, Nguyen MD, Woo EJ, Markatou M, Ball R. Text mining for the Vaccine Adverse Event Reporting System: medical text classification using informative feature selection. J Am Med Inform Assoc. 2011;18(5):631–8.PubMedCrossRef
10.
go back to reference Pedersen T, Pakhomov SV, Patwardhan S, Chute CG. Measures of semantic similarity and relatedness in the biomedical domain. J Biomed Inform. 2007;40(3):288–99.PubMedCrossRef Pedersen T, Pakhomov SV, Patwardhan S, Chute CG. Measures of semantic similarity and relatedness in the biomedical domain. J Biomed Inform. 2007;40(3):288–99.PubMedCrossRef
11.
go back to reference Lin D. An information-theoretic definition of similarity. In: Proceedings of 15th international conference on machine learning. San Francisco: Morgan Kaufmann Publishers Inc.; 1998. p. 296–304. Lin D. An information-theoretic definition of similarity. In: Proceedings of 15th international conference on machine learning. San Francisco: Morgan Kaufmann Publishers Inc.; 1998. p. 296–304.
12.
go back to reference Cao H, Melton GB, Markatou M, Hripcsak G. Use abstracted patient-specific features to assist an information-theoretic measurement to assess similarity between medical cases. J Biomed Inform. 2008;41(6):882–8.PubMedCrossRef Cao H, Melton GB, Markatou M, Hripcsak G. Use abstracted patient-specific features to assist an information-theoretic measurement to assess similarity between medical cases. J Biomed Inform. 2008;41(6):882–8.PubMedCrossRef
13.
go back to reference Markatou M, Kuruppumullage-Don P, Hu J, Wang F, Sun J, Sorrentino R, et al. Case-based reasoning in comparative effectiveness research. IBM J Res Dev. 2012;56:5.CrossRef Markatou M, Kuruppumullage-Don P, Hu J, Wang F, Sun J, Sorrentino R, et al. Case-based reasoning in comparative effectiveness research. IBM J Res Dev. 2012;56:5.CrossRef
14.
go back to reference Botsis T, Buttolph T, Nguyen MD, Winiecki S, Woo EJ, Ball R. Vaccine Adverse Event Text Mining (VaeTM) system for extracting features from vaccine safety reports. J Am Med Inform Assoc. 2012;19(6):1011–8.PubMedCrossRef Botsis T, Buttolph T, Nguyen MD, Winiecki S, Woo EJ, Ball R. Vaccine Adverse Event Text Mining (VaeTM) system for extracting features from vaccine safety reports. J Am Med Inform Assoc. 2012;19(6):1011–8.PubMedCrossRef
15.
16.
go back to reference Mozzicato P. Standardised MedDRA queries: their role in signal detection. Drug Saf. 2007;30(7):617–9.PubMedCrossRef Mozzicato P. Standardised MedDRA queries: their role in signal detection. Drug Saf. 2007;30(7):617–9.PubMedCrossRef
17.
go back to reference Ruggeberg JU, Gold MS, Bayas JM, Blum MD, Bonhoeffer J, Friedlander S, et al. Anaphylaxis: case definition and guidelines for data collection, analysis, and presentation of immunization safety data. Vaccine. 2007;25(31):5675–84.PubMedCrossRef Ruggeberg JU, Gold MS, Bayas JM, Blum MD, Bonhoeffer J, Friedlander S, et al. Anaphylaxis: case definition and guidelines for data collection, analysis, and presentation of immunization safety data. Vaccine. 2007;25(31):5675–84.PubMedCrossRef
18.
go back to reference Sejvar JJ, Kohl KS, Gidudu J, Amato A, Bakshi N, Baxter R, et al. Guillain–Barré syndrome and Fisher syndrome: case definitions and guidelines for collection, analysis, and presentation of immunization safety data. Vaccine. 2011;29(3):599–612.PubMedCrossRef Sejvar JJ, Kohl KS, Gidudu J, Amato A, Bakshi N, Baxter R, et al. Guillain–Barré syndrome and Fisher syndrome: case definitions and guidelines for collection, analysis, and presentation of immunization safety data. Vaccine. 2011;29(3):599–612.PubMedCrossRef
19.
go back to reference MedDRA Maintenance and Support Services Organization. Introductory guide for standardised MedDRA queries (SMQs) Version 14.1. Chantily: MedDRA; 2011. MedDRA Maintenance and Support Services Organization. Introductory guide for standardised MedDRA queries (SMQs) Version 14.1. Chantily: MedDRA; 2011.
Metadata
Title
Application of Information Retrieval Approaches to Case Classification in the Vaccine Adverse Event Reporting System
Authors
Taxiarchis Botsis
Emily Jane Woo
Robert Ball
Publication date
01-07-2013
Publisher
Springer International Publishing AG
Published in
Drug Safety / Issue 7/2013
Print ISSN: 0114-5916
Electronic ISSN: 1179-1942
DOI
https://doi.org/10.1007/s40264-013-0064-4

Other articles of this Issue 7/2013

Drug Safety 7/2013 Go to the issue