Skip to main content
Top
Published in: BMC Medical Informatics and Decision Making 1/2023

Open Access 01-12-2023 | Research

Negation recognition in clinical natural language processing using a combination of the NegEx algorithm and a convolutional neural network

Authors: Guillermo Argüello-González, José Aquino-Esperanza, Daniel Salvador, Rosa Bretón-Romero, Carlos Del Río-Bermudez, Jorge Tello, Sebastian Menke

Published in: BMC Medical Informatics and Decision Making | Issue 1/2023

Login to get access

Abstract

Background

Important clinical information of patients is present in unstructured free-text fields of Electronic Health Records (EHRs). While this information can be extracted using clinical Natural Language Processing (cNLP), the recognition of negation modifiers represents an important challenge. A wide range of cNLP applications have been developed to detect the negation of medical entities in clinical free-text, however, effective solutions for languages other than English are scarce. This study aimed at developing a solution for negation recognition in Spanish EHRs based on a combination of a customized rule-based NegEx layer and a convolutional neural network (CNN).

Methods

Based on our previous experience in real world evidence (RWE) studies using information embedded in EHRs, negation recognition was simplified into a binary problem (‘affirmative’ vs. ‘non-affirmative’ class). For the NegEx layer, negation rules were obtained from a publicly available Spanish corpus and enriched with custom ones, whereby the CNN binary classifier was trained on EHRs annotated for clinical named entities (cNEs) and negation markers by medical doctors.

Results

The proposed negation recognition pipeline obtained precision, recall, and F1-score of 0.93, 0.94, and 0.94 for the ‘affirmative’ class, and 0.86, 0.84, and 0.85 for the ‘non-affirmative’ class, respectively. To validate the generalization capabilities of our methodology, we applied the negation recognition pipeline on EHRs (6,710 cNEs) from a different data source distribution than the training corpus and obtained consistent performance metrics for the ‘affirmative’ and ‘non-affirmative’ class (0.95, 0.97, and 0.96; and 0.90, 0.83, and 0.86 for precision, recall, and F1-score, respectively). Lastly, we evaluated the pipeline against two publicly available Spanish negation corpora, the IULA and NUBes, obtaining state-of-the-art metrics (1.00, 0.99, and 0.99; and 1.00, 0.93, and 0.96 for precision, recall, and F1-score, respectively).

Conclusion

Negation recognition is a source of low precision in the retrieval of cNEs from EHRs’ free-text. Combining a customized rule-based NegEx layer with a CNN binary classifier outperformed many other current approaches. RWE studies highly benefit from the correct recognition of negation as it reduces false positive detections of cNE which otherwise would undoubtedly reduce the credibility of cNLP systems.
Appendix
Available only for authorised users
Literature
1.
go back to reference Katkade VB, Sanders KN, Zou KH. Real world data: an opportunity to supplement existing evidence for the use of long-established medicines in health care decision making. J Multidiscip Healthc. 2018;11:295–304.CrossRefPubMedPubMedCentral Katkade VB, Sanders KN, Zou KH. Real world data: an opportunity to supplement existing evidence for the use of long-established medicines in health care decision making. J Multidiscip Healthc. 2018;11:295–304.CrossRefPubMedPubMedCentral
4.
go back to reference Sorin V, Barash Y, Konen E, Klang E. Deep-learning natural language processing for oncological applications. Lancet Oncol. 2020;21(12):1553–6.CrossRefPubMed Sorin V, Barash Y, Konen E, Klang E. Deep-learning natural language processing for oncological applications. Lancet Oncol. 2020;21(12):1553–6.CrossRefPubMed
5.
go back to reference Wu S, Miller T, Masanz J, Coarr M, Halgrim S, Carrell D. Negation’s not solved: Generalizability Versus Optimizability in Clinical Natural Language Processing. PLoS ONE. 2014;9(11):e112774. Wu S, Miller T, Masanz J, Coarr M, Halgrim S, Carrell D. Negation’s not solved: Generalizability Versus Optimizability in Clinical Natural Language Processing. PLoS ONE. 2014;9(11):e112774.
6.
go back to reference Mahany A, Khaled H, Elmitwally NS, Aljohani N, Ghoniemy S. Negation and speculation in NLP: a Survey, Corpora, methods, and applications. Appl Sci. 2022;12(10):5209. Mahany A, Khaled H, Elmitwally NS, Aljohani N, Ghoniemy S. Negation and speculation in NLP: a Survey, Corpora, methods, and applications. Appl Sci. 2022;12(10):5209.
7.
go back to reference Mehrabi S, Krishnan A, Sohn S, Roch AM, Schmidt H, Kesterson J. DEEPEN: a negation detection system for clinical text incorporating dependency relation into NegEx. J Biomed Inform. 2015;54:213–9.CrossRefPubMedPubMedCentral Mehrabi S, Krishnan A, Sohn S, Roch AM, Schmidt H, Kesterson J. DEEPEN: a negation detection system for clinical text incorporating dependency relation into NegEx. J Biomed Inform. 2015;54:213–9.CrossRefPubMedPubMedCentral
8.
go back to reference Costumero R, Lopez F, Gonzalo-Martín C, Millan M, Menasalvas E. An Approach to detect negation on medical documents in spanish. In: Ślezak D, Tan AH, Peters JF, Schwabe L, editors. Brain Informatics and Health. Cham: Springer International Publishing; 2014. pp. 366–75. (Lecture Notes in Computer Science).CrossRef Costumero R, Lopez F, Gonzalo-Martín C, Millan M, Menasalvas E. An Approach to detect negation on medical documents in spanish. In: Ślezak D, Tan AH, Peters JF, Schwabe L, editors. Brain Informatics and Health. Cham: Springer International Publishing; 2014. pp. 366–75. (Lecture Notes in Computer Science).CrossRef
9.
go back to reference Deléger L, Grouin C. Detecting negation of medical problems in French clinical notes. In: Proceedings of the 2nd ACM SIGHIT International Health Informatics Symposium [Internet]. New York, NY, USA: Association for Computing Machinery; 2012 [cited 2022 Apr 29]. p. 697–702. (IHI ’12). Available from: https://doi.org/10.1145/2110363.2110443. Deléger L, Grouin C. Detecting negation of medical problems in French clinical notes. In: Proceedings of the 2nd ACM SIGHIT International Health Informatics Symposium [Internet]. New York, NY, USA: Association for Computing Machinery; 2012 [cited 2022 Apr 29]. p. 697–702. (IHI ’12). Available from: https://​doi.​org/​10.​1145/​2110363.​2110443.
10.
go back to reference Cotik V, Roller R, Xu F, Uszkoreit H, Budde K, Schmidt D. Negation Detection in Clinical Reports Written in German. In: Proceedings of the Fifth Workshop on Building and Evaluating Resources for Biomedical Text Mining (BioTxtM2016)[Internet]. Osaka, Japan: The COLING2016 Organizing Committee; 2016 [cited2022Apr29]. p.115–24. Available from: https://aclanthology.org/W16–5113. Cotik V, Roller R, Xu F, Uszkoreit H, Budde K, Schmidt D. Negation Detection in Clinical Reports Written in German. In: Proceedings of the Fifth Workshop on Building and Evaluating Resources for Biomedical Text Mining (BioTxtM2016)[Internet]. Osaka, Japan: The COLING2016 Organizing Committee; 2016 [cited2022Apr29]. p.115–24. Available from: https://​aclanthology.​org/​W16–5113.
11.
go back to reference Skeppstedt M. Negation detection in swedish clinical text: an adaption of NegEx to swedish. J Biomed Semant. 2011;2(S3):1–12.CrossRef Skeppstedt M. Negation detection in swedish clinical text: an adaption of NegEx to swedish. J Biomed Semant. 2011;2(S3):1–12.CrossRef
12.
go back to reference Wu LT, Lin JR, Leng S, Li JL, Hu ZZ. Rule-based information extraction for mechanical-electrical-plumbing-specific semantic web. Autom Constr. 2022;135:104108.CrossRef Wu LT, Lin JR, Leng S, Li JL, Hu ZZ. Rule-based information extraction for mechanical-electrical-plumbing-specific semantic web. Autom Constr. 2022;135:104108.CrossRef
13.
go back to reference Kang T, Zhang S, Xu N, Wen D, Zhang X, Lei J. Detecting negation and scope in chinese clinical notes using character and word embedding. Comput Methods Programs Biomed. 2017;140:53–9.CrossRefPubMed Kang T, Zhang S, Xu N, Wen D, Zhang X, Lei J. Detecting negation and scope in chinese clinical notes using character and word embedding. Comput Methods Programs Biomed. 2017;140:53–9.CrossRefPubMed
14.
go back to reference Morante R, Daelemans W. A metalearning approach to processing the scope of negation. In: Proceedings of the Thirteenth Conference on Computational Natural Language Learning. USA: Association for Computational Linguistics; 2009. p.21–9. (CoNLL’09). Morante R, Daelemans W. A metalearning approach to processing the scope of negation. In: Proceedings of the Thirteenth Conference on Computational Natural Language Learning. USA: Association for Computational Linguistics; 2009. p.21–9. (CoNLL’09).
15.
go back to reference Fancellu F, Lopez A, Webber B. Neural Networks For Negation Scope Detection. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume1: LongPapers)[Internet]. Berlin,Germany: Association for Computational Linguistics; 2016 [cited2022Apr29]. p.495–504. Available from: https://doi.org/10.48550/arXiv.1706.03762. Fancellu F, Lopez A, Webber B. Neural Networks For Negation Scope Detection. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume1: LongPapers)[Internet]. Berlin,Germany: Association for Computational Linguistics; 2016 [cited2022Apr29]. p.495–504. Available from: https://​doi.​org/​10.​48550/​arXiv.​1706.​03762.
16.
go back to reference Chen L. Attention-based deep learning system for negation and assertion detection in clinical notes. Int J Artif Intell Appl. 2019;10(01):1–9. Chen L. Attention-based deep learning system for negation and assertion detection in clinical notes. Int J Artif Intell Appl. 2019;10(01):1–9.
17.
go back to reference Qian Z, Li P, Zhu Q, Zhou G, Luo Z, Luo W. Speculation and Negation Scope Detection via Convolutional Neural Networks. In: Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing [Internet]. Austin, Texas: Association for Computational Linguistics; 2016 [cited 2022 Jun 10]. p. 815–25. Available from: https://aclanthology.org/D16-1078. Qian Z, Li P, Zhu Q, Zhou G, Luo Z, Luo W. Speculation and Negation Scope Detection via Convolutional Neural Networks. In: Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing [Internet]. Austin, Texas: Association for Computational Linguistics; 2016 [cited 2022 Jun 10]. p. 815–25. Available from: https://​aclanthology.​org/​D16-1078.
18.
19.
go back to reference Santiso S, Pérez A, Casillas A, Oronoz M. Neural negated entity recognition in spanish electronic health records. J Biomed Inform. 2020;105:103419.CrossRefPubMed Santiso S, Pérez A, Casillas A, Oronoz M. Neural negated entity recognition in spanish electronic health records. J Biomed Inform. 2020;105:103419.CrossRefPubMed
20.
go back to reference Fabregat H, Duque A, Mart?nez-Romo J, Araujo L. Extending a Deep Learning Approach for Negation Cues Detection in Spanish. In: IberLEF@SEPLN. 2019. Fabregat H, Duque A, Mart?nez-Romo J, Araujo L. Extending a Deep Learning Approach for Negation Cues Detection in Spanish. In: IberLEF@SEPLN. 2019.
21.
go back to reference Fabregat H, Araujo L, Martínez-Romo J. Deep learning approach for negation trigger and scope recognition.Proces Leng Nat.2019. Fabregat H, Araujo L, Martínez-Romo J. Deep learning approach for negation trigger and scope recognition.Proces Leng Nat.2019.
22.
go back to reference Chapman WW, Bridewell W, Hanbury P, Cooper GF, Buchanan BG. A simple algorithm for identifying negated findings and diseases in discharge summaries. J Biomed Inform. 2001;34(5):301–10.CrossRefPubMed Chapman WW, Bridewell W, Hanbury P, Cooper GF, Buchanan BG. A simple algorithm for identifying negated findings and diseases in discharge summaries. J Biomed Inform. 2001;34(5):301–10.CrossRefPubMed
24.
go back to reference Lecun Y, Bottou L, Bengio Y, Haffner P. Gradient-based learning applied to document recognition. Proc IEEE. 1998;86(11):2278–324.CrossRef Lecun Y, Bottou L, Bengio Y, Haffner P. Gradient-based learning applied to document recognition. Proc IEEE. 1998;86(11):2278–324.CrossRef
27.
go back to reference Guo R, Zhao Y, Zou Q, Fang X, Peng S. Bioinformatics applications on Apache Spark. GigaScience. 2018 Aug 7;7(8):giy098. Guo R, Zhao Y, Zou Q, Fang X, Peng S. Bioinformatics applications on Apache Spark. GigaScience. 2018 Aug 7;7(8):giy098.
30.
go back to reference Zaharia M, Chen A, Davidson A, Ghodsi A, Hong SA, Konwinski A, et al. Accelerating the Machine Learning Lifecycle with MLflow. :7. Zaharia M, Chen A, Davidson A, Ghodsi A, Hong SA, Konwinski A, et al. Accelerating the Machine Learning Lifecycle with MLflow. :7.
32.
go back to reference Marimon M, Vivaldi J, Bel N. Annotation of negation in the IULA Spanish Clinical Record Corpus. In: Proceedings of the Workshop Computational Semantics Beyond Events and Roles [Internet]. Valencia, Spain: Association for Computational Linguistics; 2017 [cited 2022 Feb 10]. p. 43–52. Available from: https://aclanthology.org/W17-1807. Marimon M, Vivaldi J, Bel N. Annotation of negation in the IULA Spanish Clinical Record Corpus. In: Proceedings of the Workshop Computational Semantics Beyond Events and Roles [Internet]. Valencia, Spain: Association for Computational Linguistics; 2017 [cited 2022 Feb 10]. p. 43–52. Available from: https://​aclanthology.​org/​W17-1807.
33.
go back to reference Lima Lopez S, Perez N, Cuadros M, Rigau G. NUBes: A Corpus of Negation and Uncertainty in Spanish Clinical Texts. In: Proceedings of the 12th Language Resources and Evaluation Conference [Internet]. Marseille, France: European Language Resources Association; 2020 [cited 2022 Jun 10]. p. 5772–81. Available from: https://aclanthology.org/2020.lrec-1.708. Lima Lopez S, Perez N, Cuadros M, Rigau G. NUBes: A Corpus of Negation and Uncertainty in Spanish Clinical Texts. In: Proceedings of the 12th Language Resources and Evaluation Conference [Internet]. Marseille, France: European Language Resources Association; 2020 [cited 2022 Jun 10]. p. 5772–81. Available from: https://​aclanthology.​org/​2020.​lrec-1.​708.
34.
go back to reference Cohen KB, Demner-Fushman D. Biomedical Natural Language Processing. John Benjamins Publishing Company; 2014. p. 174. Cohen KB, Demner-Fushman D. Biomedical Natural Language Processing. John Benjamins Publishing Company; 2014. p. 174.
35.
go back to reference Vincze V, Szarvas G, Farkas R, Móra G, Csirik J. The BioScope corpus: biomedical texts annotated for uncertainty, negation and their scopes. BMC Bioinformatics. 2008;9(11):9.CrossRef Vincze V, Szarvas G, Farkas R, Móra G, Csirik J. The BioScope corpus: biomedical texts annotated for uncertainty, negation and their scopes. BMC Bioinformatics. 2008;9(11):9.CrossRef
36.
go back to reference Vincze V. Uncertainty Detection in Hungarian Texts. In: Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers [Internet]. Dublin, Ireland: Dublin City University and Association for Computational Linguistics; 2014 [cited 2022 Jun 10]. p. 1844–53. Available from: https://aclanthology.org/C14-1174. Vincze V. Uncertainty Detection in Hungarian Texts. In: Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers [Internet]. Dublin, Ireland: Dublin City University and Association for Computational Linguistics; 2014 [cited 2022 Jun 10]. p. 1844–53. Available from: https://​aclanthology.​org/​C14-1174.
37.
go back to reference Chapman WW, Hillert D, Velupillai S, Kvist M, Skeppstedt M, Chapman BE. Extending the NegEx lexicon for multiple languages. Stud Health Technol Inform. 2013;192:677–81.PubMedPubMedCentral Chapman WW, Hillert D, Velupillai S, Kvist M, Skeppstedt M, Chapman BE. Extending the NegEx lexicon for multiple languages. Stud Health Technol Inform. 2013;192:677–81.PubMedPubMedCentral
38.
go back to reference Lazib L, Qin B, Zhao Y, Zhang W, Liu T. A syntactic path-based hybrid neural network for negation scope detection. Front Comput Sci. 2020;14(1):84–94.CrossRef Lazib L, Qin B, Zhao Y, Zhang W, Liu T. A syntactic path-based hybrid neural network for negation scope detection. Front Comput Sci. 2020;14(1):84–94.CrossRef
39.
go back to reference Bhatia P, Busra Celikkaya E, Khalilia M. End-to-End Joint Entity Extraction and Negation Detection for Clinical Text. In: Shaban-Nejad A, Michalowski M, editors. Precision Health and Medicine: A Digital Revolution in Healthcare [Internet]. Cham: Springer International Publishing; 2020 [cited 2022 Jun 10]. p. 139–48. (Studies in Computational Intelligence). Available from: https://doi.org/10.1007/978-3-030-24409-5_13. Bhatia P, Busra Celikkaya E, Khalilia M. End-to-End Joint Entity Extraction and Negation Detection for Clinical Text. In: Shaban-Nejad A, Michalowski M, editors. Precision Health and Medicine: A Digital Revolution in Healthcare [Internet]. Cham: Springer International Publishing; 2020 [cited 2022 Jun 10]. p. 139–48. (Studies in Computational Intelligence). Available from: https://​doi.​org/​10.​1007/​978-3-030-24409-5_​13.
40.
go back to reference Rivera Zavala R, Martinez P. The impact of Pretrained Language Models on Negation and speculation detection in Cross-Lingual Medical text: comparative study. JMIR Med Inform. 2020;8(12):e18953.CrossRefPubMedPubMedCentral Rivera Zavala R, Martinez P. The impact of Pretrained Language Models on Negation and speculation detection in Cross-Lingual Medical text: comparative study. JMIR Med Inform. 2020;8(12):e18953.CrossRefPubMedPubMedCentral
41.
go back to reference Pabón OS, Montenegro O, Torrente M, González AR, Provencio M, Menasalvas E. Negation and uncertainty detection in clinical texts written in Spanish: a deep learning-based approach. PeerJ Comput Sci. 2022;8:e913.CrossRef Pabón OS, Montenegro O, Torrente M, González AR, Provencio M, Menasalvas E. Negation and uncertainty detection in clinical texts written in Spanish: a deep learning-based approach. PeerJ Comput Sci. 2022;8:e913.CrossRef
Metadata
Title
Negation recognition in clinical natural language processing using a combination of the NegEx algorithm and a convolutional neural network
Authors
Guillermo Argüello-González
José Aquino-Esperanza
Daniel Salvador
Rosa Bretón-Romero
Carlos Del Río-Bermudez
Jorge Tello
Sebastian Menke
Publication date
01-12-2023
Publisher
BioMed Central
Published in
BMC Medical Informatics and Decision Making / Issue 1/2023
Electronic ISSN: 1472-6947
DOI
https://doi.org/10.1186/s12911-023-02301-5

Other articles of this Issue 1/2023

BMC Medical Informatics and Decision Making 1/2023 Go to the issue