Skip to main content
Top
Published in: BMC Medical Informatics and Decision Making 1/2020

Open Access 01-12-2020 | Research article

Violence detection explanation via semantic roles embeddings

Authors: Enrico Mensa, Davide Colla, Marco Dalmasso, Marco Giustini, Carlo Mamo, Alessio Pitidis, Daniele P. Radicioni

Published in: BMC Medical Informatics and Decision Making | Issue 1/2020

Login to get access

Abstract

Background

Emergency room reports pose specific challenges to natural language processing techniques. In this setting, violence episodes on women, elderly and children are often under-reported. Categorizing textual descriptions as containing violence-related injuries (V) vs. non-violence-related injuries (NV) is thus a relevant task to the ends of devising alerting mechanisms to track (and prevent) violence episodes.

Methods

We present ViDeS (so dubbed after Violence Detection System), a system to detect episodes of violence from narrative texts in emergency room reports. It employs a deep neural network for categorizing textual ER reports data, and complements such output by making explicit which elements corroborate the interpretation of the record as reporting about violence-related injuries. To these ends we designed a novel hybrid technique for filling semantic frames that employs distributed representations of terms herein, along with syntactic and semantic information. The system has been validated on real data annotated with two sorts of information: about the presence vs. absence of violence-related injuries, and about some semantic roles that can be interpreted as major cues for violent episodes, such as the agent that committed violence, the victim, the body district involved, etc.. The employed dataset contains over 150K records annotated with class (V,NV) information, and 200 records with finer-grained information on the aforementioned semantic roles.

Results

We used data coming from an Italian branch of the EU-Injury Database (EU-IDB) project, compiled by hospital staff. Categorization figures approach full precision and recall for negative cases and.97 precision and.94 recall on positive cases. As regards as the recognition of semantic roles, we recorded an accuracy varying from.28 to.90 according to the semantic roles involved. Moreover, the system allowed unveiling annotation errors committed by hospital staff.

Conclusions

Explaining systems’ results, so to make their output more comprehensible and convincing, is today necessary for AI systems. Our proposal is to combine distributed and symbolic (frame-like) representations as a possible answer to such pressing request for interpretability. Although presently focused on the medical domain, the proposed methodology is general and, in principle, it can be extended to further application areas and categorization tasks.
Literature
1.
go back to reference Moulin B, Irandoust H, Bélanger M, Desbordes G. Explanation and argumentation capabilities: Towards the creation of more persuasive agents. Artif Intell Rev. 2002; 17(3):169–222.CrossRef Moulin B, Irandoust H, Bélanger M, Desbordes G. Explanation and argumentation capabilities: Towards the creation of more persuasive agents. Artif Intell Rev. 2002; 17(3):169–222.CrossRef
2.
go back to reference Aamodt A. Explanation-driven case-based reasoning. In: European Workshop on Case-Based Reasoning. Springer: 1993. p. 274–88. Aamodt A. Explanation-driven case-based reasoning. In: European Workshop on Case-Based Reasoning. Springer: 1993. p. 274–88.
3.
go back to reference Roth-Berghofer TR. Explanations and case-based reasoning: Foundational issues. In: European Conference on Case-Based Reasoning. Springer: 2004. p. 389–403. Roth-Berghofer TR. Explanations and case-based reasoning: Foundational issues. In: European Conference on Case-Based Reasoning. Springer: 2004. p. 389–403.
4.
go back to reference Quinlan JR. Induction of decision trees. Mach Learn. 1986; 1(1):81–106. Quinlan JR. Induction of decision trees. Mach Learn. 1986; 1(1):81–106.
5.
go back to reference Tibshirani R. Regression shrinkage and selection via the lasso. J R Stat Soc Ser B Methodol. 1996; 58(1):267–88. Tibshirani R. Regression shrinkage and selection via the lasso. J R Stat Soc Ser B Methodol. 1996; 58(1):267–88.
8.
go back to reference Voigt P, Von dem Bussche A. The EU General Data Protection Regulation (GDPR) In: A Practical Guide, editor. 1st Ed. Cham: Springer International Publishing: 2017. Voigt P, Von dem Bussche A. The EU General Data Protection Regulation (GDPR) In: A Practical Guide, editor. 1st Ed. Cham: Springer International Publishing: 2017.
10.
go back to reference Pieters W. Explanation and trust: what to tell the user in security and AI?Ethics Inf Technol. 2011; 13(1):53–64.CrossRef Pieters W. Explanation and trust: what to tell the user in security and AI?Ethics Inf Technol. 2011; 13(1):53–64.CrossRef
11.
go back to reference Miller T. Explanation in artificial intelligence: Insights from the social sciences. 2019; 267:1–38. Miller T. Explanation in artificial intelligence: Insights from the social sciences. 2019; 267:1–38.
12.
go back to reference Lapuschkin S, Wäldchen S, Binder A, Montavon G, Samek W, Müller K-R. Unmasking clever hans predictors and assessing what machines really learn. Nat Commun. 2019; 10(1):1–8.CrossRef Lapuschkin S, Wäldchen S, Binder A, Montavon G, Samek W, Müller K-R. Unmasking clever hans predictors and assessing what machines really learn. Nat Commun. 2019; 10(1):1–8.CrossRef
14.
go back to reference Samek W, Vol. 11700. Explainable AI: interpreting, explaining and visualizing deep learning: Springer; 2019. Samek W, Vol. 11700. Explainable AI: interpreting, explaining and visualizing deep learning: Springer; 2019.
15.
go back to reference World Health Organization. Responding to intimate partner violence and sexual violence against women: WHO clinical and policy guidelines: Technical report, World Health Organization; 2013. World Health Organization. Responding to intimate partner violence and sexual violence against women: WHO clinical and policy guidelines: Technical report, World Health Organization; 2013.
16.
go back to reference World Health Organization, et al.WHO: addressing violence against women: key achievements and priorities: Technical report, World Health Organization; 2018. World Health Organization, et al.WHO: addressing violence against women: key achievements and priorities: Technical report, World Health Organization; 2018.
17.
go back to reference Leeb RT. Child maltreatment surveillance: Uniform definitions for public health and recommended data elements. Centers for Disease Control and Prevention, National Center for Injury Prevention and Control. 2008. Leeb RT. Child maltreatment surveillance: Uniform definitions for public health and recommended data elements. Centers for Disease Control and Prevention, National Center for Injury Prevention and Control. 2008.
18.
go back to reference Fillmore CJ, Baker C. A frames approach to semantic analysis. In: The Oxford Handbook of Linguistic Analysis: 2010. Fillmore CJ, Baker C. A frames approach to semantic analysis. In: The Oxford Handbook of Linguistic Analysis: 2010.
20.
go back to reference Sikos J, Padó S. Using embeddings to compare framenet frames across languages. In: Proceedings of the First Workshop on Linguistic Resources for Natural Language Processing: 2018. p. 91–101. Sikos J, Padó S. Using embeddings to compare framenet frames across languages. In: Proceedings of the First Workshop on Linguistic Resources for Natural Language Processing: 2018. p. 91–101.
22.
23.
go back to reference Gildea D, Jurafsky D. Automatic labeling of semantic roles. Comput Linguist. 2002; 28(3):245–88.CrossRef Gildea D, Jurafsky D. Automatic labeling of semantic roles. Comput Linguist. 2002; 28(3):245–88.CrossRef
24.
go back to reference Croce D, Giannone C, Annesi P, Basili R. Towards open-domain semantic role labeling. In: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics: 2010. p. 237–46, Association for Computational Linguistics. Croce D, Giannone C, Annesi P, Basili R. Towards open-domain semantic role labeling. In: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics: 2010. p. 237–46, Association for Computational Linguistics.
25.
go back to reference Zapirain B, Agirre E, Marquez L, Surdeanu M. Selectional preferences for semantic role classification. Comput Linguist. 2013; 39(3):631–63.CrossRef Zapirain B, Agirre E, Marquez L, Surdeanu M. Selectional preferences for semantic role classification. Comput Linguist. 2013; 39(3):631–63.CrossRef
26.
go back to reference Roth M, Lapata M. Neural semantic role labeling with dependency path embeddings. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers): 2016. p. 1192–202. Roth M, Lapata M. Neural semantic role labeling with dependency path embeddings. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers): 2016. p. 1192–202.
27.
go back to reference Collobert R, Weston J, Bottou L, Karlen M, Kavukcuoglu K, Kuksa P. Natural language processing (almost) from scratch. J Mach Learn Res. 2011; 12:2493–537. Collobert R, Weston J, Bottou L, Karlen M, Kavukcuoglu K, Kuksa P. Natural language processing (almost) from scratch. J Mach Learn Res. 2011; 12:2493–537.
28.
go back to reference Haug PJ, Koehler SB, Christensen LM, Gundersen ML, Van Bree RE. Probabilistic method for natural language processing and for encoding free-text data into a medical database by utilizing a Bayesian network to perform spell checking of words. 2001. US Patent 6,292,771. Haug PJ, Koehler SB, Christensen LM, Gundersen ML, Van Bree RE. Probabilistic method for natural language processing and for encoding free-text data into a medical database by utilizing a Bayesian network to perform spell checking of words. 2001. US Patent 6,292,771.
29.
go back to reference Ruch P, Baud RH, Geiddbühler A, Lovis C, Rassinoux A-M, Riviere A. Looking back or looking all around: comparing two spell checking strategies for documents edition in an electronic patient record. In: Proceedings of the AMIA Symposium: 2001. p. 568, American Medical Informatics Association. Ruch P, Baud RH, Geiddbühler A, Lovis C, Rassinoux A-M, Riviere A. Looking back or looking all around: comparing two spell checking strategies for documents edition in an electronic patient record. In: Proceedings of the AMIA Symposium: 2001. p. 568, American Medical Informatics Association.
30.
go back to reference Lyons R, Kisse R, Rogmans W. EU-Injury database Introduction to the functioning of the Injury Database (IDB). European Association for Injury Prevention and Safety Promotion (EuroSafe). 2015. https://bit.ly/37FAKaB. Lyons R, Kisse R, Rogmans W. EU-Injury database Introduction to the functioning of the Injury Database (IDB). European Association for Injury Prevention and Safety Promotion (EuroSafe). 2015. https://​bit.​ly/​37FAKaB.
31.
go back to reference Kisser R, Latarjet J, Bauer R, Rogmans W. Injury data needs and opportunities in Europe. Int J Inj Control Saf Promot. 2009; 16(2):103–12.CrossRef Kisser R, Latarjet J, Bauer R, Rogmans W. Injury data needs and opportunities in Europe. Int J Inj Control Saf Promot. 2009; 16(2):103–12.CrossRef
32.
go back to reference McNaughton R, Yamada H. Regular expressions and state graphs for automata. IRE transactions on Electronic Comput. 1960; EC-9(1):39–47.CrossRef McNaughton R, Yamada H. Regular expressions and state graphs for automata. IRE transactions on Electronic Comput. 1960; EC-9(1):39–47.CrossRef
33.
go back to reference Bojanowski GE, Joulin A, Mikolov T. Enriching word vectors with subword information. Trans Assoc Comput Linguist. 2017; 5:135–46.CrossRef Bojanowski GE, Joulin A, Mikolov T. Enriching word vectors with subword information. Trans Assoc Comput Linguist. 2017; 5:135–46.CrossRef
34.
go back to reference Kingma DP, Ba J. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980. 2014. Kingma DP, Ba J. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980. 2014.
35.
go back to reference Minsky M. A framework for representing knowledge. In: Computation & Intelligence: 1995. p. 163–89, American Association for Artificial Intelligence. Minsky M. A framework for representing knowledge. In: Computation & Intelligence: 1995. p. 163–89, American Association for Artificial Intelligence.
36.
go back to reference Fillmore CJ. Frame semantics. Cogn Linguist Basic Readings. 2006; 34:373–400.CrossRef Fillmore CJ. Frame semantics. Cogn Linguist Basic Readings. 2006; 34:373–400.CrossRef
37.
go back to reference Jurafsky D. Part-of-speech tagging. In: Speech & language processing. Upper Saddle River: Pearson Education India: 2009. p. 157–206. Jurafsky D. Part-of-speech tagging. In: Speech & language processing. Upper Saddle River: Pearson Education India: 2009. p. 157–206.
38.
go back to reference Tseng H, Jurafsky D, Manning CD. Morphological features help POS tagging of unknown words across language varieties. In: Proceedings of the Fourth SIGHAN Workshop on Chinese Language Processing. Association for Computational Linguistics: 2005. p. 32–39. Tseng H, Jurafsky D, Manning CD. Morphological features help POS tagging of unknown words across language varieties. In: Proceedings of the Fourth SIGHAN Workshop on Chinese Language Processing. Association for Computational Linguistics: 2005. p. 32–39.
39.
go back to reference Ciaramita M, Altun Y. Broad-coverage sense disambiguation and information extraction with a supersense sequence tagger. In: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing: 2006. p. 594–602, Association for Computational Linguistics. Ciaramita M, Altun Y. Broad-coverage sense disambiguation and information extraction with a supersense sequence tagger. In: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing: 2006. p. 594–602, Association for Computational Linguistics.
40.
go back to reference Miller GA. WordNet: a lexical database for English. Commun ACM. 1995; 38(11):39–41.CrossRef Miller GA. WordNet: a lexical database for English. Commun ACM. 1995; 38(11):39–41.CrossRef
41.
go back to reference Aprosio AP, Moretti G. Italy goes to Stanford: a collection of CoreNLP modules for Italian. arXiv preprint arXiv:1609.06204. 2016. http://arxiv.org/abs/1609.06204. Aprosio AP, Moretti G. Italy goes to Stanford: a collection of CoreNLP modules for Italian. arXiv preprint arXiv:1609.06204. 2016. http://​arxiv.​org/​abs/​1609.​06204.​
42.
go back to reference Chen D, Manning C. A fast and accurate dependency parser using neural networks. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP): 2014. p. 740–50. Chen D, Manning C. A fast and accurate dependency parser using neural networks. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP): 2014. p. 740–50.
43.
go back to reference Picca D, Gliozzo AM, Ciaramita M. Supersense tagger for Italian. In: Proceedings of the International Conference on Language Resources and Evaluation. European Language Resources Association: 2008. p. 2386–90. Picca D, Gliozzo AM, Ciaramita M. Supersense tagger for Italian. In: Proceedings of the International Conference on Language Resources and Evaluation. European Language Resources Association: 2008. p. 2386–90.
44.
go back to reference Pitidis A, Fondi G, Giustini M, Longo E, Balducci G, Gruppo di lavoro SINIACA-IDB, Dipartimento di Ambiente e Connessa Prevenzione Primaria ISS. Il Sistema SINIACA-IDB per la sorveglianza degli incidenti. Notiziario dell’Istituto Superiore di Sanità. 2014; 27(2):11–6. Pitidis A, Fondi G, Giustini M, Longo E, Balducci G, Gruppo di lavoro SINIACA-IDB, Dipartimento di Ambiente e Connessa Prevenzione Primaria ISS. Il Sistema SINIACA-IDB per la sorveglianza degli incidenti. Notiziario dell’Istituto Superiore di Sanità. 2014; 27(2):11–6.
45.
go back to reference Zvára K, Tomecková M, Peleška J, Svátek V, Zvárová J. Tool-supported interactive correction and semantic annotation of narrative clinical reports. Methods Inf Med. 2017; 56(03):217–29.PubMedCrossRef Zvára K, Tomecková M, Peleška J, Svátek V, Zvárová J. Tool-supported interactive correction and semantic annotation of narrative clinical reports. Methods Inf Med. 2017; 56(03):217–29.PubMedCrossRef
46.
go back to reference Wang L, Luo L, Wang Y, Wampfler J, Yang P, Liu H. Natural language processing for populating lung cancer clinical research data. BMC Med Informa Decis Mak. 2019; 19(5):239.CrossRef Wang L, Luo L, Wang Y, Wampfler J, Yang P, Liu H. Natural language processing for populating lung cancer clinical research data. BMC Med Informa Decis Mak. 2019; 19(5):239.CrossRef
47.
go back to reference Constant M, Eryiğit G, Monti J, Van Der Plas L, Ramisch C, Rosner M, Todirascu A. Multiword expression processing: A survey. Comput Linguist. 2017; 43(4):837–92.CrossRef Constant M, Eryiğit G, Monti J, Van Der Plas L, Ramisch C, Rosner M, Todirascu A. Multiword expression processing: A survey. Comput Linguist. 2017; 43(4):837–92.CrossRef
48.
go back to reference Ramisch C, Villavicencio A, Boitet C. Mwetoolkit: a framework for multiword expression identification. In: LREC: 2010. p. 662–9, Valletta. Ramisch C, Villavicencio A, Boitet C. Mwetoolkit: a framework for multiword expression identification. In: LREC: 2010. p. 662–9, Valletta.
49.
go back to reference Ivanova A, Oepen S, Øvrelid L. Survey on parsing three dependency representations for English. In: 51st Annual Meeting of the Association for Computational Linguistics Proceedings of the Student Research Workshop: 2013. p. 31–7. Ivanova A, Oepen S, Øvrelid L. Survey on parsing three dependency representations for English. In: 51st Annual Meeting of the Association for Computational Linguistics Proceedings of the Student Research Workshop: 2013. p. 31–7.
50.
go back to reference De Mori R. Spoken language understanding: a survey. In: 2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU): 2007. p. 365–76, IEEE. De Mori R. Spoken language understanding: a survey. In: 2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU): 2007. p. 365–76, IEEE.
51.
go back to reference Wang Z, Zhang J, Feng J, Chen Z. Knowledge graph embedding by translating on hyperplanes. In: Twenty-Eighth AAAI Conference on Artificial Intelligence: 2014. p. 1112–9. Wang Z, Zhang J, Feng J, Chen Z. Knowledge graph embedding by translating on hyperplanes. In: Twenty-Eighth AAAI Conference on Artificial Intelligence: 2014. p. 1112–9.
52.
go back to reference Goyal P, Ferrara E. Graph embedding techniques, applications, and performance: A survey. Knowl-Based Syst. 2018; 151:78–94.CrossRef Goyal P, Ferrara E. Graph embedding techniques, applications, and performance: A survey. Knowl-Based Syst. 2018; 151:78–94.CrossRef
53.
go back to reference Ma F, Wang Y, Xiao H, Yuan Y, Chitta R, Zhou J, Gao J. Incorporating medical code descriptions for diagnosis prediction in healthcare. BMC Med Informa Decis Mak. 2019; 19(6):1–13. Ma F, Wang Y, Xiao H, Yuan Y, Chitta R, Zhou J, Gao J. Incorporating medical code descriptions for diagnosis prediction in healthcare. BMC Med Informa Decis Mak. 2019; 19(6):1–13.
54.
go back to reference Danescu-Niculescu-Mizil C, Gamon M, Dumais S. Mark my words!: Linguistic style accommodation in social media. In: Proceedings of the 20th International Conference on World Wide Web: 2011. p. 745–54, ACM. Danescu-Niculescu-Mizil C, Gamon M, Dumais S. Mark my words!: Linguistic style accommodation in social media. In: Proceedings of the 20th International Conference on World Wide Web: 2011. p. 745–54, ACM.
55.
go back to reference Wang Y-Y. A robust parser for spoken language understanding. In: Sixth European Conference on Speech Communication and Technology: 1999. Wang Y-Y. A robust parser for spoken language understanding. In: Sixth European Conference on Speech Communication and Technology: 1999.
56.
go back to reference Aldinucci M, Bagnasco S, Lusso S, Pasteris P, Rabellino S, Vallero S. OCCAM: a flexible, multi-purpose and extendable HPC cluster. J Phys Conf Ser. 2017; 898(8):082039.CrossRef Aldinucci M, Bagnasco S, Lusso S, Pasteris P, Rabellino S, Vallero S. OCCAM: a flexible, multi-purpose and extendable HPC cluster. J Phys Conf Ser. 2017; 898(8):082039.CrossRef
Metadata
Title
Violence detection explanation via semantic roles embeddings
Authors
Enrico Mensa
Davide Colla
Marco Dalmasso
Marco Giustini
Carlo Mamo
Alessio Pitidis
Daniele P. Radicioni
Publication date
01-12-2020
Publisher
BioMed Central
Published in
BMC Medical Informatics and Decision Making / Issue 1/2020
Electronic ISSN: 1472-6947
DOI
https://doi.org/10.1186/s12911-020-01237-4

Other articles of this Issue 1/2020

BMC Medical Informatics and Decision Making 1/2020 Go to the issue