Top

BMC Medical Informatics and Decision Making

Published in:

Open Access 01-12-2020 | Research article

How to automatically turn patient experience free-text responses into actionable insights: a natural language programming (NLP) approach

Authors: Simone A. Cammel, Marit S. De Vos, Daphne van Soest, Kristina M. Hettne, Fred Boer, Ewout W. Steyerberg, Hileen Boosman

Published in: BMC Medical Informatics and Decision Making | Issue 1/2020

Abstract

Background

Patient experience surveys often include free-text responses. Analysis of these responses is time-consuming and often underutilized. This study examined whether Natural Language Processing (NLP) techniques could provide a data-driven, hospital-independent solution to indicate points for quality improvement.

Methods

This retrospective study used routinely collected patient experience data from two hospitals. A data-driven NLP approach was used. Free-text responses were categorized into topics, subtopics (i.e. n-grams) and labelled with a sentiment score. The indicator ‘impact’, combining sentiment and frequency, was calculated to reveal topics to improve, monitor or celebrate. The topic modelling architecture was tested on data from a second hospital to examine whether the architecture is transferable to another hospital.

Results

A total of 38,664 survey responses from the first hospital resulted in 127 topics and 294 n-grams. The indicator ‘impact’ revealed n-grams to celebrate (15.3%), improve (8.8%), and monitor (16.7%). For hospital 2, a similar percentage of free-text responses could be labelled with a topic and n-grams. Between-hospitals, most topics (69.7%) were similar, but 32.2% of topics for hospital 1 and 29.0% of topics for hospital 2 were unique.

Conclusions

In both hospitals, NLP techniques could be used to categorize patient experience free-text responses into topics, sentiment labels and to define priorities for improvement. The model’s architecture was shown to be hospital-specific as it was able to discover new topics for the second hospital. These methods should be considered for future patient experience analyses to make better use of this valuable source of information.

Available only for authorised users

Hamming, J. F., H. Boosman, and P. J. de Mheen Marang-van. "The Association Between Complications, Incidents, and Patient Experience: Retrospective Linkage of Routine Patient Experience Surveys and Safety Data." Journal of patient safety (2019).

Cunningham M, Wells M. Qualitative analysis of 6961 free-text comments from the first National Cancer Patient Experience Survey in Scotland. BMJ Open. 2017;7(6):e015726.CrossRef

Blei DM, McAuliffe JD. Supervised topic models; 2010.

Li S. Topic modeling and Latent Dirichlet Allocation (LDA) in Python; 2018.

Abirami AM, Askarunisa A. Sentiment analysis model to emphasize the impact of online reviews in Healthcare industry, vol. 41; 2017.

Edwards A, Evans R, White P, Elwyn G. Experiencing patient-experience surveys: a qualitative study of the accounts of GPs. Br J Gen Pract. 2011;61(585):157–66.CrossRef

Gallan AS, Girju M, Girju R. Perfect ratings with negative comments: learning from contradictory patient survey responses. Patient Exp J. 2017;4(3):15–28.CrossRef

Bahja M, Lycett M. Identifying patient experience from online resources via sentiment analysis and topic modelling. In: Proceedings of the 3rd IEEE/ACM International Conference on Big Data Computing, Applications and Technologies; 2016. p. 94–9.

Bracher M, Corner DJ, Wagland R. Exploring experiences of cancer care in Wales: a thematic analysis of free-text responses to the 2013 Wales Cancer Patient Experience Survey (WCPES). BMJ Open. 2016;6(9):e011830.CrossRef

10.

Esuli, Andrea, Alejandro Moreo, and Fabrizio Sebastiani. "Building Automated Survey Coders via Interactive Machine Learning." arXiv preprint arXiv:1903.12110 (2019).

11.

Varanasi P, Tanniru M. Seeking intelligence from patient experience using text mining: analysis of emergency department data. Inf Syst Manag. 2015;32(3):220–8.CrossRef

12.

Ainley E, King J, Käsbauer S, Cooper R. A framework analysis of free-text data from the neonatal survey 2014. J Neonatal Nurs. 2018;24(3):163–8.CrossRef

13.

Weng J, Lim E-P, Jiang J, Qi Z. Twitterrank: Finding Topic-Sensitive Influential Twitterers; 2010.CrossRef

14.

Dalal MK, Zaveri M. Automatic Classification of Unstructured Blog Text, vol. 05; 2013.

15.

Gupta S. Sentiment Analysis: Concept, Analysis and Applications; 2018.

16.

Greaves F, Ramirez-Cano D, Millett C, Darzi A, Donaldson L. Use of Sentiment Analysis for Capturing Patient Experience From Free-Text Comments Posted Online, vol. 15; 2013.

17.

P. Norvig, How to Write a Spelling Corrector. 2016. [Online]. Available: https://norvig.com/spell-correct.html.

18.

J. Words!, “Dictionaries.” [Online]. Available: http://www.gwicks.net/dictionaries.htm. Accessed May 2019.

19.

Spackman KA, Campbell KE, Côté RA. SNOMED RT: a reference terminology for health care. In: Proceedings of the AMIA annual fall symposium; 1997. p. 640.

20.

Van Rossum, G., & Drake, F. L. (2009). Python 3 Reference Manual. Scotts Valley, CA: CreateSpace; 2019.

21.

Bird, Steven, Ewan Klein, and Edward Loper. Natural language processing with Python: analyzing text with the natural language toolkit. " O'Reilly Media, Inc.", 2009.

22.

Oliphant TE. Guide to NumPy, 2nd ed. USA: CreateSpace Independent Publishing Platform; 2015.

23.

Pedregosa, Fabian, et al. "Scikit-learn: Machine learning in Python." the Journal of machine Learning research 12 (2011): 2825-2830.

24.

Hunter JD. Matplotlib: a 2D graphics environment. Comput Sci Eng. 2007;9(3):90–5.CrossRef

25.

Lee DD, Seung HS. Algorithms for non-negative matrix factorization. In: Advances in neural information processing systems; 2001. p. 556–62.

26.

O’Callaghan D, Greene D, Conway M, Carthy J, Cunningham P. Down the (White) rabbit hole: the extreme right and online recommender systems. Soc Sci Comput Rev. 2014;33(4):459–78.CrossRef

27.

Newman D, Lau JH, Grieser K, Baldwin T. Automatic evaluation of topic coherence. In: Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics; 2010. p. 100–8.

28.

Wang X, McCallum A, Wei X. Topical N-Grams: Phrase and Topic Discovery, with an Application to Information Retrieval. In: Seventh IEEE International Conference on Data Mining (ICDM 2007); 2007. p. 697–702.CrossRef

29.

Cavnar WB, Trenkle JM. N-gram-based text categorization. In: Proceedings of SDAIR-94, 3rd annual symposium on document analysis and information retrieval; 1994. p. 161175.

30.

A. Cohen, FuzzyWuzzy String Matching. 2011. [Online]. Available: https://chairnerd.seatgeek.com/fuzzywuzzy-fuzzy-string-matching-in-python/.

31.

McHugh ML. Interrater reliability: the kappa statistic. Biochem Med. 2012;22(3):276–82.CrossRef

32.

Aven T. Risk assessment and risk management: review of recent advances on their foundation. Eur J Oper Res. 2016;253(1):1–13.CrossRef

33.

Wagland R, et al. Development and testing of a text-mining approach to analyse patients’ comments on their experiences of colorectal cancer care. BMJ Qual Saf. 2016;25(8):604 LP–614.CrossRef

34.

Shah A, Yan X, Shah S, Khan S. Use of Sentiment Mining and Online NMF for Topic Modeling Through the Analysis of Patients Online Unstructured Comments: International Conference, ICSH 2018, Wuhan, China, July 1–3, 2018, Proceedings; 2018. p. 191–203.

35.

Carrera-Trejo JV, Sidorov G, Miranda-Jiménez S, Moreno Ibarra M, Cadena Martínez R. Latent Dirichlet allocation complement in the vector space model for multi-label text classification. Int J Comb Optim Probl Informatics. 2015;6(1):7–19.

36.

Vincent C. Incident reporting and patient safety. BMJ. 2007;334(7584):51.CrossRef

37.

de Vos MS, Hamming JF, Marang-van de Mheen PJ. The problem with using patient complaints for improvement. BMJ Qual Saf. 2018;27(9):758 LP–762.CrossRef

Title: How to automatically turn patient experience free-text responses into actionable insights: a natural language programming (NLP) approach
Authors: Simone A. Cammel
Marit S. De Vos
Daphne van Soest
Kristina M. Hettne
Fred Boer
Ewout W. Steyerberg
Hileen Boosman
Publication date: 01-12-2020
Publisher: BioMed Central
Published in: BMC Medical Informatics and Decision Making / Issue 1/2020
Electronic ISSN: 1472-6947
DOI: https://doi.org/10.1186/s12911-020-1104-5

At a glance: The STEP trials

Springer Medicine

How to automatically turn patient experience free-text responses into actionable insights: a natural language programming (NLP) approach

Abstract

Background

Methods

Results

Conclusions

At a glance: The STEP trials

Springer Medicine

Abstract

Background

Methods

Results

Conclusions

Please log in to get access to this content

Other articles of this Issue 1/2020

The costs outweigh the benefits: seeing side-effects online may decrease adherence to statins

A stacking-based model for predicting 30-day all-cause hospital readmissions of patients with acute myocardial infarction

The considerations, experiences and support needs of family members making treatment decisions for patients admitted with major stroke: a qualitative study

Leveraging hybrid biomarkers in clinical endpoint prediction

The information imperative: to study the impact of informational discontinuity on clinical decision making among doctors

Comparison of deep learning with regression analysis in creating predictive models for SARS-CoV-2 outcomes