Top

BMC Medical Research Methodology

Published in:

Open Access 01-12-2021 | Oseltamivir | Research

Machine learning in medicine: a practical introduction to natural language processing

Authors: Conrad J. Harrison, Chris J. Sidey-Gibbons

Published in: BMC Medical Research Methodology | Issue 1/2021

Abstract

Background

Unstructured text, including medical records, patient feedback, and social media comments, can be a rich source of data for clinical research. Natural language processing (NLP) describes a set of techniques used to convert passages of written text into interpretable datasets that can be analysed by statistical and machine learning (ML) models. The purpose of this paper is to provide a practical introduction to contemporary techniques for the analysis of text-data, using freely-available software.

Methods

We performed three NLP experiments using publicly-available data obtained from medicine review websites. First, we conducted lexicon-based sentiment analysis on open-text patient reviews of four drugs: Levothyroxine, Viagra, Oseltamivir and Apixaban. Next, we used unsupervised ML (latent Dirichlet allocation, LDA) to identify similar drugs in the dataset, based solely on their reviews. Finally, we developed three supervised ML algorithms to predict whether a drug review was associated with a positive or negative rating. These algorithms were: a regularised logistic regression, a support vector machine (SVM), and an artificial neural network (ANN). We compared the performance of these algorithms in terms of classification accuracy, area under the receiver operating characteristic curve (AUC), sensitivity and specificity.

Results

Levothyroxine and Viagra were reviewed with a higher proportion of positive sentiments than Oseltamivir and Apixaban. One of the three LDA clusters clearly represented drugs used to treat mental health problems. A common theme suggested by this cluster was drugs taking weeks or months to work. Another cluster clearly represented drugs used as contraceptives. Supervised machine learning algorithms predicted positive or negative drug ratings with classification accuracies ranging from 0.664, 95% CI [0.608, 0.716] for the regularised regression to 0.720, 95% CI [0.664,0.776] for the SVM.

Conclusions

In this paper, we present a conceptual overview of common techniques used to analyse large volumes of text, and provide reproducible code that can be readily applied to other research studies using open-source software.

Available only for authorised users

Lee CH, Yoon HJ. Medical big data: promise and challenges. Kidney Res Clin Pract. 2017. https://doi.org/10.23876/j.krcp.2017.36.1.3.CrossRefPubMedPubMedCentral

Sidey-Gibbons JAM, Sidey-Gibbons CJ. Machine learning in medicine: a practical introduction. BMC Med Res Methodol. 2019. https://doi.org/10.1186/s12874-019-0681-4.CrossRefPubMedPubMedCentral

Esteva A, Kuprel B, Novoa RA, et al. Dermatologist-level classification of skin cancer with deep neural networks. Nature. 2017. https://doi.org/10.1038/nature21056.CrossRefPubMed

Nadkarni PM, Ohno-Machado L, Chapman WW. Natural language processing: an introduction. J Am Med Informatics Assoc. 2011. https://doi.org/10.1136/amiajnl-2011-000464.CrossRef

Gravesteijn BY, Nieboer D, Ercole A, et al. Machine learning algorithms performed no better than regression models for prognostication in traumatic brain injury. J Clin Epidemiol. 2020. https://doi.org/10.1016/j.jclinepi.2020.03.005.CrossRefPubMed

Nusinovici S, Tham YC, Chak Yan MY, et al. Logistic regression was as good as machine learning for predicting major chronic diseases. J Clin Epidemiol. 2020. https://doi.org/10.1016/j.jclinepi.2020.03.002.CrossRefPubMed

Lynam AL, Dennis JM, Owen KR, et al. Logistic regression has similar performance to optimised machine learning algorithms in a clinical setting: application to the discrimination between type 1 and type 2 diabetes in young adults. Diagnostic Progn Res. 2020. https://doi.org/10.1186/s41512-020-00075-2.CrossRef

Christodoulou E, Ma J, Collins GS, Steyerberg EW, Verbakel JY, Van Calster B. A systematic review shows no performance benefit of machine learning over logistic regression for clinical prediction models. J Clin Epidemiol. 2019. https://doi.org/10.1016/j.jclinepi.2019.02.004.CrossRefPubMed

Collins GS, van Smeden M, Riley RD. COVID-19 prediction models should adhere to methodological and reporting standards. Eur Respir J. 2020. https://doi.org/10.1183/13993003.02643-2020.CrossRefPubMedPubMedCentral

10.

Doshi-Velez F, Kim B. Considerations for evaluation and generalization in interpretable machine learning. 2018. https://doi.org/10.1007/978-3-319-98131-4_1.CrossRef

11.

Royal College of Surgeons of England. Commission on the Future of Surgery. 2020. Available at: https://www.rcseng.ac.uk/standards-and-research/future-of-surgery/. Accessed 25 July 2021.

12.

Iacus SM. Automated data collection with R - a practical guide to web scraping and text mining. J Stat Softw. 2015. https://doi.org/10.18637/jss.v068.b03.CrossRef

13.

Wickham H. Package “rvest”. 2021. https://cran.r-project.org/web/packages/rvest/rvest.pdf. Accessed 4 June 2021.

14.

Sidey-Gibbons J, Sidey-Gibbons C. Machine learning in medicine: a practical introduction. BMC Med Res Methodol. 2019;19(1):64.CrossRef

15.

Gonçalves P, Araújo M, Benevenuto F, Cha M. Comparing and combining sentiment analysis methods. In: COSN 2013 - proceedings of the 2013 Conference on Online Social Networks. 2013. https://doi.org/10.1145/2512938.2512951.

16.

Vaismoradi M, Turunen H, Bondas T. Content analysis and thematic analysis: implications for conducting a qualitative descriptive study. Nurs Health Sci. 2013. https://doi.org/10.1111/nhs.12048.CrossRefPubMed

17.

Hu M, Liu B. Mining and summarizing customer reviews. In: KDD-2004 - proceedings of the tenth ACM SIGKDD international conference on knowledge discovery and data mining. 2004. https://doi.org/10.1145/1014052.1014073.

18.

Ofoghi B, Mann M, Verspoor K. Towards early discovery of salient health threats: a social media emotion classification technique. In: Pacific symposium biocomputing. 2016. http://psb.stanford.edu/psb-online/proceedings/psb16/ofoghi.pdf. Accessed 4 June 2021.

19.

Davis MA, Zheng K, Liu Y, Levy H. Public response to obamacare on Twitter. J Med Internet Res. 2017. https://doi.org/10.2196/JMIR.6946.CrossRefPubMedPubMedCentral

20.

Gabarron E, Dorronzoro E, Rivera-Romero O, Wynn R. Diabetes on Twitter: a sentiment analysis. J Diabetes Sci Technol. 2019. https://doi.org/10.1177/1932296818811679.CrossRefPubMedPubMedCentral

21.

Bakliwal A, Arora P, Patil A, Varma V. Towards enhanced opinion classification using NLP techniques. In: Proceedings of the workshop on Sentiment Analysis where AI meets Psychology (SAAIP 2011). 2011. p. 101–107.

22.

Gurevych I. Inverted polarity bigram lexicons. 2015. https://www.informatik.tu-darmstadt.de/ukp/research_6/data/sentiment_analysis/inverted_polarity_bigrams/index.en.jsp. Accessed 22 Jan 2021.

23.

Chapman WW, Bridewell W, Hanbury P, Cooper GF, Buchanan BG. A simple algorithm for identifying negated findings and diseases in discharge summaries. J Biomed Inform. 2001. https://doi.org/10.1006/jbin.2001.1029.CrossRefPubMed

24.

Harkema H, Dowling JN, Thornblade T, Chapman WW. ConText: an algorithm for determining negation, experiencer, and temporal status from clinical reports. J Biomed Inform. 2009. https://doi.org/10.1016/j.jbi.2009.05.002.CrossRefPubMedPubMedCentral

25.

Mukherjee S, Bala PK. Detecting sarcasm in customer tweets: an NLP based approach. Ind Manag Data Syst. 2017. https://doi.org/10.1108/IMDS-06-2016-0207.CrossRef

26.

Thakkar H, Patel D. Approaches for sentiment analysis on Twitter: a state‐of‐art study. 2015. Available at: https://arxiv.org/pdf/1512.01043.pdf. Accessed 25 July 2021.

27.

Sharma D, Sabharwal M, Goyal V, Vij M. Sentiment analysis techniques for social media data: a review. In: Advances in intelligent systems and computing. 2020. https://doi.org/10.1007/978-981-15-0029-9_7.

28.

Jelodar H, Wang Y, Rabbani M, Ayobi SVA. Natural language processing via LDA topic model in recommendation systems. arXiv. 2019.

29.

Rodriguez MY, Storer H. A computational social science perspective on qualitative data exploration: using topic models for the descriptive analysis of social media data*. J Technol Hum Serv. 2020. https://doi.org/10.1080/15228835.2019.1616350.CrossRef

30.

Abdellaoui R, Foulquie P, Texier N, Faviez C, Burgun A, Schück S. Detection of cases of noncompliance to drug treatment in patient forum posts: topic model approach. J Med Internet Res. 2018. https://doi.org/10.2196/jmir.9222.CrossRefPubMedPubMedCentral

31.

TapiNzali MD, Bringay S, Lavergne C, Mollevi C, Opitz T. What patients can tell us: topic analysis for social media on breast cancer. JMIR Med Informatics. 2017. https://doi.org/10.2196/medinform.7779.CrossRef

32.

Banerjee I, Li K, Seneviratne M, et al. Weakly supervised natural language processing for assessing patient-centered outcome following prostate cancer treatment. JAMIA Open. 2019. https://doi.org/10.1093/jamiaopen/ooy057.CrossRefPubMedPubMedCentral

33.

Bedi G, Carrillo F, Cecchi GA, et al. Automated analysis of free speech predicts psychosis onset in high-risk youths. npj Schizophr. 2015. https://doi.org/10.1038/npjschz.2015.30.

34.

Griffiths A, Leaver MP. Wisdom of patients: predicting the quality of care using aggregated patient feedback. BMJ Qual Saf. 2018. https://doi.org/10.1136/bmjqs-2017-006847.CrossRefPubMedPubMedCentral

35.

Ozgur C, Colliau T, Rogers G, Hughes Z, Myer-Tyson EB. MatLab vs Python vs. R. J Data Sci. 2017;15(3):355–71.CrossRef

36.

Kallumadi S, Gräßer F. Drug Review Dataset (Drugs.com) data set. University of California Irvine Machine Learning Repository; 2018. https://archive.ics.uci.edu/ml/datasets/Drug+Review+Dataset+%28Drugs.com%29. Accessed 22 Jan 2021.

37.

Brownlee J. Deep learning for natural language processing. 2017. Available at: http://ling.snu.ac.kr/class/AI_Agent/deep_learning_for_nlp.pdf. Accessed 22 July 2021.

38.

Manning CD, Raghavan P, Schutze H. Introduction to information retrieval. 2008. https://doi.org/10.1017/cbo9780511809071.CrossRef

39.

Porter MF. An algorithm for suffix stripping. Program. 2006. https://doi.org/10.1108/00330330610681286.CrossRef

40.

Wilbur WJ, Sirotkin K. The automatic identification of stop words. J Inf Sci. 1992. https://doi.org/10.1177/016555159201800106.CrossRef

41.

Piantadosi ST. Zipf’s word frequency law in natural language: a critical review and future directions. Psychon Bull Rev. 2014. https://doi.org/10.3758/s13423-014-0585-6.CrossRefPubMedPubMedCentral

42.

Fagan S, Gençay R. An introduction to textual econometrics. In: Handbook of empirical economics and finance. CRC Press; 2010. p. 139. https://books.google.co.uk/books?hl=en&lr=&id=QAUv9R6bJzwC&oi=fnd&pg=PA139&redir_esc=y#v=onepage&q&f=false.

43.

Blei DM, Lafferty JD. Dynamic topic models. In: ACM international conference proceeding series. 2006. https://doi.org/10.1145/1143844.1143859.

44.

Bail C. Topic modeling. Text as data course. 2019. https://sicss.io/2019/materials/day3-text-analysis/topic-modeling/rmarkdown/Topic_Modeling.html. Accessed 4 June 2021.

45.

Zhao W, Chen JJ, Perkins R, et al. A heuristic approach to determine an appropriate number of topics in topic modeling. BMC Bioinformatics. 2015. https://doi.org/10.1186/1471-2105-16-S13-S8.CrossRefPubMedPubMedCentral

46.

Guo X, Yin Y, Dong C, Yang G, Zhou G. On the class imbalance problem. In: Proceedings - 4th International Conference on Natural Computation, ICNC 2008. 2008. https://doi.org/10.1109/ICNC.2008.871.

47.

Blagus R, Lusa L. SMOTE for high-dimensional class-imbalanced data. BMC Bioinformatics. 2013. https://doi.org/10.1186/1471-2105-14-106.CrossRefPubMedPubMedCentral

48.

Japkowicz N. The class imbalance problem: significance and strategies. In: Proc 2000 Int Conf Artif Intell. 2000.

49.

National Institute for Health and Care Excellence. Antidepressant treatment in adults. 2020. https://pathways.nice.org.uk/pathways/depression/antidepressant-treatment-inadults#content=view-node%3Anodes-starting-antidepressant-treatment. Accessed 22 Jan 2021.

50.

National Institute for Health and Care Excellence. Levetiracetam. British National Forumlary; 2021. https://bnf.nice.org.uk/drug/levetiracetam.html. Accessed 22 Jan 2021.

51.

Luo W, Phung D, Tran T, et al. Guidelines for developing and reporting machine learning predictive models in biomedical research: a multidisciplinary view. J Med Internet Res. 2016. https://doi.org/10.2196/jmir.5870.CrossRefPubMedPubMedCentral

52.

Collins GS, Reitsma JB, Altman DG, Moons KGM. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD statement. Eur Urol. 2015. https://doi.org/10.1016/j.eururo.2014.11.025.CrossRefPubMed

53.

Collins GS, Moons KGM. Reporting of artificial intelligence prediction models. Lancet. 2019. https://doi.org/10.1016/S0140-6736(19)30037-6.CrossRefPubMedPubMedCentral

54.

Wolff RF, Moons KGM, Riley RD, et al. PROBAST: a tool to assess the risk of bias and applicability of prediction model studies. Ann Intern Med. 2019. https://doi.org/10.7326/M18-1376.CrossRefPubMedPubMedCentral

55.

Babyak MA. What you see may not be what you get: a brief, nontechnical introduction to overfitting in regression-type models. Psychosom Med. 2004. https://doi.org/10.1097/01.psy.0000127692.23278.a9.CrossRefPubMed

56.

Balakrishnan V, Ethel L-Y. Stemming and lemmatization: a comparison of retrieval performances. Lect Notes Softw Eng. 2014. https://doi.org/10.7763/lnse.2014.v2.134.CrossRef

57.

Nugues PM. Dependency parsing. In: Cognitive technologies. 2014. https://doi.org/10.1007/978-3-642-41464-0_13.

Title: Machine learning in medicine: a practical introduction to natural language processing
Authors: Conrad J. Harrison
Chris J. Sidey-Gibbons
Publication date: 01-12-2021
Publisher: BioMed Central
Keywords: Oseltamivir
Apixaban
Published in: BMC Medical Research Methodology / Issue 1/2021
Electronic ISSN: 1471-2288
DOI: https://doi.org/10.1186/s12874-021-01347-1

Keynote webinar | Spotlight on medication adherence

Springer Medicine

Machine learning in medicine: a practical introduction to natural language processing

Abstract

Background

Methods

Results

Conclusions

Keynote webinar | Spotlight on medication adherence

Springer Medicine

Abstract

Background

Methods

Results

Conclusions

Please log in to get access to this content

Other articles of this Issue 1/2021

Open science saves lives: lessons from the COVID-19 pandemic

Conducting proportional meta-analysis in different types of systematic reviews: a guide for synthesisers of evidence

Incorporating and addressing testing bias within estimates of epidemic dynamics for SARS-CoV-2

Are morbidity and mortality estimates from randomized controlled trials externally valid? A comparison of outcomes among infants enrolled into an RCT or a cohort study in Botswana

Ordinal outcome analysis improves the detection of between-hospital differences in outcome

Predictive P-score for treatment ranking in Bayesian network meta-analysis