Top

BMC Medical Informatics and Decision Making

Published in:

Open Access 01-12-2020 | Research Article

AutoDiscern: rating the quality of online health information with hierarchical encoder attention-based neural networks

Authors: Laura Kinkead, Ahmed Allam, Michael Krauthammer

Published in: BMC Medical Informatics and Decision Making | Issue 1/2020

Abstract

Background

Patients increasingly turn to search engines and online content before, or in place of, talking with a health professional. Low quality health information, which is common on the internet, presents risks to the patient in the form of misinformation and a possibly poorer relationship with their physician. To address this, the DISCERN criteria (developed at University of Oxford) are used to evaluate the quality of online health information. However, patients are unlikely to take the time to apply these criteria to the health websites they visit.

Methods

We built an automated implementation of the DISCERN instrument (Brief version) using machine learning models. We compared the performance of a traditional model (Random Forest) with that of a hierarchical encoder attention-based neural network (HEA) model using two language embeddings, BERT and BioBERT.

Results

The HEA BERT and BioBERT models achieved average F1-macro scores across all criteria of 0.75 and 0.74, respectively, outperforming the Random Forest model (average F1-macro = 0.69). Overall, the neural network based models achieved 81% and 86% average accuracy at 100% and 80% coverage, respectively, compared to 94% manual rating accuracy. The attention mechanism implemented in the HEA architectures not only provided ’model explainability’ by identifying reasonable supporting sentences for the documents fulfilling the Brief DISCERN criteria, but also boosted F1 performance by 0.05 compared to the same architecture without an attention mechanism.

Conclusions

Our research suggests that it is feasible to automate online health information quality assessment, which is an important step towards empowering patients to become informed partners in the healthcare process.

https://www.crummy.com/software/BeautifulSoup/.

Hesse BW, Nelson DE, Kreps GL, Croyle RT, Arora NK, Rimer BK, Viswanath K. Trust and Sources of Health Information. Arch Intern Med. 2005; 165(22):2618. https://doi.org/10.1001/archinte.165.22.2618.CrossRef

Fahy E. Quality of patient health information on the internet: reviewing a complex and. Australas Med J. 2014; 7(1):24–8. https://doi.org/10.4066/AMJ.2014.1900.CrossRef

Zhang Y, Sun Y, Xie B. Quality of health information for consumers on the web: A systematic review of indicators, criteria, tools, and evaluation results. J Assoc Inf Sci Technol. 2015; 66(10). https://doi.org/10.1002/asi.23311.

Saunders CH, Petersen CL, Durang M-A, Bagley PJ, Elywn G. Bring on the Machines: Could Machine Learning Improve the Quality of Patient Education Materials? A Systematic Search and Rapid Review. JCO Clin Cancer Inform. 2018; 2:1–16. https://doi.org/10.1200/cci.18.00010.CrossRef

Murray E, Lo B, Pollack L, Donelan K, Catania J, Lee K, Zapert K, Turner R. The Impact of Health Information on the Internet on Health Care and the Physician-Patient Relationship: National U.S. Survey among 1.050 U.S. Physicians. J Med Internet Res. 2003; 5(3):17. https://doi.org/10.2196/jmir.5.3.e17.CrossRef

Allam A, Schulz PJ, Nakamoto K. The impact of search engine selection and sorting criteria on vaccination beliefs and attitudes: two experiments manipulating Google output,. J Med Internet Res. 2014; 16(4):100. https://doi.org/10.2196/jmir.2642.CrossRef

Ludolph R, Allam A, Schulz PJ. Manipulating Google’s Knowledge Graph Box to Counter Biased Information Processing During an Online Search on Vaccination: Application of a Technological Debiasing Strategy. J Med Internet Res. 2016; 18(6):137. https://doi.org/10.2196/jmir.5430.CrossRef

Iverson SA, Howard KB, Penney BK. Impact of internet use on health-related behaviors and the patient-physician relationship: a survey-based study and review. J Am Osteopath Assoc. 2008; 108(12):699–711.PubMed

Wald HS, Dube CE, Anthony DC. Untangling the Web-The impact of Internet use on health care and the physician-patient relationship. Patient Educ Couns. 2007; 68(3):218–24. https://doi.org/10.1016/j.pec.2007.05.016.CrossRef

10.

Risk A, Dzenowagis J. Review of Internet health information quality initiatives. J Med Internet Res. 2001. https://doi.org/10.2196/jmir.3.4.e28.

11.

Viviani M, Pasi G. Credibility in social media: opinions, news, and health information-a survey. Wiley Interdiscip Rev Data Min Knowl Disc. 2017; 7(5):1209. https://doi.org/10.1002/widm.1209.CrossRef

12.

Charnock D, Shepperd S, Needham G, Gann R. DISCERN: an instrument for judging the quality of written consumer health information on treatment choices,. J Epidemiol Community Health. 1999; 53(2):105–11. https://doi.org/10.1136/jech.53.2.105.CrossRef

13.

Boyer C, Dolamic L. Feasibility of automated detection of HONcode conformity for health-related websites. Int J Adv Comput Sci Appl. 2014; 5(3). https://doi.org/10.14569/IJACSA.2014.050309.

14.

Boyer C, Dolamic L. Automated Detection of HONcode Website Conformity Compared to Manual Detection: An Evaluation,. J Med Internet Res. 2015; 17(6):135. https://doi.org/10.2196/jmir.3831.CrossRef

15.

Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser L, Polosukhin I. Attention Is All You Need. 2017. http://arxiv.org/abs/1706.03762. Accessed 14 Oct 2019.

16.

Luong M-T, Pham H, Manning CD. Effective Approaches to Attention-based Neural Machine Translation. 2015. http://arxiv.org/abs/1508.04025. Accessed 15 Oct 2019.

17.

Wolf T, Debut L, Sanh V, Chaumond J, Delangue C, Moi A, Cistac P, Rault T, Louf R, Funtowicz M, Brew J. Transformers: State-of-the-art Natural Language Processing. 2019. http://arxiv.org/abs/1910.03771.

18.

Ruder S, Peters ME, Swayamdipta S, Wolf T. Transfer Learning in Natural Language Processing. In: Proceedings of the 2019 Conference of the North. Stroudsburg: Association for Computational Linguistics: 2019. p. 15–18. https://doi.org/10.18653/v1/N19-5004.

19.

Rees CE, Ford JE, Sheard CE. Evaluating the reliability of DISCERN: A tool for assessing the quality of written patient information on treatment choices. Patient Educ Couns. 2002. https://doi.org/10.1016/S0738-3991(01)00225-7.

20.

Khazaal Y, Chatton A, Cochand S, Coquard O, Fernandez S, Khan R, Billieux J, Zullino D. Brief DISCERN, six questions for the evaluation of evidence-based content of health-related websites. Patient Educ Couns. 2009. https://doi.org/10.1016/j.pec.2009.02.016.

21.

Devlin J, Chang M-W, Lee K, Toutanova K. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. 2018. http://arxiv.org/abs/1810.04805.

22.

Lee J, Yoon W, Kim S, Kim D, Kim S, So CH, Kang J. BioBERT: a pre-trained biomedical language representation model for biomedical text mining. Bioinformatics. 2019. https://doi.org/10.1093/bioinformatics/btz682.

23.

Allam A, Schulz PJ, Krauthammer M. Toward automated assessment of health Web page quality using the DISCERN instrument. J Am Med Informa Assoc. 2017; 24(3):481–87. https://doi.org/10.1093/jamia/ocw140.

24.

Paszke A, Gross S, Massa F, Lerer A, Bradbury J, Chanan G, Killeen T, Lin Z, Gimelshein N, Antiga L, Desmaison A, Kopf A, Yang E, DeVito Z, Raison M, Tejani A, Chilamkurthy S, Steiner B, Fang L, Bai J, Chintala S. PyTorch: An Imperative Style, High-Performance Deep Learning Library In: Wallach H, Larochelle H, Beygelzimer A, Alché-Buc F, Fox E, Garnett R, editors. Advances in Neural Information Processing Systems 32. Curran Associates, Inc.: 2019. p. 8024–8035. http://papers.neurips.cc/paper/9015-pytorch-an-imperative-style-high-performance-deep-learning-library.pdf.

25.

Hochreiter S, Schmidhuber J. Long Short-Term Memory. Neural Comput. 1997; 9(8):1735–80. https://doi.org/10.1162/neco.1997.9.8.1735.

26.

Bengio Y, Simard P, Frasconi P. Learning long-term dependencies with gradient descent is difficult. IEEE Trans Neural Netw. 1994; 5(2):157–66. https://doi.org/10.1109/72.279181.

27.

Graves A. Supervised Sequence Labelling with Recurrent Neural Networks. Berlin, Heidelberg: Springer; 2012. https://doi.org/10.1007/978-3-642-24797-2. http://link.springer.com/10.1007/978-3-642-24797-2.

28.

Cho K, Van Merriënboer B, Gulcehre C, Bahdanau D, Bougares F, Schwenk H, Bengio Y. Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP). Doha: Association for Computational Linguistics: 2014. p. 1724–1734. http://aclweb.org/anthology/D14-1179. Accessed 01 Nov 2019.

29.

Chung J, Gulcehre C, Cho K, Bengio Y. Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling. 2014. http://arxiv.org/abs/1412.3555. Accessed 01 Nov 2018.

30.

Bahdanau D, Cho K, Bengio Y. Neural Machine Translation by Jointly Learning to Align and Translate. 2014. http://arxiv.org/abs/1409.0473. Accessed 18 Dec 2019.

31.

Srivastava N, Hinton G, Krizhevsky A, Sutskever I, Salakhutdinov R. Dropout: A Simple Way to Prevent Neural Networks from Overfitting. J Mach Learn Res. 2014; 15:1929–58.

32.

Bergstra JAMESBERGSTRA J, Yoshua Bengio YOSHUABENGIO U. Random Search for HyperParameter Optimization. J Mach Learn Res. 2012. https://doi.org/10.1162/153244303322533223.

33.

Demner-Fushman D, Rogers WJ, Aronson AR. MetaMap Lite: an evaluation of a new Java implementation of MetaMap. J Am Med Inform Assoc. 2017; 177. https://doi.org/10.1093/jamia/ocw177.

34.

Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V, Vanderplas J, Passos A, Cournapeau D, Brucher M, Perrot M, Duchesnay E. Scikit-learn: Machine learning in Python. J Mach Learn Res. 2011; 12:2825–30.

35.

Berthelot D, Carlini N, Goodfellow I, Papernot N, Oliver A, Raffel C. MixMatch: A Holistic Approach to Semi-Supervised Learning. 2019. http://arxiv.org/abs/1905.02249. Accessed 18 Dec 2019.

36.

Xie Q, Dai Z, Hovy E, Luong M-T, Le QV. Unsupervised Data Augmentation for Consistency Training. 2019. http://arxiv.org/abs/1904.12848.

Title: AutoDiscern: rating the quality of online health information with hierarchical encoder attention-based neural networks
Authors: Laura Kinkead
Ahmed Allam
Michael Krauthammer
Publication date: 01-12-2020
Publisher: BioMed Central
Published in: BMC Medical Informatics and Decision Making / Issue 1/2020
Electronic ISSN: 1472-6947
DOI: https://doi.org/10.1186/s12911-020-01131-z

Keynote webinar | Spotlight on sleep in brain health

Springer Medicine

AutoDiscern: rating the quality of online health information with hierarchical encoder attention-based neural networks

Abstract

Background

Methods

Results

Conclusions

Keynote webinar | Spotlight on sleep in brain health

Springer Medicine

Abstract

Background

Methods

Results

Conclusions

Please log in to get access to this content

Other articles of this Issue 1/2020

Comparison of deep learning with regression analysis in creating predictive models for SARS-CoV-2 outcomes

Pre-implementation adaptation of primary care cancer prevention clinical decision support in a predominantly rural healthcare system

The development of the evidence-based SDMMCC intervention to improve shared decision making in geriatric outpatients: the DICO study

A combination of two methods for evaluating the usability of a hospital information system

Using machine learning of clinical data to diagnose COVID-19: a systematic review and meta-analysis

Early warning score validation methodologies and performance metrics: a systematic review