Published in: BMC Medical Informatics and Decision Making 5/2018

Open Access 01-12-2018 | Research

Mining and standardizing Chinese consumer health terms

Authors: Li Hou, Hongyu Kang, Yan Liu, Luqi Li, Jiao Li

Published in: BMC Medical Informatics and Decision Making | Special Issue 5/2018


Abstract

Background

Health professionals and consumers use different terms to express medical events or concerns, which creates communication barriers between them. Misunderstanding or incomplete understanding of these terms may bias diagnosis or treatment. To address this issue, a consumer health vocabulary was developed to map the health terms used by consumers to the medical terms used by professionals.

Methods

In this study, we extracted Chinese consumer health terms from both an online health forum and patient education monographs, and manually mapped them to the medical terms used by professionals (terms found in medical thesauri or medical books). To ensure annotation quality, we developed annotation guidelines.
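The manual mapping described above can be thought of as a lookup table from consumer terms to professional terms. The following is a minimal sketch under stated assumptions, not the authors' implementation: the English term pairs are hypothetical illustrations (they are not drawn from the released dataset), and the function names `build_mapping` and `normalize` are invented for this example.

```python
from collections import defaultdict

# Hypothetical (consumer term, professional term) pairs, for illustration only.
# They are NOT taken from the released CHT dataset.
ANNOTATED_PAIRS = [
    ("heart attack", "myocardial infarction"),
    ("high blood pressure", "hypertension"),
    ("hypertension", "hypertension"),
    ("sugar disease", "diabetes mellitus"),
]


def build_mapping(pairs):
    """Build lookup tables in both directions from annotated term pairs."""
    consumer_to_professional = {}
    professional_to_consumer = defaultdict(set)
    for consumer, professional in pairs:
        key = consumer.strip().lower()  # light normalization of the consumer term
        consumer_to_professional[key] = professional
        professional_to_consumer[professional].add(key)
    return consumer_to_professional, dict(professional_to_consumer)


def normalize(term, consumer_to_professional):
    """Return the mapped professional term, or None if the term is unmapped."""
    return consumer_to_professional.get(term.strip().lower())


c2p, p2c = build_mapping(ANNOTATED_PAIRS)
print(normalize("High blood pressure", c2p))
print(sorted(p2c["hypertension"]))
```

Keeping the reverse table makes the many-to-one nature of the mapping explicit: several consumer terms can point to one professional term.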

Results

We applied our method to extract consumer-used disease terms in endocrinology, cardiology, gastroenterology and dermatology. We identified 1349 medical mentions from 8436 questions posted in an online health forum and 1428 articles from patient education monographs. After manual annotation and review, we released 1036 Chinese consumer health terms mapped to 480 medical terms. Four annotators carried out the manual annotation following the Chinese consumer health term annotation guidelines. Their average inter-annotator agreement (IAA) score was 93.91%, indicating high consistency of the released terms.
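The abstract does not state how the average IAA was computed. One common choice, shown here purely as an assumption, is simple percent agreement averaged over all annotator pairs. The labels and annotator names below are hypothetical, and the helper names `pairwise_agreement` and `average_iaa` are invented for this sketch.

```python
from itertools import combinations


def pairwise_agreement(labels_a, labels_b):
    """Fraction of items on which two annotators chose the same label."""
    if len(labels_a) != len(labels_b):
        raise ValueError("annotators must label the same items")
    if not labels_a:
        raise ValueError("no items to compare")
    matches = sum(a == b for a, b in zip(labels_a, labels_b))
    return matches / len(labels_a)


def average_iaa(annotations):
    """Mean pairwise agreement over every pair of annotators."""
    pairs = list(combinations(annotations.values(), 2))
    if not pairs:
        raise ValueError("need at least two annotators")
    return sum(pairwise_agreement(a, b) for a, b in pairs) / len(pairs)


# Hypothetical labels: four annotators, four items each.
annotations = {
    "annotator_1": ["hypertension", "diabetes", "eczema", "gastritis"],
    "annotator_2": ["hypertension", "diabetes", "eczema", "gastritis"],
    "annotator_3": ["hypertension", "diabetes", "psoriasis", "gastritis"],
    "annotator_4": ["hypertension", "diabetes", "eczema", "gastritis"],
}
print(f"{average_iaa(annotations):.2%}")
```

Percent agreement does not correct for chance agreement; a chance-corrected statistic such as Cohen's or Fleiss' kappa would be an alternative if that matters for the analysis.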

Conclusions

We extracted Chinese consumer health terms from an online forum and patient education monographs, and mapped them to medical terms used by professionals. Both term annotation and term mapping were done manually. Our study may contribute to the construction of a Chinese consumer health vocabulary. In addition, our annotated corpus, which includes both the contexts of consumer health terms and the consumer-professional term mappings, would be a useful resource for developing automatic methods. The dataset of the Chinese consumer health terms (CHT) is publicly available at http://www.phoc.org.cn/cht/.
Metadata
Title
Mining and standardizing Chinese consumer health terms
Authors
Li Hou
Hongyu Kang
Yan Liu
Luqi Li
Jiao Li
Publication date
01-12-2018
Publisher
BioMed Central
DOI
https://doi.org/10.1186/s12911-018-0695-6
