Skip to main content
Top
Published in: BMC Medical Informatics and Decision Making 1/2014

Open Access 01-12-2014 | Research article

Multi-topic assignment for exploratory navigation of consumer health information in NetWellness using formal concept analysis

Authors: Licong Cui, Rong Xu, Zhihui Luo, Susan Wentz, Kyle Scarberry, Guo-Qiang Zhang

Published in: BMC Medical Informatics and Decision Making | Issue 1/2014

Login to get access

Abstract

Background

Finding quality consumer health information online can effectively bring important public health benefits to the general population. It can empower people with timely and current knowledge for managing their health and promoting wellbeing. Despite a popular belief that search engines such as Google can solve all information access problems, recent studies show that using search engines and simple search terms is not sufficient. Our objective is to provide an approach to organizing consumer health information for navigational exploration, complementing keyword-based direct search. Multi-topic assignment to health information, such as online questions, is a fundamental step for navigational exploration.

Methods

We introduce a new multi-topic assignment method combining semantic annotation using UMLS concepts (CUIs) and Formal Concept Analysis (FCA). Each question was tagged with CUIs identified by MetaMap. The CUIs were filtered with term-frequency and a new term-strength index to construct a CUI-question context. The CUI-question context and a topic-subject context were used for multi-topic assignment, resulting in a topic-question context. The topic-question context was then directly used for constructing a prototype navigational exploration interface.

Results

Experimental evaluation was performed on the task of automatic multi-topic assignment of 99 predefined topics for about 60,000 consumer health questions from NetWellness. Using example-based metrics, suitable for multi-topic assignment problems, our method achieved a precision of 0.849, recall of 0.774, and F 1 measure of 0.782, using a reference standard of 278 questions with manually assigned topics. Compared to NetWellness’ original topic assignment, a 36.5% increase in recall is achieved with virtually no sacrifice in precision.

Conclusion

Enhancing the recall of multi-topic assignment without sacrificing precision is a prerequisite for achieving the benefits of navigational exploration. Our new multi-topic assignment method, combining term-strength, FCA, and information retrieval techniques, significantly improved recall and performed well according to example-based metrics.
Appendix
Available only for authorised users
Literature
1.
go back to reference Berland GK, Elliott MN, Morales LS, Algazy JI, Kravitz RL, Broder MS, Kanouse DE, Muñoz JA, Puyol J-A, Lara M, Watkins KE, Yang H, McGlynn EA: Health information on the internet: accessibility, quality, and readability in English and Spanish. JAMA. 2001, 285 (20): 2612-2621.CrossRefPubMedPubMedCentral Berland GK, Elliott MN, Morales LS, Algazy JI, Kravitz RL, Broder MS, Kanouse DE, Muñoz JA, Puyol J-A, Lara M, Watkins KE, Yang H, McGlynn EA: Health information on the internet: accessibility, quality, and readability in English and Spanish. JAMA. 2001, 285 (20): 2612-2621.CrossRefPubMedPubMedCentral
2.
3.
go back to reference White RW, Kules B, Drucker SM, Schraefel MC: Supporting exploratory search, introduction, special issue, communications of the ACM. Commun ACM. 2006, 49 (4): 37-39.CrossRef White RW, Kules B, Drucker SM, Schraefel MC: Supporting exploratory search, introduction, special issue, communications of the ACM. Commun ACM. 2006, 49 (4): 37-39.CrossRef
4.
go back to reference Hearst MA: Clustering versus faceted categories for information exploration. Commun ACM. 2006, 49 (4): 59-61.CrossRef Hearst MA: Clustering versus faceted categories for information exploration. Commun ACM. 2006, 49 (4): 59-61.CrossRef
5.
go back to reference Hearst M: Design recommendations for hierarchical faceted search interfaces. Proceedings of ACM SIGIR Workshop on Faceted Search: 10 Aug 2006; Seattle. Edited by: Broder AZ, Maarek YS. 2006, New York: ACM, 1-5. Hearst M: Design recommendations for hierarchical faceted search interfaces. Proceedings of ACM SIGIR Workshop on Faceted Search: 10 Aug 2006; Seattle. Edited by: Broder AZ, Maarek YS. 2006, New York: ACM, 1-5.
6.
go back to reference Sacco GM, Tzitzikas Y: Dynamic taxonomies and faceted search: theory, practice, and experience. 2009, Germany: SpringerCrossRef Sacco GM, Tzitzikas Y: Dynamic taxonomies and faceted search: theory, practice, and experience. 2009, Germany: SpringerCrossRef
7.
go back to reference Mu X, Ryu H, Lu K: Supporting effective health and biomedical information retrieval and navigation: a novel facet view interface evaluation. J Biomed Inf. 2011, 44 (4): 576-586.CrossRef Mu X, Ryu H, Lu K: Supporting effective health and biomedical information retrieval and navigation: a novel facet view interface evaluation. J Biomed Inf. 2011, 44 (4): 576-586.CrossRef
8.
go back to reference Cui L, Carter R, Zhang G-Q: Evaluation of a novel conjunctive exploratory navigation interface for consumer health information: a crowdsourced comparative study. J Med Internet Res. 2014, 16 (2): e45-CrossRefPubMedPubMedCentral Cui L, Carter R, Zhang G-Q: Evaluation of a novel conjunctive exploratory navigation interface for consumer health information: a crowdsourced comparative study. J Med Internet Res. 2014, 16 (2): e45-CrossRefPubMedPubMedCentral
9.
go back to reference Tsoumakas G, Katakis I: Multi-label classification: an overview. Int J Data Warehousing Mining (IJDWM). 2007, 3 (3): 1-13.CrossRef Tsoumakas G, Katakis I: Multi-label classification: an overview. Int J Data Warehousing Mining (IJDWM). 2007, 3 (3): 1-13.CrossRef
10.
go back to reference Tsoumakas G, Katakis I, Vlahavas I: Mining multi-label data. Data Mining and Knowledge Discovery Handbook. 2010, Berlin: Springer, 667-685. Tsoumakas G, Katakis I, Vlahavas I: Mining multi-label data. Data Mining and Knowledge Discovery Handbook. 2010, Berlin: Springer, 667-685.
11.
go back to reference Pestian JP, Brew C, Matykiewicz P, Hovermale D, Johnson N, Cohen KB, Duch W: A shared task involving multi-label classification of clinical free text. Proceedings of the Workshop on BioNLP 2007: Biological, Translational, and Clinical Language Processing: 29 June 2007; Prague. Edited by: Cohen KB, Demner-Fushman D, Friedman C, Hirschman L, Pestian J. 2007, Stroudsburg: ACL, 97-104. Pestian JP, Brew C, Matykiewicz P, Hovermale D, Johnson N, Cohen KB, Duch W: A shared task involving multi-label classification of clinical free text. Proceedings of the Workshop on BioNLP 2007: Biological, Translational, and Clinical Language Processing: 29 June 2007; Prague. Edited by: Cohen KB, Demner-Fushman D, Friedman C, Hirschman L, Pestian J. 2007, Stroudsburg: ACL, 97-104.
12.
go back to reference Cao Y-g, Cimino JJ, Ely J, Yu H: Automatically extracting information needs from complex clinical questions. J Biomed Inf. 2010, 43 (6): 962-971.CrossRef Cao Y-g, Cimino JJ, Ely J, Yu H: Automatically extracting information needs from complex clinical questions. J Biomed Inf. 2010, 43 (6): 962-971.CrossRef
13.
go back to reference Ganter B, Wille R, Franzke C: Formal concept analysis: mathematical foundations. 1997, New York: Springer Ganter B, Wille R, Franzke C: Formal concept analysis: mathematical foundations. 1997, New York: Springer
14.
go back to reference Bodenreider O: The unified medical language system (UMLS): integrating biomedical terminology. Nucleic Acids Res. 2004, 32 (suppl 1): 267-270.CrossRef Bodenreider O: The unified medical language system (UMLS): integrating biomedical terminology. Nucleic Acids Res. 2004, 32 (suppl 1): 267-270.CrossRef
16.
go back to reference Aronson AR, Lang F-M: An overview of MetaMap: historical perspective and recent advances. J Am Med Inf Assoc. 2010, 17 (3): 229-236.CrossRef Aronson AR, Lang F-M: An overview of MetaMap: historical perspective and recent advances. J Am Med Inf Assoc. 2010, 17 (3): 229-236.CrossRef
17.
go back to reference Jones KS: A statistical interpretation of term specificity and its application in retrieval. J Doc. 1972, 28 (1): 11-21.CrossRef Jones KS: A statistical interpretation of term specificity and its application in retrieval. J Doc. 1972, 28 (1): 11-21.CrossRef
18.
go back to reference Zeng QT, Tse T: Exploring and developing consumer health vocabularies. J Am Med Inf Assoc. 2006, 13 (1): 24-29.CrossRef Zeng QT, Tse T: Exploring and developing consumer health vocabularies. J Am Med Inf Assoc. 2006, 13 (1): 24-29.CrossRef
20.
go back to reference Artstein R, Poesio M: Inter-coder agreement for computational linguistics. Comput Ling. 2008, 34 (4): 555-596.CrossRef Artstein R, Poesio M: Inter-coder agreement for computational linguistics. Comput Ling. 2008, 34 (4): 555-596.CrossRef
21.
go back to reference Hripcsak G, Rothschild AS: Agreement, the f-measure, and reliability in information retrieval. J Am Med Inf Assoc. 2005, 12 (3): 296-298.CrossRef Hripcsak G, Rothschild AS: Agreement, the f-measure, and reliability in information retrieval. J Am Med Inf Assoc. 2005, 12 (3): 296-298.CrossRef
22.
go back to reference Zhang G-Q, Shen G, Tian Y, Sun J: Concept analysis as a formal method for menu design. Interactive Systems. Design, Specification, and Verification. 2006, Berlin: Springer, 173-187.CrossRef Zhang G-Q, Shen G, Tian Y, Sun J: Concept analysis as a formal method for menu design. Interactive Systems. Design, Specification, and Verification. 2006, Berlin: Springer, 173-187.CrossRef
23.
go back to reference Carpineto C, Romano G: Concept data analysis: theory and applications. 2004, Hoboken: Wiley.comCrossRef Carpineto C, Romano G: Concept data analysis: theory and applications. 2004, Hoboken: Wiley.comCrossRef
24.
go back to reference Priss U: Formal concept analysis in information science. ARIST. 2006, 40 (1): 521-543. Priss U: Formal concept analysis in information science. ARIST. 2006, 40 (1): 521-543.
25.
go back to reference Tunkelang D: Faceted search. Synth Lect Inf Concepts Retrieval Serv. 2009, 1 (1): 1-80. Tunkelang D: Faceted search. Synth Lect Inf Concepts Retrieval Serv. 2009, 1 (1): 1-80.
26.
go back to reference Spiteri L: A simplified model for facet analysis. Can J Inform Library Sci. 1998, 23 (1–2): 1-30. Spiteri L: A simplified model for facet analysis. Can J Inform Library Sci. 1998, 23 (1–2): 1-30.
27.
go back to reference Cui L, Tao S, Zhang GQ: A semantic-based approach for exploring consumer health questions using UMLS. Proceeding of AMIA Annual Symp. 2014, (In Press) Cui L, Tao S, Zhang GQ: A semantic-based approach for exploring consumer health questions using UMLS. Proceeding of AMIA Annual Symp. 2014, (In Press)
Metadata
Title
Multi-topic assignment for exploratory navigation of consumer health information in NetWellness using formal concept analysis
Authors
Licong Cui
Rong Xu
Zhihui Luo
Susan Wentz
Kyle Scarberry
Guo-Qiang Zhang
Publication date
01-12-2014
Publisher
BioMed Central
Published in
BMC Medical Informatics and Decision Making / Issue 1/2014
Electronic ISSN: 1472-6947
DOI
https://doi.org/10.1186/1472-6947-14-63

Other articles of this Issue 1/2014

BMC Medical Informatics and Decision Making 1/2014 Go to the issue