Skip to main content
Top
Published in: BMC Medical Informatics and Decision Making 1/2021

Open Access 01-12-2021 | Research article

Using NLP in openEHR archetypes retrieval to promote interoperability: a feasibility study in China

Authors: Bo Sun, Fei Zhang, Jing Li, Yicheng Yang, Xiaolin Diao, Wei Zhao, Ting Shu

Published in: BMC Medical Informatics and Decision Making | Issue 1/2021

Login to get access

Abstract

Background

With the development and application of medical information system, semantic interoperability is essential for accurate and advanced health-related computing and electronic health record (EHR) information sharing. The openEHR approach can improve semantic interoperability. One key improvement of openEHR is that it allows for the use of existing archetypes. The crucial problem is how to improve the precision and resolve ambiguity in the archetype retrieval.

Method

Based on the query expansion technology and Word2Vec model in Nature Language Processing (NLP), we propose to find synonyms as substitutes for original search terms in archetype retrieval. Test sets in different medical professional level are used to verify the feasibility.

Result

Applying the approach to each original search term (n = 120) in test sets, a total of 69,348 substitutes were constructed. Precision at 5 (P@5) was improved by 0.767, on average. For the best result, the P@5 was up to 0.975.

Conclusions

We introduce a novel approach that using NLP technology and corpus to find synonyms as substitutes for original search terms. Compared to simply mapping the element contained in openEHR to an external dictionary, this approach could greatly improve precision and resolve ambiguity in retrieval tasks. This is helpful to promote the application of openEHR and advance EHR information sharing.
Literature
1.
go back to reference Murdoch TB, Detsky AS. The inevitable application of big data to health care. JAMA. 2013;309(13):1351–2.CrossRef Murdoch TB, Detsky AS. The inevitable application of big data to health care. JAMA. 2013;309(13):1351–2.CrossRef
2.
go back to reference Chaudhry B. Systematic review: impact of health information technology on quality, efficiency, and costs of medical care. Ann Intern Med. 2006;144(10):742–52.CrossRef Chaudhry B. Systematic review: impact of health information technology on quality, efficiency, and costs of medical care. Ann Intern Med. 2006;144(10):742–52.CrossRef
3.
go back to reference Hillestad R, Bigelow J, Bower A, Girosi F, Meili R, Scoville R, Taylor R. Can electronic medical record systems transform health care? Potential health benefits, savings, and costs. Health Aff (Millwood). 2005;24:1103–17.CrossRef Hillestad R, Bigelow J, Bower A, Girosi F, Meili R, Scoville R, Taylor R. Can electronic medical record systems transform health care? Potential health benefits, savings, and costs. Health Aff (Millwood). 2005;24:1103–17.CrossRef
4.
go back to reference Patel KK, Nadel J. Improving the quality and lowering the cost of health care: medicare reforms from the national commission on physician payment reform. J Gen Intern Med. 2014;29(5):703–4.CrossRef Patel KK, Nadel J. Improving the quality and lowering the cost of health care: medicare reforms from the national commission on physician payment reform. J Gen Intern Med. 2014;29(5):703–4.CrossRef
5.
go back to reference Kennedy-Shaffer L. When the alpha is the omega: p-values, “substantial evidence”, and the 005 standard at FDA. Food Drug Law J. 2017;72(4):595.PubMedPubMedCentral Kennedy-Shaffer L. When the alpha is the omega: p-values, “substantial evidence”, and the 005 standard at FDA. Food Drug Law J. 2017;72(4):595.PubMedPubMedCentral
6.
go back to reference Verheij RA, Curcin V, Delaney BC, Mcgilchrist MM. Possible sources of bias in primary care electronic health record data use and reuse. J Med Intern Res. 2018;20(5):e185. Verheij RA, Curcin V, Delaney BC, Mcgilchrist MM. Possible sources of bias in primary care electronic health record data use and reuse. J Med Intern Res. 2018;20(5):e185.
7.
go back to reference Fickenscher KM. President’s column: interoperability—the 30% solution: from dialog and rhetoric to reality. J Am Med Inform Assoc JAMIA. 2013;3:593–4.CrossRef Fickenscher KM. President’s column: interoperability—the 30% solution: from dialog and rhetoric to reality. J Am Med Inform Assoc JAMIA. 2013;3:593–4.CrossRef
8.
go back to reference Saleem JJ, Flanagan ME, Wilck NR, Jim D, Doebbeling BN. The next-generation electronic health record: perspectives of key leaders from the US Department of Veterans Affairs. J Am Med Inform Assoc JAMIA. 2013;2013(e1):e175.CrossRef Saleem JJ, Flanagan ME, Wilck NR, Jim D, Doebbeling BN. The next-generation electronic health record: perspectives of key leaders from the US Department of Veterans Affairs. J Am Med Inform Assoc JAMIA. 2013;2013(e1):e175.CrossRef
9.
go back to reference Roadmap D, Europe FOR. Semantic Interoperability for Better Health and Safer Healthcare. 2009. Roadmap D, Europe FOR. Semantic Interoperability for Better Health and Safer Healthcare. 2009.
10.
go back to reference Commision E. eHealth Action Plan 2012–2020-innvative healthcare for the 21st century. 2011. Commision E. eHealth Action Plan 2012–2020-innvative healthcare for the 21st century. 2011.
11.
go back to reference Jianxing H, Sally B, Jie X, Jiming X, Zhou. The practical implementation of artificial intelligence technologies in medicine. Nat Med; 2019. Jianxing H, Sally B, Jie X, Jiming X, Zhou. The practical implementation of artificial intelligence technologies in medicine. Nat Med; 2019.
12.
go back to reference Alberto MC, da David MCWD, Santos MR, Alberto MJ, Montserrat R. Clinical information modeling processes for semantic interoperability of electronic health records: systematic review and inductive analysis. J Am Med Inform Assoc JAMIA. 2015;4:925–34. Alberto MC, da David MCWD, Santos MR, Alberto MJ, Montserrat R. Clinical information modeling processes for semantic interoperability of electronic health records: systematic review and inductive analysis. J Am Med Inform Assoc JAMIA. 2015;4:925–34.
13.
go back to reference Blobel B, Goosscn W, Brochhausen M. Clinical modeling—A critical analysis. Int J Med Informatics. 2014;83(1):57–69.CrossRef Blobel B, Goosscn W, Brochhausen M. Clinical modeling—A critical analysis. Int J Med Informatics. 2014;83(1):57–69.CrossRef
14.
go back to reference Edidin H, Bhardwaj V: HL7 Version 2. x Data Types. In: HL7 for BizTalk. Springer; 2014: 169–174. Edidin H, Bhardwaj V: HL7 Version 2. x Data Types. In: HL7 for BizTalk. Springer; 2014: 169–174.
15.
go back to reference Saripalle RK. Fast health interoperability resources (FHIR): current status in the healthcare system. Int J E-Health Med Commun. 2019;10(1):76–93.CrossRef Saripalle RK. Fast health interoperability resources (FHIR): current status in the healthcare system. Int J E-Health Med Commun. 2019;10(1):76–93.CrossRef
16.
go back to reference Standards S. Health informatics—Electronic health record communication—Part 1: Reference model. 2006. Standards S. Health informatics—Electronic health record communication—Part 1: Reference model. 2006.
17.
go back to reference Standards B. Health Informatics—Electronic Healthcare Record Communication—Part 3: Distribution Rules. Standards B. Health Informatics—Electronic Healthcare Record Communication—Part 3: Distribution Rules.
18.
go back to reference Health informatics—Electronic health record communication—Part 2: Archetypes interchange specification (Endorsed by AENOR in October of 2007). 2007. Health informatics—Electronic health record communication—Part 2: Archetypes interchange specification (Endorsed by AENOR in October of 2007). 2007.
19.
go back to reference Beale T, Heard S, Kalra D, Lloyd D: OpenEHR architecture overview. The OpenEHR Foundation 2006, 7. Beale T, Heard S, Kalra D, Lloyd D: OpenEHR architecture overview. The OpenEHR Foundation 2006, 7.
20.
go back to reference Costa CM, Menárguez-Tortosa M, Fernández-Breis JT. Clinical data interoperability based on archetype transformation. J Biomed Inform. 2011;44(5):869–80.CrossRef Costa CM, Menárguez-Tortosa M, Fernández-Breis JT. Clinical data interoperability based on archetype transformation. J Biomed Inform. 2011;44(5):869–80.CrossRef
21.
go back to reference Wang L, Min L, Wang R, Lu X, Duan H. Archetype relational mapping—a practical openEHR persistence solution. BMC Med Inform Decis Mak. 2015;15(1):88.CrossRef Wang L, Min L, Wang R, Lu X, Duan H. Archetype relational mapping—a practical openEHR persistence solution. BMC Med Inform Decis Mak. 2015;15(1):88.CrossRef
22.
go back to reference Role of OpenEHR as an open source solution for the regional modelling of patient data in obstetrics. J Biomed Inform. 2015. Role of OpenEHR as an open source solution for the regional modelling of patient data in obstetrics. J Biomed Inform. 2015.
23.
go back to reference Sundvall E, Nyström M, Karlsson D, Eneling M, Orman H. Applying representational state transfer (REST) architecture to archetype-based electronic health record systems. BMC Med Inform Decis Mak. 2013;13(1):57.CrossRef Sundvall E, Nyström M, Karlsson D, Eneling M, Orman H. Applying representational state transfer (REST) architecture to archetype-based electronic health record systems. BMC Med Inform Decis Mak. 2013;13(1):57.CrossRef
24.
go back to reference Min L, Tian Q, Lu X, Duan H. Modeling EHR with the openEHR approach: an exploratory study in China. BMC Med Inform Decis Mak. 2018;18(1):75.CrossRef Min L, Tian Q, Lu X, Duan H. Modeling EHR with the openEHR approach: an exploratory study in China. BMC Med Inform Decis Mak. 2018;18(1):75.CrossRef
25.
go back to reference Garde S, Knaup P, Hovenga EJS, Heard S. Towards semantic interoperability for electronic health records. Methods Inf Med. 2007;46(03):332–43.CrossRef Garde S, Knaup P, Hovenga EJS, Heard S. Towards semantic interoperability for electronic health records. Methods Inf Med. 2007;46(03):332–43.CrossRef
26.
go back to reference Bernstein K, Tvede I, Petersen J, Bredegaard K. Can openEHR archetypes be used in a national context? The Danish archetype proof-of-concept project. Stud Health Technol Inform. 2009;150:147–51.PubMed Bernstein K, Tvede I, Petersen J, Bredegaard K. Can openEHR archetypes be used in a national context? The Danish archetype proof-of-concept project. Stud Health Technol Inform. 2009;150:147–51.PubMed
27.
go back to reference Garde S, Chen R, Leslie H, Beale T, McNICOLL I, Heard S. Archetype-based knowledge management for semantic interoperability of electronic health records. In: MIE: 2009; 2009. pp. 1007–11. Garde S, Chen R, Leslie H, Beale T, McNICOLL I, Heard S. Archetype-based knowledge management for semantic interoperability of electronic health records. In: MIE: 2009; 2009. pp. 1007–11.
28.
go back to reference Yang L, Huang X, Li J. Discovering clinical information models online to promote interoperability of electronic health records: a feasibility study of OpenEHR. J Med Intern Res. 2019;21(5):576. Yang L, Huang X, Li J. Discovering clinical information models online to promote interoperability of electronic health records: a feasibility study of OpenEHR. J Med Intern Res. 2019;21(5):576.
29.
go back to reference Min L, Tian Q, Lu X, An J, Duan H. An openEHR based approach to improve the semantic interoperability of clinical data registry. BMC Med Inform Decis Mak. 2018;18(Suppl 1):15.CrossRef Min L, Tian Q, Lu X, An J, Duan H. An openEHR based approach to improve the semantic interoperability of clinical data registry. BMC Med Inform Decis Mak. 2018;18(Suppl 1):15.CrossRef
30.
go back to reference Cohen MA. A survey of current work in biomedical text mining. Brief Bioinform. 2005;6(1):57–71.CrossRef Cohen MA. A survey of current work in biomedical text mining. Brief Bioinform. 2005;6(1):57–71.CrossRef
31.
go back to reference Meystre SM, Savova GK, Kipper-Schuler KC, Hurdle JF. Extracting information from textual documents in the electronic health record: a review of recent research. Yearb Med Inform. 2008;17(01):128–44.CrossRef Meystre SM, Savova GK, Kipper-Schuler KC, Hurdle JF. Extracting information from textual documents in the electronic health record: a review of recent research. Yearb Med Inform. 2008;17(01):128–44.CrossRef
32.
go back to reference Chen G, Ye D, Xing Z, Chen J, Cambria E. Ensemble application of convolutional and recurrent neural networks for multi-label text categorization. In: 2017 International joint conference on neural networks (IJCNN): 2017; 2017. Chen G, Ye D, Xing Z, Chen J, Cambria E. Ensemble application of convolutional and recurrent neural networks for multi-label text categorization. In: 2017 International joint conference on neural networks (IJCNN): 2017; 2017.
33.
go back to reference Guo J, Lu S, Han C, Zhang W, Wang J. Long text generation via adversarial training with leaked information. 2017. Guo J, Lu S, Han C, Zhang W, Wang J. Long text generation via adversarial training with leaked information. 2017.
34.
go back to reference Li Y, Pan Q, Wang S, Yang T, Cambria E. A generative model for category text generation. Inf Sci. 2018:S0020025518302366. Li Y, Pan Q, Wang S, Yang T, Cambria E. A generative model for category text generation. Inf Sci. 2018:S0020025518302366.
35.
go back to reference Younas M, Jawawi DNA, Ghani I. Extraction of non-functional requirement using semantic similarity distance. Neural Comput Appl. 2019;11:8989. Younas M, Jawawi DNA, Ghani I. Extraction of non-functional requirement using semantic similarity distance. Neural Comput Appl. 2019;11:8989.
36.
go back to reference Jin L, Yang Y, He H. Multi-level semantic representation enhancement network for relationship extraction. Neurocomputing. 2020;403:282–93.CrossRef Jin L, Yang Y, He H. Multi-level semantic representation enhancement network for relationship extraction. Neurocomputing. 2020;403:282–93.CrossRef
37.
go back to reference Crimp R, Trotman A. ACM: refining query expansion terms using query context; 2018. Crimp R, Trotman A. ACM: refining query expansion terms using query context; 2018.
38.
go back to reference Chaturvedi I, Ong YS, Tsang IW, Welsch RE, Cambria E. Learning word dependencies in text by means of a deep recurrent belief network. Knowl Based Syst. 2016;108:144–54.CrossRef Chaturvedi I, Ong YS, Tsang IW, Welsch RE, Cambria E. Learning word dependencies in text by means of a deep recurrent belief network. Knowl Based Syst. 2016;108:144–54.CrossRef
39.
go back to reference Huang Q, Yang Y, Cheng M. Deep learning the semantics of change sequences for query expansion. Softw Pract Exp. 2019;49(11):1600–17.CrossRef Huang Q, Yang Y, Cheng M. Deep learning the semantics of change sequences for query expansion. Softw Pract Exp. 2019;49(11):1600–17.CrossRef
40.
go back to reference Yusuf N, Yunus MAM, Wahid N, Wahid N, Nawi NM, Samsudin NA. IEEE: enhancing query expansion method using word embedding. In: 2019 IEEE 9th international conference on system engineering and technology. 2019; pp. 232–235. Yusuf N, Yunus MAM, Wahid N, Wahid N, Nawi NM, Samsudin NA. IEEE: enhancing query expansion method using word embedding. In: 2019 IEEE 9th international conference on system engineering and technology. 2019; pp. 232–235.
41.
go back to reference Pennington J, Socher R, Manning C. Glove: global vectors for word representation. In: Conference on empirical methods in natural language processing. 2014; 2014. Pennington J, Socher R, Manning C. Glove: global vectors for word representation. In: Conference on empirical methods in natural language processing. 2014; 2014.
42.
go back to reference Liang Y, Zhang W, Yang K. Attention-based Chinese word embedding. In: International conference on cloud computing and security: 2018; 2018. pp. 277–87. Liang Y, Zhang W, Yang K. Attention-based Chinese word embedding. In: International conference on cloud computing and security: 2018; 2018. pp. 277–87.
43.
go back to reference Liang C, Shao Y, Jing Z. Construction of a Chinese semantic dictionary by integrating two heterogeneous dictionaries: TongYiCi Cilin and HowNet. In: IEEE/WIC/ACM international joint conferences on web intelligence: 2013; 2013. Liang C, Shao Y, Jing Z. Construction of a Chinese semantic dictionary by integrating two heterogeneous dictionaries: TongYiCi Cilin and HowNet. In: IEEE/WIC/ACM international joint conferences on web intelligence: 2013; 2013.
44.
go back to reference Leroy G, Chen H. Meeting medical terminology needs-the ontology-enhanced Medical Concept Mapper. IEEE Trans Inf Technol Biomed. 2001;5(4):261–70.CrossRef Leroy G, Chen H. Meeting medical terminology needs-the ontology-enhanced Medical Concept Mapper. IEEE Trans Inf Technol Biomed. 2001;5(4):261–70.CrossRef
45.
go back to reference Koehn P, Knight K. Empirical methods for compound splitting. arXiv preprint cs/0302032; 2003. Koehn P, Knight K. Empirical methods for compound splitting. arXiv preprint cs/0302032; 2003.
46.
go back to reference Alfonseca E, Bilac S, Pharies S. Decompounding query keywords from compounding languages. In: Proceedings of ACL-08: HLT, short papers: 2008; 2008. pp. 253–56. Alfonseca E, Bilac S, Pharies S. Decompounding query keywords from compounding languages. In: Proceedings of ACL-08: HLT, short papers: 2008; 2008. pp. 253–56.
47.
go back to reference Zhao H, Cai D, Huang C, Kit C. Chinese word segmentation: another decade review (2007–2017). arXiv preprint arXiv:190106079; 2019. Zhao H, Cai D, Huang C, Kit C. Chinese word segmentation: another decade review (2007–2017). arXiv preprint arXiv:​190106079; 2019.
48.
go back to reference Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J. Distributed representations of words and phrases and their compositionality. In: Advances in neural information processing systems: 2013; 2013. pp. 3111–9. Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J. Distributed representations of words and phrases and their compositionality. In: Advances in neural information processing systems: 2013; 2013. pp. 3111–9.
49.
go back to reference Tay Y, Tuan LA, Hui SC. Multi-cast attention networks for retrieval-based question answering and response prediction; 2018. Tay Y, Tuan LA, Hui SC. Multi-cast attention networks for retrieval-based question answering and response prediction; 2018.
50.
go back to reference Li X, Lu J, Hu S, Cheng KK, De Maeseneer J, Meng Q, Mossialos E, Xu DR, Yip W, Zhang H, et al. The primary health-care system in China. The Lancet. 2017;390(10112):2584–94.CrossRef Li X, Lu J, Hu S, Cheng KK, De Maeseneer J, Meng Q, Mossialos E, Xu DR, Yip W, Zhang H, et al. The primary health-care system in China. The Lancet. 2017;390(10112):2584–94.CrossRef
51.
go back to reference Henriksson A, Moen H, Skeppstedt M, Daudaraviius V, Duneld M. Synonym extraction and abbreviation expansion with ensembles of semantic spaces. J Biomed Sem. 2014;5(1):6.CrossRef Henriksson A, Moen H, Skeppstedt M, Daudaraviius V, Duneld M. Synonym extraction and abbreviation expansion with ensembles of semantic spaces. J Biomed Sem. 2014;5(1):6.CrossRef
Metadata
Title
Using NLP in openEHR archetypes retrieval to promote interoperability: a feasibility study in China
Authors
Bo Sun
Fei Zhang
Jing Li
Yicheng Yang
Xiaolin Diao
Wei Zhao
Ting Shu
Publication date
01-12-2021
Publisher
BioMed Central
Published in
BMC Medical Informatics and Decision Making / Issue 1/2021
Electronic ISSN: 1472-6947
DOI
https://doi.org/10.1186/s12911-021-01554-2

Other articles of this Issue 1/2021

BMC Medical Informatics and Decision Making 1/2021 Go to the issue