Skip to main content
Top
Published in: BMC Medical Informatics and Decision Making 1/2019

Open Access 01-12-2019 | Oral Anticoagulant | Research article

Replicating medication trend studies using ad hoc information extraction in a clinical data warehouse

Authors: Georg Dietrich, Jonathan Krebs, Leon Liman, Georg Fette, Maximilian Ertl, Mathias Kaspar, Stefan Störk, Frank Puppe

Published in: BMC Medical Informatics and Decision Making | Issue 1/2019

Login to get access

Abstract

Background

Medication trend studies show the changes of medication over the years and may be replicated using a clinical Data Warehouse (CDW). Even nowadays, a lot of the patient information, like medication data, in the EHR is stored in the format of free text. As the conventional approach of information extraction (IE) demands a high developmental effort, we used ad hoc IE instead. This technique queries information and extracts it on the fly from texts contained in the CDW.

Methods

We present a generalizable approach of ad hoc IE for pharmacotherapy (medications and their daily dosage) presented in hospital discharge letters. We added import and query features to the CDW system, like error tolerant queries to deal with misspellings and proximity search for the extraction of the daily dosage. During the data integration process in the CDW, negated, historical and non-patient context data are filtered. For the replication studies, we used a drug list grouped by ATC (Anatomical Therapeutic Chemical Classification System) codes as input for queries to the CDW.

Results

We achieve an F1 score of 0.983 (precision 0.997, recall 0.970) for extracting medication from discharge letters and an F1 score of 0.974 (precision 0.977, recall 0.972) for extracting the dosage. We replicated three published medical trend studies for hypertension, atrial fibrillation and chronic kidney disease. Overall, 93% of the main findings could be replicated, 68% of sub-findings, and 75% of all findings. One study could be completely replicated with all main and sub-findings.

Conclusion

A novel approach for ad hoc IE is presented. It is very suitable for basic medical texts like discharge letters and finding reports. Ad hoc IE is by definition more limited than conventional IE and does not claim to replace it, but it substantially exceeds the search capabilities of many CDWs and it is convenient to conduct replication studies fast and with high quality.
Literature
1.
go back to reference Zoega H, Furu K, Halldorsson M, Thomsen PH, Sourander A, Martikainen JE. Use of adhd drugs in the nordic countries: a population-based comparison study. Acta Psychiatr Scand. 2011; 123(5):360–7.PubMedCrossRef Zoega H, Furu K, Halldorsson M, Thomsen PH, Sourander A, Martikainen JE. Use of adhd drugs in the nordic countries: a population-based comparison study. Acta Psychiatr Scand. 2011; 123(5):360–7.PubMedCrossRef
2.
go back to reference Fang MC, Stafford RS, Ruskin JN, Singer DE. National trends in antiarrhythmic and antithrombotic medication use in atrial fibrillation. Arch Intern Med. 2004; 164(1):55–60.PubMedCrossRef Fang MC, Stafford RS, Ruskin JN, Singer DE. National trends in antiarrhythmic and antithrombotic medication use in atrial fibrillation. Arch Intern Med. 2004; 164(1):55–60.PubMedCrossRef
3.
go back to reference Gadsbøll K, Staerk L, Fosbøl EL, Sindet-Pedersen C, Gundlund A, Lip GY, Gislason GH, Olesen JB. Increased use of oral anticoagulants in patients with atrial fibrillation: temporal trends from 2005 to 2015 in denmark. Eur Heart J. 2017; 38(12):899–906.PubMed Gadsbøll K, Staerk L, Fosbøl EL, Sindet-Pedersen C, Gundlund A, Lip GY, Gislason GH, Olesen JB. Increased use of oral anticoagulants in patients with atrial fibrillation: temporal trends from 2005 to 2015 in denmark. Eur Heart J. 2017; 38(12):899–906.PubMed
4.
go back to reference Staerk L, Fosbøl EL, Gadsbøll K, Sindet-Pedersen C, Pallisgaard JL, Lamberts M, Lip GY, Torp-Pedersen C, Gislason GH, Olesen JB. Non-vitamin k antagonist oral anticoagulation usage according to age among patients with atrial fibrillation: Temporal trends 2011–2015 in denmark. Sci Rep. 2016; 6:31477.PubMedPubMedCentralCrossRef Staerk L, Fosbøl EL, Gadsbøll K, Sindet-Pedersen C, Pallisgaard JL, Lamberts M, Lip GY, Torp-Pedersen C, Gislason GH, Olesen JB. Non-vitamin k antagonist oral anticoagulation usage according to age among patients with atrial fibrillation: Temporal trends 2011–2015 in denmark. Sci Rep. 2016; 6:31477.PubMedPubMedCentralCrossRef
5.
go back to reference Wu B, Bell K, Stanford A, Kern DM, Tunceli O, Vupputuri S, Kalsekar I, Willey V. Understanding ckd among patients with t2dm: prevalence, temporal trends, and treatment patterns—nhanes 2007–2012. BMJ Open Diabetes Res Care. 2016; 4(1):000154.CrossRef Wu B, Bell K, Stanford A, Kern DM, Tunceli O, Vupputuri S, Kalsekar I, Willey V. Understanding ckd among patients with t2dm: prevalence, temporal trends, and treatment patterns—nhanes 2007–2012. BMJ Open Diabetes Res Care. 2016; 4(1):000154.CrossRef
7.
go back to reference Katada H, Yukawa N, Urushihara H, Tanaka S, Mimori T, Kawakami K. Prescription patterns and trends in anti-rheumatic drug use based on a large-scale claims database in japan. Clin Rheumatol. 2015; 34(5):949–56.PubMedCrossRef Katada H, Yukawa N, Urushihara H, Tanaka S, Mimori T, Kawakami K. Prescription patterns and trends in anti-rheumatic drug use based on a large-scale claims database in japan. Clin Rheumatol. 2015; 34(5):949–56.PubMedCrossRef
8.
go back to reference Bromfield S, Muntner P. High blood pressure: the leading global burden of disease risk factor and the need for worldwide prevention programs. Curr Hypertens Rep. 2013; 15(3):134–6.PubMedPubMedCentralCrossRef Bromfield S, Muntner P. High blood pressure: the leading global burden of disease risk factor and the need for worldwide prevention programs. Curr Hypertens Rep. 2013; 15(3):134–6.PubMedPubMedCentralCrossRef
9.
go back to reference Falaschetti E, Mindell J, Knott C, Poulter N. Hypertension management in england: a serial cross-sectional study from 1994 to 2011. Lancet. 2014; 383(9932):1912–9.PubMedCrossRef Falaschetti E, Mindell J, Knott C, Poulter N. Hypertension management in england: a serial cross-sectional study from 1994 to 2011. Lancet. 2014; 383(9932):1912–9.PubMedCrossRef
10.
go back to reference Godet-Mardirossian H, Girerd X, Vernay M, Chamontin B, Castetbon K, de Peretti C. Patterns of hypertension management in france (enns 2006–2007). Eur J Prev Cardiol. 2012; 19(2):213–20.PubMedCrossRef Godet-Mardirossian H, Girerd X, Vernay M, Chamontin B, Castetbon K, de Peretti C. Patterns of hypertension management in france (enns 2006–2007). Eur J Prev Cardiol. 2012; 19(2):213–20.PubMedCrossRef
11.
go back to reference Sarganas G, Knopf H, Grams D, Neuhauser HK. Trends in antihypertensive medication use and blood pressure control among adults with hypertension in germany. Am J Hypertens. 2015; 29(1):104–13.PubMedCrossRef Sarganas G, Knopf H, Grams D, Neuhauser HK. Trends in antihypertensive medication use and blood pressure control among adults with hypertension in germany. Am J Hypertens. 2015; 29(1):104–13.PubMedCrossRef
12.
go back to reference Wallentin F, Wettermark B, Kahan T. Drug treatment of hypertension in sweden in relation to sex, age, and comorbidity. J Clin Hypertens. 2018; 20(1):106–14.CrossRef Wallentin F, Wettermark B, Kahan T. Drug treatment of hypertension in sweden in relation to sex, age, and comorbidity. J Clin Hypertens. 2018; 20(1):106–14.CrossRef
13.
go back to reference Gu Q, Burt VL, Dillon CF, Yoon S. Trends in antihypertensive medication use and blood pressure control among united states adults with hypertensionclinical perspective: The national health and nutrition examination survey, 2001 to 2010. Circulation. 2012; 126(17):2105–14.PubMedCrossRef Gu Q, Burt VL, Dillon CF, Yoon S. Trends in antihypertensive medication use and blood pressure control among united states adults with hypertensionclinical perspective: The national health and nutrition examination survey, 2001 to 2010. Circulation. 2012; 126(17):2105–14.PubMedCrossRef
15.
go back to reference Begley CG, Ellis LM. Drug development: Raise standards for preclinical cancer research. Nature. 2012; 483(7391):531.CrossRefPubMed Begley CG, Ellis LM. Drug development: Raise standards for preclinical cancer research. Nature. 2012; 483(7391):531.CrossRefPubMed
17.
go back to reference Jensen K, Soguero-Ruiz C, Mikalsen KO, Lindsetmo R-O, Kouskoumvekaki I, Girolami M, Skrovseth SO, Augestad KM. Analysis of free text in electronic health records for identification of cancer patient trajectories. Sci Rep. 2017; 7:46226.PubMedPubMedCentralCrossRef Jensen K, Soguero-Ruiz C, Mikalsen KO, Lindsetmo R-O, Kouskoumvekaki I, Girolami M, Skrovseth SO, Augestad KM. Analysis of free text in electronic health records for identification of cancer patient trajectories. Sci Rep. 2017; 7:46226.PubMedPubMedCentralCrossRef
18.
go back to reference Dietrich G, Krebs J, Fette G, Ertl M, Kaspar M, Störk S, Puppe F. Ad hoc information extraction for clinical data warehouses. Methods Inf Med. 2018; 57(01):22–9. Dietrich G, Krebs J, Fette G, Ertl M, Kaspar M, Störk S, Puppe F. Ad hoc information extraction for clinical data warehouses. Methods Inf Med. 2018; 57(01):22–9.
19.
go back to reference Xu H, Stenner SP, Doan S, Johnson KB, Waitman LR, Denny JC. Medex: a medication information extraction system for clinical narratives. J Am Med Inform Assoc. 2010; 17(1):19–24.PubMedPubMedCentralCrossRef Xu H, Stenner SP, Doan S, Johnson KB, Waitman LR, Denny JC. Medex: a medication information extraction system for clinical narratives. J Am Med Inform Assoc. 2010; 17(1):19–24.PubMedPubMedCentralCrossRef
20.
go back to reference Spasić I, Sarafraz F, Keane JA, Nenadić G. Medication information extraction with linguistic pattern matching and semantic rules. J Am Med Inform Assoc. 2010; 17(5):532–5.PubMedPubMedCentralCrossRef Spasić I, Sarafraz F, Keane JA, Nenadić G. Medication information extraction with linguistic pattern matching and semantic rules. J Am Med Inform Assoc. 2010; 17(5):532–5.PubMedPubMedCentralCrossRef
21.
go back to reference Sohn S, Kocher J-PA, Chute CG, Savova GK. Drug side effect extraction from clinical narratives of psychiatry and psychology patients. J Am Med Inform Assoc. 2011; 18(Supplement_1):144–9.CrossRef Sohn S, Kocher J-PA, Chute CG, Savova GK. Drug side effect extraction from clinical narratives of psychiatry and psychology patients. J Am Med Inform Assoc. 2011; 18(Supplement_1):144–9.CrossRef
22.
go back to reference Wang Y, Wang L, Rastegar-Mojarad M, Moon S, Shen F, Afzal N, Liu S, Zeng Y, Mehrabi S, Sohn S, et al. Clinical information extraction applications: A literature review. J Biomed Inform. 2018; 77:34–49.PubMedCrossRef Wang Y, Wang L, Rastegar-Mojarad M, Moon S, Shen F, Afzal N, Liu S, Zeng Y, Mehrabi S, Sohn S, et al. Clinical information extraction applications: A literature review. J Biomed Inform. 2018; 77:34–49.PubMedCrossRef
24.
go back to reference Chapman WW, Bridewell W, Hanbury P, Cooper GF, Buchanan BG. Evaluation of negation phrases in narrative clinical reports. In: Proceedings of the AMIA Symposium. Washington, DC: American Medical Informatics Association.2001. p. 105. Chapman WW, Bridewell W, Hanbury P, Cooper GF, Buchanan BG. Evaluation of negation phrases in narrative clinical reports. In: Proceedings of the AMIA Symposium. Washington, DC: American Medical Informatics Association.2001. p. 105.
25.
go back to reference Chapman WW, Bridewell W, Hanbury P, Cooper GF, Buchanan BG. A simple algorithm for identifying negated findings and diseases in discharge summaries. J Biomed Inform. 2001; 34(5):301–10.PubMedCrossRef Chapman WW, Bridewell W, Hanbury P, Cooper GF, Buchanan BG. A simple algorithm for identifying negated findings and diseases in discharge summaries. J Biomed Inform. 2001; 34(5):301–10.PubMedCrossRef
26.
go back to reference Harkema H, Dowling JN, Thornblade T, Chapman WW. Context: an algorithm for determining negation, experiencer, and temporal status from clinical reports. J Biomed Inform. 2009; 42(5):839–51.PubMedPubMedCentralCrossRef Harkema H, Dowling JN, Thornblade T, Chapman WW. Context: an algorithm for determining negation, experiencer, and temporal status from clinical reports. J Biomed Inform. 2009; 42(5):839–51.PubMedPubMedCentralCrossRef
27.
go back to reference Bard GV. Spelling-error tolerant, order-independent pass-phrases via the damerau-levenshtein string-edit distance metric. In: Proceedings of the Fifth Australasian Symposium on ACSW frontiers-Volume 68. Ballarat: Citeseer: 2007. p. 117–24. Bard GV. Spelling-error tolerant, order-independent pass-phrases via the damerau-levenshtein string-edit distance metric. In: Proceedings of the Fifth Australasian Symposium on ACSW frontiers-Volume 68. Ballarat: Citeseer: 2007. p. 117–24.
28.
go back to reference Krug M, Tu NDT, Weimer L, Reger I, Konle L, Jannidis F, Puppe F. Annotation and beyond – using athen annotation and text highlighting environment. In: DHd 2018.Cologne: Digital Humanities im deutschsprachigen Raum e.V.: 2018. Krug M, Tu NDT, Weimer L, Reger I, Konle L, Jannidis F, Puppe F. Annotation and beyond – using athen annotation and text highlighting environment. In: DHd 2018.Cologne: Digital Humanities im deutschsprachigen Raum e.V.: 2018.
29.
go back to reference National Center for Health Statistics. Analytic and Reporting Guidelines: The National Health and Nutrition Examination Survey (NHANES). https://www.cdc.gov/nchs/data/nhanes/nhanes_03_04/nhanes_analytic_guideli%nes_dec_2005.pdf. Accessed May 2018. National Center for Health Statistics. Analytic and Reporting Guidelines: The National Health and Nutrition Examination Survey (NHANES). https://​www.​cdc.​gov/​nchs/​data/​nhanes/​nhanes_​03_​04/​nhanes_​analytic_​guideli%nes_​dec_​2005.​pdf. Accessed May 2018.
30.
go back to reference Xu H, Jiang M, Oetjens M, Bowton EA, Ramirez AH, Jeff JM, Basford MA, Pulley JM, Cowan JD, Wang X, et al. Facilitating pharmacogenetic studies using electronic health records and natural-language processing: a case study of warfarin. J Am Med Inform Assoc. 2011; 18(4):387–91.PubMedPubMedCentralCrossRef Xu H, Jiang M, Oetjens M, Bowton EA, Ramirez AH, Jeff JM, Basford MA, Pulley JM, Cowan JD, Wang X, et al. Facilitating pharmacogenetic studies using electronic health records and natural-language processing: a case study of warfarin. J Am Med Inform Assoc. 2011; 18(4):387–91.PubMedPubMedCentralCrossRef
31.
go back to reference Xu H, Doan S, Birdwell KA, Cowan JD, Vincz AJ, Haas DW, Basford MA, Denny JC. An automated approach to calculating the daily dose of tacrolimus in electronic health records. Summit Transl Bioinforma. 2010; 2010:71. Xu H, Doan S, Birdwell KA, Cowan JD, Vincz AJ, Haas DW, Basford MA, Denny JC. An automated approach to calculating the daily dose of tacrolimus in electronic health records. Summit Transl Bioinforma. 2010; 2010:71.
32.
go back to reference Sohn S, Clark C, Halgrim SR, Murphy SP, Jonnalagadda SR, Wagholikar KB, Wu ST, Chute CG, Liu H. Analysis of cross-institutional medication description patterns in clinical narratives. Biomed Inform Insights. 2013; 6:11634.CrossRef Sohn S, Clark C, Halgrim SR, Murphy SP, Jonnalagadda SR, Wagholikar KB, Wu ST, Chute CG, Liu H. Analysis of cross-institutional medication description patterns in clinical narratives. Biomed Inform Insights. 2013; 6:11634.CrossRef
33.
go back to reference Jurafsky D, Martin JH. Speech and Language Processing, vol. 3.London: Pearson London; 2014. Jurafsky D, Martin JH. Speech and Language Processing, vol. 3.London: Pearson London; 2014.
34.
go back to reference Sarawagi S, et al. Information extraction. Found TrendsⓇDatabase. 2008; 1(3):261–377. Sarawagi S, et al. Information extraction. Found TrendsⓇDatabase. 2008; 1(3):261–377.
35.
go back to reference Harris PA, Taylor R, Thielke R, Payne J, Gonzalez N, Conde JG. Research electronic data capture (redcap)—a metadata-driven methodology and workflow process for providing translational research informatics support. J Biomed Inform. 2009; 42(2):377–81.PubMedCrossRef Harris PA, Taylor R, Thielke R, Payne J, Gonzalez N, Conde JG. Research electronic data capture (redcap)—a metadata-driven methodology and workflow process for providing translational research informatics support. J Biomed Inform. 2009; 42(2):377–81.PubMedCrossRef
36.
go back to reference Cuggia M, Garcelon N, Campillo-Gimenez B, Bernicot T, Laurent J-F, Garin E, Happe A, Duvauferrier R. Roogle: an information retrieval engine for clinical data warehouse. Stud health technol inform. 2011; 169:584–8. ISSN: 0926-9630.PubMed Cuggia M, Garcelon N, Campillo-Gimenez B, Bernicot T, Laurent J-F, Garin E, Happe A, Duvauferrier R. Roogle: an information retrieval engine for clinical data warehouse. Stud health technol inform. 2011; 169:584–8. ISSN: 0926-9630.PubMed
37.
go back to reference Lowe HJ, Ferris TA, Hernandez PM, Weber SC. Stride–an integrated standards-based translational research informatics platform. In: AMIA Annual Symposium Proceedings. San Francisco: American Medical Informatics Association: 2009. p. 391. Lowe HJ, Ferris TA, Hernandez PM, Weber SC. Stride–an integrated standards-based translational research informatics platform. In: AMIA Annual Symposium Proceedings. San Francisco: American Medical Informatics Association: 2009. p. 391.
38.
go back to reference Garcelon N, Neuraz A, Benoit V, Salomon R, Burgun A. Improving a full-text search engine: the importance of negation detection and family history context to identify cases in a biomedical data warehouse. J Am Med Inform Assoc. 2016; 24(3):607–13.PubMedCentral Garcelon N, Neuraz A, Benoit V, Salomon R, Burgun A. Improving a full-text search engine: the importance of negation detection and family history context to identify cases in a biomedical data warehouse. J Am Med Inform Assoc. 2016; 24(3):607–13.PubMedCentral
Metadata
Title
Replicating medication trend studies using ad hoc information extraction in a clinical data warehouse
Authors
Georg Dietrich
Jonathan Krebs
Leon Liman
Georg Fette
Maximilian Ertl
Mathias Kaspar
Stefan Störk
Frank Puppe
Publication date
01-12-2019
Publisher
BioMed Central
Published in
BMC Medical Informatics and Decision Making / Issue 1/2019
Electronic ISSN: 1472-6947
DOI
https://doi.org/10.1186/s12911-018-0729-0

Other articles of this Issue 1/2019

BMC Medical Informatics and Decision Making 1/2019 Go to the issue