Skip to main content
Top
Published in: BMC Infectious Diseases 1/2022

Open Access 01-12-2022 | Tuberculosis | Research

Development of diagnostic algorithm using machine learning for distinguishing between active tuberculosis and latent tuberculosis infection

Authors: Ying Luo, Ying Xue, Wei Liu, Huijuan Song, Yi Huang, Guoxing Tang, Feng Wang, Qi Wang, Yimin Cai, Ziyong Sun

Published in: BMC Infectious Diseases | Issue 1/2022

Login to get access

Abstract

Background

The discrimination between active tuberculosis (ATB) and latent tuberculosis infection (LTBI) remains challenging. The present study aims to investigate the value of diagnostic models established by machine learning based on multiple laboratory data for distinguishing Mycobacterium tuberculosis (Mtb) infection status.

Methods

T-SPOT, lymphocyte characteristic detection, and routine laboratory tests were performed on participants. Diagnostic models were built according to various algorithms.

Results

A total of 892 participants (468 ATB and 424 LTBI) and another 263 participants (125 ATB and 138 LTBI), were respectively enrolled at Tongji Hospital (discovery cohort) and Sino-French New City Hospital (validation cohort). Receiver operating characteristic (ROC) curve analysis showed that the value of individual indicator for differentiating ATB from LTBI was limited (area under the ROC curve (AUC) < 0.8). A total of 28 models were successfully established using machine learning. Among them, the AUCs of 25 models were more than 0.9 in test set. It was found that conditional random forests (cforest) model, based on the implementation of the random forest and bagging ensemble algorithms utilizing conditional inference trees as base learners, presented best discriminative power in segregating ATB from LTBI. Specially, cforest model presented an AUC of 0.978, with the sensitivity of 93.39% and the specificity of 91.18%. Mtb-specific response represented by early secreted antigenic target 6 (ESAT-6) and culture filtrate protein 10 (CFP-10) spot-forming cell (SFC) in T-SPOT assay, as well as global adaptive immunity assessed by CD4 cell IFN-γ secretion, CD8 cell IFN-γ secretion, and CD4 cell number, were found to contribute greatly to the cforest model. Superior performance obtained in the discovery cohort was further confirmed in the validation cohort. The sensitivity and specificity of cforest model in validation set were 92.80% and 89.86%, respectively.

Conclusions

Cforest model developed upon machine learning could serve as a valuable and prospective tool for identifying Mtb infection status. The present study provided a novel and viable idea for realizing the clinical diagnostic application of the combination of machine learning and laboratory findings.
Appendix
Available only for authorised users
Literature
2.
go back to reference Luo Y, Suliman S, Asgari S, Amariuta T, Baglaenko Y, Martinez-Bonet M, Ishigaki K, Gutierrez-Arcelus M, Calderon R, Lecca L, et al. Early progression to active tuberculosis is a highly heritable trait driven by 3q23 in Peruvians. Nat Commun. 2019;10(1):3765.CrossRef Luo Y, Suliman S, Asgari S, Amariuta T, Baglaenko Y, Martinez-Bonet M, Ishigaki K, Gutierrez-Arcelus M, Calderon R, Lecca L, et al. Early progression to active tuberculosis is a highly heritable trait driven by 3q23 in Peruvians. Nat Commun. 2019;10(1):3765.CrossRef
4.
go back to reference Gong W, Wu X. Differential diagnosis of latent tuberculosis infection and active tuberculosis: a key to a successful tuberculosis control strategy. Front Microbiol. 2021;12: 745592.CrossRef Gong W, Wu X. Differential diagnosis of latent tuberculosis infection and active tuberculosis: a key to a successful tuberculosis control strategy. Front Microbiol. 2021;12: 745592.CrossRef
6.
go back to reference Furin J, Cox H, Pai M. Tuberculosis. The Lancet. 2019;393(10181):1642–56.CrossRef Furin J, Cox H, Pai M. Tuberculosis. The Lancet. 2019;393(10181):1642–56.CrossRef
7.
go back to reference MacLean E, Broger T, Yerlikaya S, Fernandez-Carballo BL, Pai M, Denkinger CM. A systematic review of biomarkers to detect active tuberculosis. Nat Microbiol. 2019;4(5):748–58.CrossRef MacLean E, Broger T, Yerlikaya S, Fernandez-Carballo BL, Pai M, Denkinger CM. A systematic review of biomarkers to detect active tuberculosis. Nat Microbiol. 2019;4(5):748–58.CrossRef
9.
go back to reference Singhania A, Wilkinson RJ, Rodrigue M, Haldar P, O’Garra A. The value of transcriptomics in advancing knowledge of the immune response and diagnosis in tuberculosis. Nat Immunol. 2018;19(11):1159–68.CrossRef Singhania A, Wilkinson RJ, Rodrigue M, Haldar P, O’Garra A. The value of transcriptomics in advancing knowledge of the immune response and diagnosis in tuberculosis. Nat Immunol. 2018;19(11):1159–68.CrossRef
10.
go back to reference Turner CT, Gupta RK, Tsaliki E, Roe JK, Mondal P, Nyawo GR, Palmer Z, Miller RF, Reeve BW, Theron G, et al. Blood transcriptional biomarkers for active pulmonary tuberculosis in a high-burden setting: a prospective, observational, diagnostic accuracy study. Lancet Respir Med. 2020;8(4):407–19.CrossRef Turner CT, Gupta RK, Tsaliki E, Roe JK, Mondal P, Nyawo GR, Palmer Z, Miller RF, Reeve BW, Theron G, et al. Blood transcriptional biomarkers for active pulmonary tuberculosis in a high-burden setting: a prospective, observational, diagnostic accuracy study. Lancet Respir Med. 2020;8(4):407–19.CrossRef
11.
go back to reference Yang Q, Chen Q, Zhang M, Cai Y, Yang F, Zhang J, Deng G, Ye T, Deng Q, Li G, et al. Identification of eight-protein biosignature for diagnosis of tuberculosis. Thorax. 2020;75(7):576–83.CrossRef Yang Q, Chen Q, Zhang M, Cai Y, Yang F, Zhang J, Deng G, Ye T, Deng Q, Li G, et al. Identification of eight-protein biosignature for diagnosis of tuberculosis. Thorax. 2020;75(7):576–83.CrossRef
12.
go back to reference Togun T, Hoggart CJ, Agbla SC, Gomez MP, Egere U, Sillah AK, Saidy B, Mendy F, Pai M, Kampmann B. A three-marker protein biosignature distinguishes tuberculosis from other respiratory diseases in Gambian children. EBioMedicine. 2020;58: 102909.CrossRef Togun T, Hoggart CJ, Agbla SC, Gomez MP, Egere U, Sillah AK, Saidy B, Mendy F, Pai M, Kampmann B. A three-marker protein biosignature distinguishes tuberculosis from other respiratory diseases in Gambian children. EBioMedicine. 2020;58: 102909.CrossRef
13.
go back to reference Dai Y, Shan W, Yang Q, Guo J, Zhai R, Tang X, Tang L, Tan Y, Cai Y, Chen X. Biomarkers of iron metabolism facilitate clinical diagnosis in Mycobacterium tuberculosis infection. Thorax. 2019;74(12):1161–7.CrossRef Dai Y, Shan W, Yang Q, Guo J, Zhai R, Tang X, Tang L, Tan Y, Cai Y, Chen X. Biomarkers of iron metabolism facilitate clinical diagnosis in Mycobacterium tuberculosis infection. Thorax. 2019;74(12):1161–7.CrossRef
14.
go back to reference Albors-Vaquer A, Rizvi A, Matzapetakis M, Lamosa P, Coelho AV, Patel AB, Mande SC, Gaddam S, Pineda-Lucena A, Banerjee S, et al. Active and prospective latent tuberculosis are associated with different metabolomic profiles: clinical potential for the identification of rapid and non-invasive biomarkers. Emerg Microbes Infect. 2020;9(1):1131–9.CrossRef Albors-Vaquer A, Rizvi A, Matzapetakis M, Lamosa P, Coelho AV, Patel AB, Mande SC, Gaddam S, Pineda-Lucena A, Banerjee S, et al. Active and prospective latent tuberculosis are associated with different metabolomic profiles: clinical potential for the identification of rapid and non-invasive biomarkers. Emerg Microbes Infect. 2020;9(1):1131–9.CrossRef
15.
go back to reference Noursadeghi M, Gupta RK. New insights into the limitations of host transcriptional biomarkers of tuberculosis. Am J Respir Crit Care Med. 2021;204(12):1363–5.CrossRef Noursadeghi M, Gupta RK. New insights into the limitations of host transcriptional biomarkers of tuberculosis. Am J Respir Crit Care Med. 2021;204(12):1363–5.CrossRef
16.
go back to reference Luo Y, Xue Y, Lin Q, Tang G, Yuan X, Mao L, Song H, Wang F, Sun Z. A combination of iron metabolism indexes and tuberculosis-specific antigen/phytohemagglutinin ratio for distinguishing active tuberculosis from latent tuberculosis infection. Int J Infect Dis. 2020;97:190–6.CrossRef Luo Y, Xue Y, Lin Q, Tang G, Yuan X, Mao L, Song H, Wang F, Sun Z. A combination of iron metabolism indexes and tuberculosis-specific antigen/phytohemagglutinin ratio for distinguishing active tuberculosis from latent tuberculosis infection. Int J Infect Dis. 2020;97:190–6.CrossRef
17.
go back to reference Luo Y, Xue Y, Yuan X, Lin Q, Tang G, Mao L, Song H, Wang F, Sun Z. Combination of prealbumin and tuberculosis-specific antigen/phytohemagglutinin ratio for discriminating active tuberculosis from latent tuberculosis infection. Int J Clin Pract. 2021;75(4): e13831.CrossRef Luo Y, Xue Y, Yuan X, Lin Q, Tang G, Mao L, Song H, Wang F, Sun Z. Combination of prealbumin and tuberculosis-specific antigen/phytohemagglutinin ratio for discriminating active tuberculosis from latent tuberculosis infection. Int J Clin Pract. 2021;75(4): e13831.CrossRef
18.
go back to reference Luo Y, Tang G, Yuan X, Lin Q, Mao L, Song H, Xue Y, Wu S, Ouyang R, Hou H, et al. Combination of blood routine examination and T-SPOT.TB assay for distinguishing between active tuberculosis and latent tuberculosis infection. Front Cell Infect Microbiol. 2021;11:575650.CrossRef Luo Y, Tang G, Yuan X, Lin Q, Mao L, Song H, Xue Y, Wu S, Ouyang R, Hou H, et al. Combination of blood routine examination and T-SPOT.TB assay for distinguishing between active tuberculosis and latent tuberculosis infection. Front Cell Infect Microbiol. 2021;11:575650.CrossRef
19.
go back to reference Luo Y, Xue Y, Song H, Tang G, Liu W, Bai H, Yuan X, Tong S, Wang F, Cai Y, et al. Machine learning based on routine laboratory indicators promoting the discrimination between active tuberculosis and latent tuberculosis infection. J Infect. 2022;84:648–57.CrossRef Luo Y, Xue Y, Song H, Tang G, Liu W, Bai H, Yuan X, Tong S, Wang F, Cai Y, et al. Machine learning based on routine laboratory indicators promoting the discrimination between active tuberculosis and latent tuberculosis infection. J Infect. 2022;84:648–57.CrossRef
20.
go back to reference Luo Y, Xue Y, Cai Y, Lin Q, Tang G, Song H, Liu W, Mao L, Yuan X, Zhou Y, et al. Lymphocyte non-specific function detection facilitating the stratification of Mycobacterium tuberculosis infection. Front Immunol. 2021;12: 641378.CrossRef Luo Y, Xue Y, Cai Y, Lin Q, Tang G, Song H, Liu W, Mao L, Yuan X, Zhou Y, et al. Lymphocyte non-specific function detection facilitating the stratification of Mycobacterium tuberculosis infection. Front Immunol. 2021;12: 641378.CrossRef
21.
go back to reference Luo Y, Xue Y, Tang G, Cai Y, Yuan X, Lin Q, Song H, Liu W, Mao L, Zhou Y, et al. Lymphocyte-related immunological indicators for stratifying mycobacterium tuberculosis infection. Front Immunol. 2021;12: 658843.CrossRef Luo Y, Xue Y, Tang G, Cai Y, Yuan X, Lin Q, Song H, Liu W, Mao L, Zhou Y, et al. Lymphocyte-related immunological indicators for stratifying mycobacterium tuberculosis infection. Front Immunol. 2021;12: 658843.CrossRef
22.
go back to reference Luo Y, Xie Y, Zhang W, Lin Q, Tang G, Wu S, Huang M, Yin B, Huang J, Wei W, et al. Combination of lymphocyte number and function in evaluating host immunity. Aging. 2019;11(24):12685–707.CrossRef Luo Y, Xie Y, Zhang W, Lin Q, Tang G, Wu S, Huang M, Yin B, Huang J, Wei W, et al. Combination of lymphocyte number and function in evaluating host immunity. Aging. 2019;11(24):12685–707.CrossRef
23.
go back to reference DeLong ER, DeLong DM, Clarke-Pearson DL. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics. 1988;44(3):837–45.CrossRef DeLong ER, DeLong DM, Clarke-Pearson DL. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics. 1988;44(3):837–45.CrossRef
24.
go back to reference Luo Y, Tang G, Lin Q, Mao L, Xue Y, Yuan X, Ouyang R, Wu S, Yu J, Zhou Y, et al. Combination of mean spot sizes of ESAT-6 spot-forming cells and modified tuberculosis-specific antigen/phytohemagglutinin ratio of T-SPOT.TB assay in distinguishing between active tuberculosis and latent tuberculosis infection. J Infect. 2020;81(1):81–9.CrossRef Luo Y, Tang G, Lin Q, Mao L, Xue Y, Yuan X, Ouyang R, Wu S, Yu J, Zhou Y, et al. Combination of mean spot sizes of ESAT-6 spot-forming cells and modified tuberculosis-specific antigen/phytohemagglutinin ratio of T-SPOT.TB assay in distinguishing between active tuberculosis and latent tuberculosis infection. J Infect. 2020;81(1):81–9.CrossRef
25.
go back to reference Luo Y, Xue Y, Tang G, Lin Q, Song H, Liu W, Yin B, Huang J, Wei W, Mao L, et al. Combination of HLA-DR on Mycobacterium tuberculosis-specific cells and tuberculosis antigen/phytohemagglutinin ratio for discriminating active tuberculosis from latent tuberculosis infection. Front Immunol. 2021;12: 761209.CrossRef Luo Y, Xue Y, Tang G, Lin Q, Song H, Liu W, Yin B, Huang J, Wei W, Mao L, et al. Combination of HLA-DR on Mycobacterium tuberculosis-specific cells and tuberculosis antigen/phytohemagglutinin ratio for discriminating active tuberculosis from latent tuberculosis infection. Front Immunol. 2021;12: 761209.CrossRef
26.
go back to reference Xing Z, Ding W, Zhang S, Zhong L, Wang L, Wang J, Wang K, Xie Y, Zhao X, Li N, et al. Machine learning-based differentiation of nontuberculous mycobacteria lung disease and pulmonary tuberculosis using CT images. Biomed Res Int. 2020;2020:6287545.CrossRef Xing Z, Ding W, Zhang S, Zhong L, Wang L, Wang J, Wang K, Xie Y, Zhao X, Li N, et al. Machine learning-based differentiation of nontuberculous mycobacteria lung disease and pulmonary tuberculosis using CT images. Biomed Res Int. 2020;2020:6287545.CrossRef
27.
go back to reference Zhou Z, Zhou X, Cheng L, Wen L, An T, Gao H, Deng H, Yan Q, Zhang X, Li Y, et al. Machine learning algorithms utilizing blood parameters enable early detection of immunethrombotic dysregulation in COVID-19. Clin Transl Med. 2021;11(9): e523.CrossRef Zhou Z, Zhou X, Cheng L, Wen L, An T, Gao H, Deng H, Yan Q, Zhang X, Li Y, et al. Machine learning algorithms utilizing blood parameters enable early detection of immunethrombotic dysregulation in COVID-19. Clin Transl Med. 2021;11(9): e523.CrossRef
28.
go back to reference Liao CH, Lai CC, Tan CK, Chou CH, Hsu HL, Tasi TH, Huang YT, Hsueh PR. False-negative results by enzyme-linked immunospot assay for interferon-gamma among patients with culture-confirmed tuberculosis. J Infect. 2009;59(6):421–3.CrossRef Liao CH, Lai CC, Tan CK, Chou CH, Hsu HL, Tasi TH, Huang YT, Hsueh PR. False-negative results by enzyme-linked immunospot assay for interferon-gamma among patients with culture-confirmed tuberculosis. J Infect. 2009;59(6):421–3.CrossRef
29.
go back to reference Nguyen DT, Teeter LD, Graves J, Graviss EA. Characteristics Associated with Negative Interferon-gamma Release Assay Results in Culture-Confirmed Tuberculosis Patients, Texas, USA, 2013–2015. Emerg Infect Dis. 2018;24(3):534–40.CrossRef Nguyen DT, Teeter LD, Graves J, Graviss EA. Characteristics Associated with Negative Interferon-gamma Release Assay Results in Culture-Confirmed Tuberculosis Patients, Texas, USA, 2013–2015. Emerg Infect Dis. 2018;24(3):534–40.CrossRef
30.
go back to reference Pan L, Jia H, Liu F, Sun H, Gao M, Du F, Xing A, Du B, Sun Q, Wei R et al: Risk factors for false-negative T-SPOT.TB assay results in patients with pulmonary and extra-pulmonary TB. J Infect 2015, 70(4):367–380. Pan L, Jia H, Liu F, Sun H, Gao M, Du F, Xing A, Du B, Sun Q, Wei R et al: Risk factors for false-negative T-SPOT.TB assay results in patients with pulmonary and extra-pulmonary TB. J Infect 2015, 70(4):367–380.
31.
go back to reference Lau A, Lin C, Barrie J, Winter C, Armstrong G, Egedahl ML, Doroshenko A, Heffernan C, Asadi L, Fisher D, et al. The radiographic and mycobacteriologic correlates of subclinical pulmonary TB in Canada: a retrospective cohort study. Chest. 2022;162(2):309–20.CrossRef Lau A, Lin C, Barrie J, Winter C, Armstrong G, Egedahl ML, Doroshenko A, Heffernan C, Asadi L, Fisher D, et al. The radiographic and mycobacteriologic correlates of subclinical pulmonary TB in Canada: a retrospective cohort study. Chest. 2022;162(2):309–20.CrossRef
32.
go back to reference Drain PK, Bajema KL, Dowdy D, Dheda K, Naidoo K, Schumacher SG, Ma S, Meermeier E, Lewinsohn DM, Sherman DR. Incipient and subclinical tuberculosis: a clinical review of early stages and progression of infection. Clin Microbiol Rev. 2018;31(4):e00021.CrossRef Drain PK, Bajema KL, Dowdy D, Dheda K, Naidoo K, Schumacher SG, Ma S, Meermeier E, Lewinsohn DM, Sherman DR. Incipient and subclinical tuberculosis: a clinical review of early stages and progression of infection. Clin Microbiol Rev. 2018;31(4):e00021.CrossRef
33.
go back to reference Cai Y, Dai Y, Wang Y, Yang Q, Guo J, Wei C, Chen W, Huang H, Zhu J, Zhang C, et al. Single-cell transcriptomics of blood reveals a natural killer cell subset depletion in tuberculosis. EBioMedicine. 2020;53: 102686.CrossRef Cai Y, Dai Y, Wang Y, Yang Q, Guo J, Wei C, Chen W, Huang H, Zhu J, Zhang C, et al. Single-cell transcriptomics of blood reveals a natural killer cell subset depletion in tuberculosis. EBioMedicine. 2020;53: 102686.CrossRef
34.
go back to reference Garay-Baquero DJ, White CH, Walker NF, Tebruegge M, Schiff HF, Ugarte-Gil C, Morris-Jones S, Marshall BG, Manousopoulou A, Adamson J, et al. Comprehensive plasma proteomic profiling reveals biomarkers for active tuberculosis. JCI Insight. 2020;5(18): e137427.CrossRef Garay-Baquero DJ, White CH, Walker NF, Tebruegge M, Schiff HF, Ugarte-Gil C, Morris-Jones S, Marshall BG, Manousopoulou A, Adamson J, et al. Comprehensive plasma proteomic profiling reveals biomarkers for active tuberculosis. JCI Insight. 2020;5(18): e137427.CrossRef
35.
go back to reference Ho J, Bokil NJ, Nguyen PTB, Nguyen TA, Liu MY, Hare N, Fox GJ, Saunders BM, Marks GB, Britton WJ. A transcriptional blood signature distinguishes early tuberculosis disease from latent tuberculosis infection and uninfected individuals in a Vietnamese cohort. J Infect. 2020;81(1):72–80.CrossRef Ho J, Bokil NJ, Nguyen PTB, Nguyen TA, Liu MY, Hare N, Fox GJ, Saunders BM, Marks GB, Britton WJ. A transcriptional blood signature distinguishes early tuberculosis disease from latent tuberculosis infection and uninfected individuals in a Vietnamese cohort. J Infect. 2020;81(1):72–80.CrossRef
36.
go back to reference Hoang LT, Jain P, Pillay TD, Tolosa-Wright M, Niazi U, Takwoingi Y, Halliday A, Berrocal-Almanza LC, Deeks JJ, Beverley P, et al. Transcriptomic signatures for diagnosing tuberculosis in clinical practice: a prospective, multicentre cohort study. Lancet Infect Dis. 2021;21(3):366–75.CrossRef Hoang LT, Jain P, Pillay TD, Tolosa-Wright M, Niazi U, Takwoingi Y, Halliday A, Berrocal-Almanza LC, Deeks JJ, Beverley P, et al. Transcriptomic signatures for diagnosing tuberculosis in clinical practice: a prospective, multicentre cohort study. Lancet Infect Dis. 2021;21(3):366–75.CrossRef
37.
go back to reference Mpande CAM, Musvosvi M, Rozot V, Mosito B, Reid TD, Schreuder C, Lloyd T, Bilek N, Huang H, Obermoser G, et al. Antigen-specific T-cell activation distinguishes between recent and remote tuberculosis infection. Am J Respir Crit Care Med. 2021;203(12):1556–65.CrossRef Mpande CAM, Musvosvi M, Rozot V, Mosito B, Reid TD, Schreuder C, Lloyd T, Bilek N, Huang H, Obermoser G, et al. Antigen-specific T-cell activation distinguishes between recent and remote tuberculosis infection. Am J Respir Crit Care Med. 2021;203(12):1556–65.CrossRef
Metadata
Title
Development of diagnostic algorithm using machine learning for distinguishing between active tuberculosis and latent tuberculosis infection
Authors
Ying Luo
Ying Xue
Wei Liu
Huijuan Song
Yi Huang
Guoxing Tang
Feng Wang
Qi Wang
Yimin Cai
Ziyong Sun
Publication date
01-12-2022
Publisher
BioMed Central
Published in
BMC Infectious Diseases / Issue 1/2022
Electronic ISSN: 1471-2334
DOI
https://doi.org/10.1186/s12879-022-07954-7

Other articles of this Issue 1/2022

BMC Infectious Diseases 1/2022 Go to the issue
Obesity Clinical Trial Summary

At a glance: The STEP trials

A round-up of the STEP phase 3 clinical trials evaluating semaglutide for weight loss in people with overweight or obesity.

Developed by: Springer Medicine

Highlights from the ACC 2024 Congress

Year in Review: Pediatric cardiology

Watch Dr. Anne Marie Valente present the last year's highlights in pediatric and congenital heart disease in the official ACC.24 Year in Review session.

Year in Review: Pulmonary vascular disease

The last year's highlights in pulmonary vascular disease are presented by Dr. Jane Leopold in this official video from ACC.24.

Year in Review: Valvular heart disease

Watch Prof. William Zoghbi present the last year's highlights in valvular heart disease from the official ACC.24 Year in Review session.

Year in Review: Heart failure and cardiomyopathies

Watch this official video from ACC.24. Dr. Biykem Bozkurt discuss last year's major advances in heart failure and cardiomyopathies.