Skip to main content
Top
Published in: Journal of Translational Medicine 1/2020

Open Access 01-12-2020 | Diseases of the neuromuscular synapses and muscles | Research

A machine learning-based clinical tool for diagnosing myopathy using multi-cohort microarray expression profiles

Authors: Andrew Tran, Chris J. Walsh, Jane Batt, Claudia C. dos Santos, Pingzhao Hu

Published in: Journal of Translational Medicine | Issue 1/2020

Login to get access

Abstract

Background

Myopathies are a heterogenous collection of disorders characterized by dysfunction of skeletal muscle. In practice, myopathies are frequently encountered by physicians and precise diagnosis remains a challenge in primary care. Molecular expression profiles show promise for disease diagnosis in various pathologies. We propose a novel machine learning-based clinical tool for predicting muscle disease subtypes using multi-cohort microarray expression data.

Materials and methods

Muscle tissue samples originating from 1260 patients with muscle weakness. Data was curated from 42 independent cohorts with expression profiles in public microarray gene expression repositories, which represent a broad range of patient ages and peripheral muscles. Cohorts were categorized into five muscle disease subtypes: immobility, inflammatory myopathies, intensive care unit acquired weakness (ICUAW), congenital, and chronic systemic disease. The data contains expression data on 34,099 genes. Data augmentation techniques were used to address class imbalances in the muscle disease subtypes. Support vector machine (SVM) models were trained on two-thirds of the 1260 samples based on the top selected gene signature using analysis of variance (ANOVA). The model was validated in the remaining samples using area under the receiver operator curve (AUC). Gene enrichment analysis was used to identify enriched biological functions in the gene signature.

Results

The AUC ranges from 0.611 to 0.649 in the observed imbalanced data. Overall, using the augmented data, chronic systemic disease was the best predicted class with AUC 0.872 (95% confidence interval (CI): 0.824–0.920). The least discriminated classes were ICUAW with AUC 0.777 (95% CI: 0.668–0.887) and immobility with AUC 0.789 (95% CI: 0.716–0.861). Disease-specific gene set enrichment results showed that the gene signature was enriched in biological processes including neural precursor cell proliferation for ICUAW and aerobic respiration for congenital (false discovery rate q-value < 0.001).

Conclusion

Our results present a well-performing molecular classification tool with the selected gene markers for muscle disease classification. In practice, this tool addresses an important gap in the literature on myopathies and presents a potentially useful clinical tool for muscle disease subtype diagnosis.
Appendix
Available only for authorised users
Literature
1.
go back to reference Domingo-horne RM, Salajegheh MK. An approach to myopathy for the primary care clinician. Am J Med. 2017;131:237–43.CrossRef Domingo-horne RM, Salajegheh MK. An approach to myopathy for the primary care clinician. Am J Med. 2017;131:237–43.CrossRef
2.
go back to reference Parry SM, Puthucheary ZA. The impact of extended bed rest on the musculoskeletal system in the critical care environment. Extrem Physiol Med. 2015;4:1–8.CrossRef Parry SM, Puthucheary ZA. The impact of extended bed rest on the musculoskeletal system in the critical care environment. Extrem Physiol Med. 2015;4:1–8.CrossRef
3.
go back to reference Schmidt J. Current classification and management of inflammatory myopathies. J Neuromuscul Dis. 2018;5:109–29.CrossRef Schmidt J. Current classification and management of inflammatory myopathies. J Neuromuscul Dis. 2018;5:109–29.CrossRef
4.
go back to reference Cassandrini D, et al. Congenital myopathies: clinical phenotypes and new diagnostic tools. Ital J Pediatr. 2017;43:1–16.CrossRef Cassandrini D, et al. Congenital myopathies: clinical phenotypes and new diagnostic tools. Ital J Pediatr. 2017;43:1–16.CrossRef
6.
go back to reference Pérez-Baos S, et al. Mediators and patterns of muscle loss in chronic systemic inflammation. Front Physiol. 2018;9:1–14.CrossRef Pérez-Baos S, et al. Mediators and patterns of muscle loss in chronic systemic inflammation. Front Physiol. 2018;9:1–14.CrossRef
7.
go back to reference Malone ER, Oliva M, Sabatini PJB, Stockley TL, Siu LL. Molecular profiling for precision cancer therapies. Genome Med. 2020;12:8.CrossRef Malone ER, Oliva M, Sabatini PJB, Stockley TL, Siu LL. Molecular profiling for precision cancer therapies. Genome Med. 2020;12:8.CrossRef
8.
go back to reference Cho SB, Won HH. Machine learning in DNA microarray analysis for cancer classification. Proc First Asia-Pacific Bioinforma Conf Bioinforma. 2003;19:189–98. Cho SB, Won HH. Machine learning in DNA microarray analysis for cancer classification. Proc First Asia-Pacific Bioinforma Conf Bioinforma. 2003;19:189–98.
9.
go back to reference Walsh et al. Multi-cohort transcriptional meta-analysis identifies a signature of muscle disease severity. Manuscript in submission. Walsh et al. Multi-cohort transcriptional meta-analysis identifies a signature of muscle disease severity. Manuscript in submission.
10.
go back to reference Sweeney TE, Shidham A, Wong H, Khatri P. A comprehensive time-based meta-analysis of sirs and sepsis reveals a robust discriminatory gene set. Crit Care Med. 2014;7:1–33. Sweeney TE, Shidham A, Wong H, Khatri P. A comprehensive time-based meta-analysis of sirs and sepsis reveals a robust discriminatory gene set. Crit Care Med. 2014;7:1–33.
11.
go back to reference Lofgren S, et al. Integrated, multicohort analysis of systemic sclerosis identifies robust transcriptional signature of disease severity. JCI Insight. 2016;1:1–12.CrossRef Lofgren S, et al. Integrated, multicohort analysis of systemic sclerosis identifies robust transcriptional signature of disease severity. JCI Insight. 2016;1:1–12.CrossRef
12.
go back to reference Quackenbush J. Microarray data normalization and transformation. Nat Genet. 2002;32:496–501.CrossRef Quackenbush J. Microarray data normalization and transformation. Nat Genet. 2002;32:496–501.CrossRef
13.
go back to reference Jakobsen JC, Gluud C, Wetterslev J, Winkel P. When and how should multiple imputation be used for handling missing data in randomised clinical trials—a practical guide with flowcharts. BMC Med Res Methodol. 2017;17:1–10.CrossRef Jakobsen JC, Gluud C, Wetterslev J, Winkel P. When and how should multiple imputation be used for handling missing data in randomised clinical trials—a practical guide with flowcharts. BMC Med Res Methodol. 2017;17:1–10.CrossRef
16.
go back to reference Chawla N, Bowyer K, Hall L, Kegelmeyer W. SMOTE: synthetic minority over-sampling technique. J Artif Intell Res. 2002;2002:321–57.CrossRef Chawla N, Bowyer K, Hall L, Kegelmeyer W. SMOTE: synthetic minority over-sampling technique. J Artif Intell Res. 2002;2002:321–57.CrossRef
17.
go back to reference Miao J, Niu L. A survey on feature selection. Procedia Comput Sci. 2016;91:919–26.CrossRef Miao J, Niu L. A survey on feature selection. Procedia Comput Sci. 2016;91:919–26.CrossRef
18.
go back to reference Aly M. Survey on multiclass classification methods. Neural Netw. 2005;4:1–9. Aly M. Survey on multiclass classification methods. Neural Netw. 2005;4:1–9.
19.
go back to reference Müller AC, et al. Pathway enrichment analysis and visualization of omics data using g:Profiler, GSEA, Cytoscape and EnrichmentMap. Nat Protoc. 2019;22:924–34. Müller AC, et al. Pathway enrichment analysis and visualization of omics data using g:Profiler, GSEA, Cytoscape and EnrichmentMap. Nat Protoc. 2019;22:924–34.
21.
go back to reference Burlina P, Billings S, Joshi N, Albayda J. Automated diagnosis of myositis from muscle ultrasound: exploring the use of machine learning and deep learning methods. PLoS ONE. 2017;12:1–15.CrossRef Burlina P, Billings S, Joshi N, Albayda J. Automated diagnosis of myositis from muscle ultrasound: exploring the use of machine learning and deep learning methods. PLoS ONE. 2017;12:1–15.CrossRef
22.
go back to reference Richard A, John K. Intensive care unit-acquired weakness. Contin Educ Anaesthesia Crit Care Pain. 2012;12:62–6.CrossRef Richard A, John K. Intensive care unit-acquired weakness. Contin Educ Anaesthesia Crit Care Pain. 2012;12:62–6.CrossRef
23.
go back to reference Reynolds BA, Weiss S. Generation of neurons and astrocytes from isolated cells of the adult mammalian central nervous system. Science. 2016;255:1707–10.CrossRef Reynolds BA, Weiss S. Generation of neurons and astrocytes from isolated cells of the adult mammalian central nervous system. Science. 2016;255:1707–10.CrossRef
24.
go back to reference Dibajnia P, Morshead CM. Role of neural precursor cells in promoting repair following stroke. Acta Pharmacol Sin. 2013;34:78–90.CrossRef Dibajnia P, Morshead CM. Role of neural precursor cells in promoting repair following stroke. Acta Pharmacol Sin. 2013;34:78–90.CrossRef
25.
go back to reference Kim HJ, Sun W. Adult neurogenesis in the central and peripheral nervous systems. Int Neurourol J. 2012;16:57–61.CrossRef Kim HJ, Sun W. Adult neurogenesis in the central and peripheral nervous systems. Int Neurourol J. 2012;16:57–61.CrossRef
26.
go back to reference Berardo A, DiMauro S, Hirano M. Diagnostic algorithm for metabolic myopathies. Curr Neurol Neurosci Rep. 2010;10:118.CrossRef Berardo A, DiMauro S, Hirano M. Diagnostic algorithm for metabolic myopathies. Curr Neurol Neurosci Rep. 2010;10:118.CrossRef
27.
go back to reference Jungbluth H, Ochala J, Treves S, Gautel M. Current and future therapeutic approaches to the congenital myopathies. Semin Cell Dev Biol. 2017;64:191–200.CrossRef Jungbluth H, Ochala J, Treves S, Gautel M. Current and future therapeutic approaches to the congenital myopathies. Semin Cell Dev Biol. 2017;64:191–200.CrossRef
Metadata
Title
A machine learning-based clinical tool for diagnosing myopathy using multi-cohort microarray expression profiles
Authors
Andrew Tran
Chris J. Walsh
Jane Batt
Claudia C. dos Santos
Pingzhao Hu
Publication date
01-12-2020
Publisher
BioMed Central
Published in
Journal of Translational Medicine / Issue 1/2020
Electronic ISSN: 1479-5876
DOI
https://doi.org/10.1186/s12967-020-02630-3

Other articles of this Issue 1/2020

Journal of Translational Medicine 1/2020 Go to the issue