Skip to main content
Top
Published in: Digestive Diseases and Sciences 3/2020

01-03-2020 | Respiratory Microbiota | Review

Gut-host Crosstalk: Methodological and Computational Challenges

Author: Ivan Ivanov

Published in: Digestive Diseases and Sciences | Issue 3/2020

Login to get access

Abstract

Understanding how health-promoting microbiota are established and their beneficial interactions with the host is of critical biomedical importance. The current high throughput data acquisition technologies allow for integrating components of the gut ecosystem. The richness of data types and large number of measured variables involved underscores the critical importance of the appropriate choice of analytical and computational methods that can be used to model this complex ecosystem. This review outlines currently used approaches to perform analyses of data obtained as a result of interrogation of the gut-microbiota ecosystem and the challenges associated with these methodological and computational efforts. The problem of large dimensionality versus small numbers of samples is explained with discussions of clustering, dimensionality reduction, and statistical testing. Predictive modeling and data integration specific to the gut ecosystem are also discussed.
Literature
5.
go back to reference Hooper LV, Midtvedt T, Gordon JI. How host-microbial interactions shape the nutrient environment of the mammalian intestine. Annu Rev Nutr. 2002;22:283–307.PubMed Hooper LV, Midtvedt T, Gordon JI. How host-microbial interactions shape the nutrient environment of the mammalian intestine. Annu Rev Nutr. 2002;22:283–307.PubMed
7.
go back to reference Knight R. Follow your gut: the enormous impact of tiny microbes. New York: Simon & Schuster/TED; 2015. Knight R. Follow your gut: the enormous impact of tiny microbes. New York: Simon & Schuster/TED; 2015.
14.
go back to reference Ley RE, Bäckhed F, Turnbaugh P, Lozupone CA, Knight RD, et al. Obesity alters gut microbial ecology. Proc Natl Acad Sci USA. 2005;102:11070–11075.PubMedPubMedCentral Ley RE, Bäckhed F, Turnbaugh P, Lozupone CA, Knight RD, et al. Obesity alters gut microbial ecology. Proc Natl Acad Sci USA. 2005;102:11070–11075.PubMedPubMedCentral
15.
go back to reference Frank DN, St Amand AL, Feldman RA, et al. Molecular-phylogenetic characterization of microbial community imbalances in human inflammatory bowel diseases. Proc Natl Acad Sci USA. 2007;104:13780–13785.PubMedPubMedCentral Frank DN, St Amand AL, Feldman RA, et al. Molecular-phylogenetic characterization of microbial community imbalances in human inflammatory bowel diseases. Proc Natl Acad Sci USA. 2007;104:13780–13785.PubMedPubMedCentral
31.
go back to reference Davidson LA, Jiang YH, Lupton JR, Chapkin RS. Noninvasive detection of putative biomarkers for colon cancer using fecal messenger RNA. Cancer Epidemiol Biomark Prev. 1995;4:643–647. PMID: 8547831. Davidson LA, Jiang YH, Lupton JR, Chapkin RS. Noninvasive detection of putative biomarkers for colon cancer using fecal messenger RNA. Cancer Epidemiol Biomark Prev. 1995;4:643–647. PMID: 8547831.
32.
go back to reference Afgan E, Baker D, van den Beek M, et al. The galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2016 update. Nucleic Acids Res. 2016;44:W3–W10.PubMedPubMedCentral Afgan E, Baker D, van den Beek M, et al. The galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2016 update. Nucleic Acids Res. 2016;44:W3–W10.PubMedPubMedCentral
36.
go back to reference Chong J, Soufan O, Li C, et al. Using MetaboAnalyst 4.0 for comprehensive and integrative metabolomics data analysis. Nucleic Acids Res. 2018;46:W486–W494.PubMedPubMedCentral Chong J, Soufan O, Li C, et al. Using MetaboAnalyst 4.0 for comprehensive and integrative metabolomics data analysis. Nucleic Acids Res. 2018;46:W486–W494.PubMedPubMedCentral
38.
go back to reference Everitt B. Cluster analysis. Chichester: Wiley; 2011. Everitt B. Cluster analysis. Chichester: Wiley; 2011.
39.
go back to reference Lior R, Maimon O. Clustering methods. Data mining and knowledge discovery handbook. New York: Springer; 2005. Lior R, Maimon O. Clustering methods. Data mining and knowledge discovery handbook. New York: Springer; 2005.
40.
go back to reference Claeskens G, Hjort NL. Model selection and model averaging. Cambridge: Cambridge University Press; 2008. Claeskens G, Hjort NL. Model selection and model averaging. Cambridge: Cambridge University Press; 2008.
42.
go back to reference James CB. Pattern recognition with fuzzy objective function algorithms. New York: Springer; 1981. James CB. Pattern recognition with fuzzy objective function algorithms. New York: Springer; 1981.
44.
go back to reference Devroye L, Györfi L, Lugosi G. A probabilistic theory of pattern recognition. New York: Springer; 1996. Devroye L, Györfi L, Lugosi G. A probabilistic theory of pattern recognition. New York: Springer; 1996.
45.
go back to reference Duda RO, Hart PE, Stork DG. Pattern classification. New York: Wiley; 2001. Duda RO, Hart PE, Stork DG. Pattern classification. New York: Wiley; 2001.
50.
go back to reference Boehmke B, Greenwell BM. Hands-on machine learning with R. London: Chapman & Hall; 2019. Boehmke B, Greenwell BM. Hands-on machine learning with R. London: Chapman & Hall; 2019.
51.
go back to reference Cox TF, Cox MA. Multidimensional scaling. London: Chapman and Hall; 2001. Cox TF, Cox MA. Multidimensional scaling. London: Chapman and Hall; 2001.
52.
go back to reference Jolliffe IT. Principal component analysis. New York: Springer; 2002. Jolliffe IT. Principal component analysis. New York: Springer; 2002.
53.
go back to reference Zomorodian A, Carlsson G. Computing persistent hodmology. Discrete Comput Geom. 2004;33:249–274. Zomorodian A, Carlsson G. Computing persistent hodmology. Discrete Comput Geom. 2004;33:249–274.
54.
go back to reference Braga-Neto U, Dougherty ER. Classification. In: Dougherty ER, Shmulevich I, Chen J, Wang ZJ, eds. Genomic Signal Processing and Statistics., EURASIP book series on signal processing and communicationLondon: Hindawi Publishing Corporation; 2005. Braga-Neto U, Dougherty ER. Classification. In: Dougherty ER, Shmulevich I, Chen J, Wang ZJ, eds. Genomic Signal Processing and Statistics., EURASIP book series on signal processing and communicationLondon: Hindawi Publishing Corporation; 2005.
55.
go back to reference Braga-Neto U, Dougherty ER. Exact performance measures and distributions of error estimators for discrete classifiers. Pattern Recognit. 2005;38:1799–1814. Braga-Neto U, Dougherty ER. Exact performance measures and distributions of error estimators for discrete classifiers. Pattern Recognit. 2005;38:1799–1814.
56.
go back to reference Fu WJ, Dougherty ER, Mallick B, Carroll RJ. How many samples are needed to build a classifier: a general sequential approach. Bioinformatics. 2005;21:63–70.PubMed Fu WJ, Dougherty ER, Mallick B, Carroll RJ. How many samples are needed to build a classifier: a general sequential approach. Bioinformatics. 2005;21:63–70.PubMed
57.
go back to reference Dougherty ER, Barrera J. Pattern recognition theory in nonlinear signal processing. Math Imaging Vis. 2002;16:181–197. Dougherty ER, Barrera J. Pattern recognition theory in nonlinear signal processing. Math Imaging Vis. 2002;16:181–197.
58.
go back to reference Braga-Neto UM, Dougherty ER. Error estimation for pattern recognition. New York: Wiley; 2015. Braga-Neto UM, Dougherty ER. Error estimation for pattern recognition. New York: Wiley; 2015.
59.
go back to reference Hanczar B, Hua J, Dougherty ER. Decorrelation of the true and estimated classifier errors in high-dimensional settings. EURASIP J Bioinf Syst Biol. 2007;12:2007. Hanczar B, Hua J, Dougherty ER. Decorrelation of the true and estimated classifier errors in high-dimensional settings. EURASIP J Bioinf Syst Biol. 2007;12:2007.
60.
go back to reference Zollanvari A, Braga-Neto UM, Dougherty ER. On the sampling distribution of resubstitution and leave-one-out error estimators for linear classifiers. Pattern Recognit.. 2009;42:2705–2723. Zollanvari A, Braga-Neto UM, Dougherty ER. On the sampling distribution of resubstitution and leave-one-out error estimators for linear classifiers. Pattern Recognit.. 2009;42:2705–2723.
62.
go back to reference Knight JM, Ivanov I, Dougherty ER. MCMC implementation of the optimal Bayesian classifier for non-Gaussian models: model-based RNA-Seq classification. BMC Bioinform. 2014;15:401. PMID: 25491122. Knight JM, Ivanov I, Dougherty ER. MCMC implementation of the optimal Bayesian classifier for non-Gaussian models: model-based RNA-Seq classification. BMC Bioinform. 2014;15:401. PMID: 25491122.
63.
go back to reference Dalton LA, Dougherty ER. Optimal classifiers with minimum expected error within a Bayesian framework—parts I & II. Pattern Recognit. 2013;46:1301–1314. Dalton LA, Dougherty ER. Optimal classifiers with minimum expected error within a Bayesian framework—parts I & II. Pattern Recognit. 2013;46:1301–1314.
64.
go back to reference Robert CP, Casella G. Monte Carlo statistical methods. New York: Springer; 2004. Robert CP, Casella G. Monte Carlo statistical methods. New York: Springer; 2004.
65.
go back to reference Bahadorinejad A, Ivanov I, Lampe J, et al. Bayesian classification of microbial communities based on 16S rRNA metagenomic data. BioRxiv, 340653; 2018. Bahadorinejad A, Ivanov I, Lampe J, et al. Bayesian classification of microbial communities based on 16S rRNA metagenomic data. BioRxiv, 340653; 2018.
66.
go back to reference Dunkler D, Sánchez-Cabo F, Heinze G. Statistical analysis principles for Omics data. Methods Mol Biol. 2011;719:113–131.PubMed Dunkler D, Sánchez-Cabo F, Heinze G. Statistical analysis principles for Omics data. Methods Mol Biol. 2011;719:113–131.PubMed
67.
go back to reference Lan Zeng IS, Thomas LT. Review of statistical learning methods in integrated omics studies (an integrated information science). Bioinform Biol Insights. 2018;12:1177932218759292. Lan Zeng IS, Thomas LT. Review of statistical learning methods in integrated omics studies (an integrated information science). Bioinform Biol Insights. 2018;12:1177932218759292.
79.
go back to reference Mortveit HS, Reidys CM. An introduction to sequential dynamical systems. Universitext: Springer; 2007. Mortveit HS, Reidys CM. An introduction to sequential dynamical systems. Universitext: Springer; 2007.
80.
go back to reference Zhao C, Ivanov I, Bittner ML, Dougherty ER. Pathway regulatory analysis in the context of Bayesian networks using the coefficient of determination. J Biol Syst. 2011;19:651–682. Zhao C, Ivanov I, Bittner ML, Dougherty ER. Pathway regulatory analysis in the context of Bayesian networks using the coefficient of determination. J Biol Syst. 2011;19:651–682.
81.
go back to reference Jensen FV, Nielsen TD. Bayesian networks and decision graphs., Information science and statistics seriesNew York: Springer; 2007. Jensen FV, Nielsen TD. Bayesian networks and decision graphs., Information science and statistics seriesNew York: Springer; 2007.
82.
go back to reference Ivanov I. Boolean models of genomic regulatory networks: reduction mappings, inference, and external control. Curr Genom. 2009;10:375–387. Ivanov I. Boolean models of genomic regulatory networks: reduction mappings, inference, and external control. Curr Genom. 2009;10:375–387.
83.
go back to reference Shmulevich I, Dougherty ER. Genomic signal processing. Princeton: Princeton University Press; 2007. Shmulevich I, Dougherty ER. Genomic signal processing. Princeton: Princeton University Press; 2007.
84.
go back to reference Shmulevich I, Dougherty ER, Kim S, Zhang W. Probabilistic Boolean networks: a rule-based uncertainty model for gene regulatory networks. Bioinformatics. 2002;18:261–274.PubMed Shmulevich I, Dougherty ER, Kim S, Zhang W. Probabilistic Boolean networks: a rule-based uncertainty model for gene regulatory networks. Bioinformatics. 2002;18:261–274.PubMed
85.
go back to reference Dougherty ER, Kim S, Chen Y. Coeffcient of determination in nonlinear signal processing. Signal Process. 2000;80:2219–2235. Dougherty ER, Kim S, Chen Y. Coeffcient of determination in nonlinear signal processing. Signal Process. 2000;80:2219–2235.
86.
go back to reference Kim E, Ivanov I, Dougherty ER. Quantifying the notions of canalizing and master genes in a gene regulatory network—a Boolean network modeling perspective. Bioinformatics. 2018;35:643–649.PubMedCentral Kim E, Ivanov I, Dougherty ER. Quantifying the notions of canalizing and master genes in a gene regulatory network—a Boolean network modeling perspective. Bioinformatics. 2018;35:643–649.PubMedCentral
87.
go back to reference Dougherty ER, Brun M, Trent JM, Bittner ML. Conditioning-based modeling of contextual genomic regulation. IEEE/ACM Trans Comput Biol Bioinform. 2009;6:310–320.PubMed Dougherty ER, Brun M, Trent JM, Bittner ML. Conditioning-based modeling of contextual genomic regulation. IEEE/ACM Trans Comput Biol Bioinform. 2009;6:310–320.PubMed
88.
go back to reference Zhou X, Wang X, Pal R, et al. A bayesian connectivity-based approach to constructing probabilistic gene regulatory networks. Bioinformatics. 2004;20:2918–2927.PubMed Zhou X, Wang X, Pal R, et al. A bayesian connectivity-based approach to constructing probabilistic gene regulatory networks. Bioinformatics. 2004;20:2918–2927.PubMed
89.
go back to reference Faryabi B, Vahedi G, Datta A, Chamberland JF, Dougherty ER. Recent advances in intervention in markovian regulatory networks. Curr Genom. 2009;10:463–477. Faryabi B, Vahedi G, Datta A, Chamberland JF, Dougherty ER. Recent advances in intervention in markovian regulatory networks. Curr Genom. 2009;10:463–477.
90.
go back to reference Qian X, Ivanov I, Ghaffari N, Dougherty ER. Intervention in gene regulatory networks via greedy control policies based on long-run behavior. BMC Syst Biol. 2009;3:1. Qian X, Ivanov I, Ghaffari N, Dougherty ER. Intervention in gene regulatory networks via greedy control policies based on long-run behavior. BMC Syst Biol. 2009;3:1.
91.
go back to reference Zhao C, Ivanov I, Shah M, et al. Conditioning-based model for the regulatory activities of microRNAs in specific dietary contexts. IEEE International Workshop on Genomic Signal Processing and Statistics (GENSIPS); 2010. Zhao C, Ivanov I, Shah M, et al. Conditioning-based model for the regulatory activities of microRNAs in specific dietary contexts. IEEE International Workshop on Genomic Signal Processing and Statistics (GENSIPS); 2010.
Metadata
Title
Gut-host Crosstalk: Methodological and Computational Challenges
Author
Ivan Ivanov
Publication date
01-03-2020
Publisher
Springer US
Published in
Digestive Diseases and Sciences / Issue 3/2020
Print ISSN: 0163-2116
Electronic ISSN: 1573-2568
DOI
https://doi.org/10.1007/s10620-020-06105-9

Other articles of this Issue 3/2020

Digestive Diseases and Sciences 3/2020 Go to the issue
Live Webinar | 27-06-2024 | 18:00 (CEST)

Keynote webinar | Spotlight on medication adherence

Live: Thursday 27th June 2024, 18:00-19:30 (CEST)

WHO estimates that half of all patients worldwide are non-adherent to their prescribed medication. The consequences of poor adherence can be catastrophic, on both the individual and population level.

Join our expert panel to discover why you need to understand the drivers of non-adherence in your patients, and how you can optimize medication adherence in your clinics to drastically improve patient outcomes.

Prof. Kevin Dolgin
Prof. Florian Limbourg
Prof. Anoop Chauhan
Developed by: Springer Medicine
Obesity Clinical Trial Summary

At a glance: The STEP trials

A round-up of the STEP phase 3 clinical trials evaluating semaglutide for weight loss in people with overweight or obesity.

Developed by: Springer Medicine

Highlights from the ACC 2024 Congress

Year in Review: Pediatric cardiology

Watch Dr. Anne Marie Valente present the last year's highlights in pediatric and congenital heart disease in the official ACC.24 Year in Review session.

Year in Review: Pulmonary vascular disease

The last year's highlights in pulmonary vascular disease are presented by Dr. Jane Leopold in this official video from ACC.24.

Year in Review: Valvular heart disease

Watch Prof. William Zoghbi present the last year's highlights in valvular heart disease from the official ACC.24 Year in Review session.

Year in Review: Heart failure and cardiomyopathies

Watch this official video from ACC.24. Dr. Biykem Bozkurt discusses last year's major advances in heart failure and cardiomyopathies.