Skip to main content
Top
Published in: BMC Public Health 1/2017

Open Access 01-12-2017 | Research article

A systematic review of data mining and machine learning for air pollution epidemiology

Authors: Colin Bellinger, Mohomed Shazan Mohomed Jabbar, Osmar Zaïane, Alvaro Osornio-Vargas

Published in: BMC Public Health | Issue 1/2017

Login to get access

Abstract

Background

Data measuring airborne pollutants, public health and environmental factors are increasingly being stored and merged. These big datasets offer great potential, but also challenge traditional epidemiological methods. This has motivated the exploration of alternative methods to make predictions, find patterns and extract information. To this end, data mining and machine learning algorithms are increasingly being applied to air pollution epidemiology.

Methods

We conducted a systematic literature review on the application of data mining and machine learning methods in air pollution epidemiology. We carried out our search process in PubMed, the MEDLINE database and Google Scholar. Research articles applying data mining and machine learning methods to air pollution epidemiology were queried and reviewed.

Results

Our search queries resulted in 400 research articles. Our fine-grained analysis employed our inclusion/exclusion criteria to reduce the results to 47 articles, which we separate into three primary areas of interest: 1) source apportionment; 2) forecasting/prediction of air pollution/quality or exposure; and 3) generating hypotheses. Early applications had a preference for artificial neural networks. In more recent work, decision trees, support vector machines, k-means clustering and the APRIORI algorithm have been widely applied. Our survey shows that the majority of the research has been conducted in Europe, China and the USA, and that data mining is becoming an increasingly common tool in environmental health. For potential new directions, we have identified that deep learning and geo-spacial pattern mining are two burgeoning areas of data mining that have good potential for future applications in air pollution epidemiology.

Conclusions

We carried out a systematic review identifying the current trends, challenges and new directions to explore in the application of data mining methods to air pollution epidemiology. This work shows that data mining is increasingly being applied in air pollution epidemiology.
The potential to support air pollution epidemiology continues to grow with advancements in data mining related to temporal and geo-spacial mining, and deep learning. This is further supported by new sensors and storage mediums that enable larger, better quality data. This suggests that many more fruitful applications can be expected in the future.
Literature
1.
go back to reference Domingos P. A few useful things to know about machine learning. Commun ACM. 2012; 55(10):78–87.CrossRef Domingos P. A few useful things to know about machine learning. Commun ACM. 2012; 55(10):78–87.CrossRef
3.
go back to reference Dietterich TG, et al. Ensemble methods in machine learning. Multiple Classif Syst. 2000; 1857:1–15.CrossRef Dietterich TG, et al. Ensemble methods in machine learning. Multiple Classif Syst. 2000; 1857:1–15.CrossRef
4.
go back to reference Lary DJ, Faruque FS, Malakar N, Moore A, Roscoe B, Adams ZL, Eggelston Y. Estimating the global abundance of ground level presence of particulate matter (pm2. 5). Geospatial Health. 2014; 8(3):611–30.CrossRef Lary DJ, Faruque FS, Malakar N, Moore A, Roscoe B, Adams ZL, Eggelston Y. Estimating the global abundance of ground level presence of particulate matter (pm2. 5). Geospatial Health. 2014; 8(3):611–30.CrossRef
5.
go back to reference Neto UMB, Dougherty ER. Error estimation for pattern recognition. Hoboken: John Wiley & Sons; 2015.CrossRef Neto UMB, Dougherty ER. Error estimation for pattern recognition. Hoboken: John Wiley & Sons; 2015.CrossRef
6.
go back to reference Japkowicz N, Shah M. Evaluating Learning Algorithms: a Classification Perspective. Cambridge: Cambridge University Press; 2011.CrossRef Japkowicz N, Shah M. Evaluating Learning Algorithms: a Classification Perspective. Cambridge: Cambridge University Press; 2011.CrossRef
7.
go back to reference Bellinger C, Amid A, Japkowicz N, Victor H. Multi-label classification of anemia patients. In: Proceedings of the IEEE 14th International Conference on Machine Learning and Applications (ICMLA).2015. p. 825–30. Bellinger C, Amid A, Japkowicz N, Victor H. Multi-label classification of anemia patients. In: Proceedings of the IEEE 14th International Conference on Machine Learning and Applications (ICMLA).2015. p. 825–30.
8.
go back to reference Bishop C. Pattern Recognition and Machine Learning (Information Science and Statistics). 1613-9011. Cambridge: Springer; 2006. Bishop C. Pattern Recognition and Machine Learning (Information Science and Statistics). 1613-9011. Cambridge: Springer; 2006.
9.
go back to reference Breiman L, Friedman J, Stone CJ, Olshen RA. Classification and Regression Trees. Chicago: CRC Press; 1984. Breiman L, Friedman J, Stone CJ, Olshen RA. Classification and Regression Trees. Chicago: CRC Press; 1984.
10.
go back to reference Quinlan JR. Induction of decision trees. Mach Learn. 1986; 1(1):81–106. Quinlan JR. Induction of decision trees. Mach Learn. 1986; 1(1):81–106.
11.
go back to reference Quinlan JR. C4. 5: Programs for Machine Learning. San Francisco: Morgan Kaufmann; 1993. Quinlan JR. C4. 5: Programs for Machine Learning. San Francisco: Morgan Kaufmann; 1993.
12.
go back to reference Rumelhart DE, Hinton GE, Williams RJ. Learning internal representations by error propagation. Parallel Distributed Processing: Exploration of the Micro-structure of Cognition. 1986; 1:1–34. Rumelhart DE, Hinton GE, Williams RJ. Learning internal representations by error propagation. Parallel Distributed Processing: Exploration of the Micro-structure of Cognition. 1986; 1:1–34.
13.
go back to reference Maas AL, Hannun AY, Ng AY. Rectifier nonlinearities improve neural network acoustic models. In: Proceedings of the International Conference on Machine Learning.2013. Maas AL, Hannun AY, Ng AY. Rectifier nonlinearities improve neural network acoustic models. In: Proceedings of the International Conference on Machine Learning.2013.
14.
go back to reference Vapnik V. The Nature of Statistical Learning Theory. New York: Springer; 1995.CrossRef Vapnik V. The Nature of Statistical Learning Theory. New York: Springer; 1995.CrossRef
15.
go back to reference Schölkopf B, Smola AJ. Learning with kernels. Cambridge: MIT Press; 2002. Schölkopf B, Smola AJ. Learning with kernels. Cambridge: MIT Press; 2002.
16.
go back to reference Agrawal R, Srikant R. Fast algorithms for mining association rules. In: Proceedings of the 20th International Conference on Very Large Data Bases (VLDB).1994. p. 487–99. Agrawal R, Srikant R. Fast algorithms for mining association rules. In: Proceedings of the 20th International Conference on Very Large Data Bases (VLDB).1994. p. 487–99.
17.
go back to reference Kitchenham B. Procedure for undertaking systematic reviews. Technical report, Computer Science Department, Keele University (TRISE-0401) and National ICT Australia Ltd (0400011T. 1), Joint Technical Report. 2004. Kitchenham B. Procedure for undertaking systematic reviews. Technical report, Computer Science Department, Keele University (TRISE-0401) and National ICT Australia Ltd (0400011T. 1), Joint Technical Report. 2004.
18.
go back to reference Moher D, Liberati A, Tetzlaff J, Altman D. Preferred reporting items for systematic reviews and meta-analyses: the prisma statement. J Clin Epidemiol. 2009; 62(10):1006–12.CrossRefPubMed Moher D, Liberati A, Tetzlaff J, Altman D. Preferred reporting items for systematic reviews and meta-analyses: the prisma statement. J Clin Epidemiol. 2009; 62(10):1006–12.CrossRefPubMed
19.
go back to reference Runge-Ranzinger S, Horstick O, Marx M, Kroeger A. What does dengue disease surveillance contribute to predicting and detecting outbreaks and describing trends?Trop Med Int Health. 2008; 13(8):1022–41.CrossRefPubMed Runge-Ranzinger S, Horstick O, Marx M, Kroeger A. What does dengue disease surveillance contribute to predicting and detecting outbreaks and describing trends?Trop Med Int Health. 2008; 13(8):1022–41.CrossRefPubMed
20.
go back to reference Gass K, Klein M, Chang HH, Flanders WD, Strickland MJ. Classification and regression trees for epidemiologic research: an air pollution example. Environ Health. 2014; 13(1):17.CrossRefPubMedPubMedCentral Gass K, Klein M, Chang HH, Flanders WD, Strickland MJ. Classification and regression trees for epidemiologic research: an air pollution example. Environ Health. 2014; 13(1):17.CrossRefPubMedPubMedCentral
21.
go back to reference Hall M, Frank E, Holmes G, Pfahringer B, Reutemann P, Witten IH. The weka data mining software: An update. SIGKDD Explor Newsl. 2009; 11(1):10–18.CrossRef Hall M, Frank E, Holmes G, Pfahringer B, Reutemann P, Witten IH. The weka data mining software: An update. SIGKDD Explor Newsl. 2009; 11(1):10–18.CrossRef
22.
go back to reference Chen HW, Tsai CT, She CW, Lin YC, Chiang CF. Exploring the background features of acidic and basic air pollutants around an industrial complex using data mining approach. Chemosphere. 2010; 81(10):1358–67.CrossRefPubMed Chen HW, Tsai CT, She CW, Lin YC, Chiang CF. Exploring the background features of acidic and basic air pollutants around an industrial complex using data mining approach. Chemosphere. 2010; 81(10):1358–67.CrossRefPubMed
23.
go back to reference Chen M, Wang P, Chen Q, Wu J, Chen X. A clustering algorithm for sample data based on environmental pollution characteristics. Atmos Environ. 2015; 107:194–203.CrossRef Chen M, Wang P, Chen Q, Wu J, Chen X. A clustering algorithm for sample data based on environmental pollution characteristics. Atmos Environ. 2015; 107:194–203.CrossRef
24.
go back to reference Singh KP, Gupta S, Rai P. Identifying pollution sources and predicting urban air quality using ensemble learning methods. Atmos Environ. 2013; 80:426–37.CrossRef Singh KP, Gupta S, Rai P. Identifying pollution sources and predicting urban air quality using ensemble learning methods. Atmos Environ. 2013; 80:426–37.CrossRef
25.
go back to reference Thurston GD, Spengler JD. A quantitative assessment of source contributions to inhalable particulate matter pollution in metropolitan boston. Atmos Environ (1967). 1985; 19(1):9–25.CrossRef Thurston GD, Spengler JD. A quantitative assessment of source contributions to inhalable particulate matter pollution in metropolitan boston. Atmos Environ (1967). 1985; 19(1):9–25.CrossRef
26.
go back to reference Jiang W, Wang Y, Tsou MH, Fu X. Using social media to detect outdoor air pollution and monitor air quality index (aqi): a geo-targeted spatiotemporal analysis framework with sina weibo (chinese twitter). PloS ONE. 2015; 10(10):0141185. Jiang W, Wang Y, Tsou MH, Fu X. Using social media to detect outdoor air pollution and monitor air quality index (aqi): a geo-targeted spatiotemporal analysis framework with sina weibo (chinese twitter). PloS ONE. 2015; 10(10):0141185.
27.
go back to reference Wang S, Paul MJ, Dredze M. Social media as a sensor of air quality and public response in china. J Med Internet Res. 2015; 17(3):22.CrossRef Wang S, Paul MJ, Dredze M. Social media as a sensor of air quality and public response in china. J Med Internet Res. 2015; 17(3):22.CrossRef
28.
go back to reference Freund Y, Schapire RE. Experiments with a new boosting algorithm. In: Proceedings of the Thirteenth International Conference on Machine Learning: 1996. p. 148–56. Freund Y, Schapire RE. Experiments with a new boosting algorithm. In: Proceedings of the Thirteenth International Conference on Machine Learning: 1996. p. 148–56.
29.
go back to reference Friedman JH. Greedy function approximation: a gradient boosting machine. Ann Stat. 2001; 29(5):1189–232.CrossRef Friedman JH. Greedy function approximation: a gradient boosting machine. Ann Stat. 2001; 29(5):1189–232.CrossRef
30.
go back to reference Xu Y, Yang W, Wang J. Air quality early-warning system for cities in china. Atmos Environ. 2017; 148:239–57.CrossRef Xu Y, Yang W, Wang J. Air quality early-warning system for cities in china. Atmos Environ. 2017; 148:239–57.CrossRef
31.
go back to reference Bobb JF, Valeri L, Henn BC, Christiani DC, Wright RO, Mazumdar M, Godleski JJ, Coull BA. Bayesian kernel machine regression for estimating the health effects of multi-pollutant mixtures. Biostatistics. 2014; 16(3):058. Bobb JF, Valeri L, Henn BC, Christiani DC, Wright RO, Mazumdar M, Godleski JJ, Coull BA. Bayesian kernel machine regression for estimating the health effects of multi-pollutant mixtures. Biostatistics. 2014; 16(3):058.
32.
go back to reference Payus C, Sulaiman N, Shahani M, Bakar AA. Association rules of data mining application for respiratory illness by air pollution database. Int J Basic Appl Sci. 2013; 13(3):11–16. Payus C, Sulaiman N, Shahani M, Bakar AA. Association rules of data mining application for respiratory illness by air pollution database. Int J Basic Appl Sci. 2013; 13(3):11–16.
33.
go back to reference Kukkonen J, Partanen L, Karppinen A, Ruuskanen J, Junninen H, Kolehmainen M, Niska H, Dorling S, Chatterton T, Foxall R, et al. Extensive evaluation of neural network models for the prediction of no 2 and pm 10 concentrations, compared with a deterministic modelling system and measurements in central helsinki. Atmos Environ. 2003; 37(32):4539–550.CrossRef Kukkonen J, Partanen L, Karppinen A, Ruuskanen J, Junninen H, Kolehmainen M, Niska H, Dorling S, Chatterton T, Foxall R, et al. Extensive evaluation of neural network models for the prediction of no 2 and pm 10 concentrations, compared with a deterministic modelling system and measurements in central helsinki. Atmos Environ. 2003; 37(32):4539–550.CrossRef
34.
go back to reference Reid CE, Jerrett M, Petersen ML, Pfister GG, Morefield PE, Tager IB, Raffuse SM, Balmes JR. Spatiotemporal prediction of fine particulate matter during the 2008 northern california wildfires using machine learning. Environ Sci Technol. 2015; 49(6):3887–96.CrossRefPubMed Reid CE, Jerrett M, Petersen ML, Pfister GG, Morefield PE, Tager IB, Raffuse SM, Balmes JR. Spatiotemporal prediction of fine particulate matter during the 2008 northern california wildfires using machine learning. Environ Sci Technol. 2015; 49(6):3887–96.CrossRefPubMed
35.
go back to reference Zhu W, Wang J, Zhang W, Sun D. Short-term effects of air pollution on lower respiratory diseases and forecasting by the group method of data handling. Atmos Environ. 2012; 51:29–38.CrossRef Zhu W, Wang J, Zhang W, Sun D. Short-term effects of air pollution on lower respiratory diseases and forecasting by the group method of data handling. Atmos Environ. 2012; 51:29–38.CrossRef
36.
go back to reference Toti G, Vilalta R, Lindner P, Lefer B, Macias C, Price D. Analysis of correlation between pediatric asthma exacerbation and exposure to pollutant mixtures with association rule mining. Artif Intell Med. 2016; 74:44–52.CrossRefPubMed Toti G, Vilalta R, Lindner P, Lefer B, Macias C, Price D. Analysis of correlation between pediatric asthma exacerbation and exposure to pollutant mixtures with association rule mining. Artif Intell Med. 2016; 74:44–52.CrossRefPubMed
37.
go back to reference Setiono R, Liu H. Understanding neural networks via rule extraction. In: Proceedings of the International Joint Conference of Artificial Intelligence (IJCAI).1995. p. 480–5. Setiono R, Liu H. Understanding neural networks via rule extraction. In: Proceedings of the International Joint Conference of Artificial Intelligence (IJCAI).1995. p. 480–5.
38.
go back to reference Pandey G, Zhang B, Jian L. Predicting submicron air pollution indicators: a machine learning approach. Environ Sci Process Impacts. 2013; 15(5):996–1005.CrossRefPubMed Pandey G, Zhang B, Jian L. Predicting submicron air pollution indicators: a machine learning approach. Environ Sci Process Impacts. 2013; 15(5):996–1005.CrossRefPubMed
39.
go back to reference Aquilina NJ, Delgado-Saborit JM, Gauci AP, Baker S, Meddings C, Harrison RM. Comparative modeling approaches for personal exposure to particle-associated pah. Environ Sci Technol. 2010; 44(24):9370–6.CrossRefPubMed Aquilina NJ, Delgado-Saborit JM, Gauci AP, Baker S, Meddings C, Harrison RM. Comparative modeling approaches for personal exposure to particle-associated pah. Environ Sci Technol. 2010; 44(24):9370–6.CrossRefPubMed
40.
go back to reference Hämäläinen W. Statapriori: an efficient algorithm for searching statistically significant association rules. Knowl Inform Syst. 2010; 23(3):373–99.CrossRef Hämäläinen W. Statapriori: an efficient algorithm for searching statistically significant association rules. Knowl Inform Syst. 2010; 23(3):373–99.CrossRef
41.
go back to reference Hämäläinen W. Kingfisher: an efficient algorithm for searching for both positive and negative dependency rules with statistical significance measures. Knowl Inform Syst. 2012; 32(2):383–414.CrossRef Hämäläinen W. Kingfisher: an efficient algorithm for searching for both positive and negative dependency rules with statistical significance measures. Knowl Inform Syst. 2012; 32(2):383–414.CrossRef
42.
go back to reference He H, Ma Y. Imbalanced Learning: Foundations, Algorithms, and Applications. Piscataway: John Wiley & Sons; 2013.CrossRef He H, Ma Y. Imbalanced Learning: Foundations, Algorithms, and Applications. Piscataway: John Wiley & Sons; 2013.CrossRef
43.
go back to reference Min KD, Kwon HJ, Kim K, Kim SY. Air pollution monitoring design for epidemiological application in a densely populated city. Int J Environ Res Public Health. 2017; 14(7):686.CrossRefPubMedCentral Min KD, Kwon HJ, Kim K, Kim SY. Air pollution monitoring design for epidemiological application in a densely populated city. Int J Environ Res Public Health. 2017; 14(7):686.CrossRefPubMedCentral
44.
go back to reference Keller JP, Drton M, Larson T, Kaufman JD, Sandler DP, Szpiro AA. Covariate-adaptive clustering of exposures for air pollution epidemiology cohorts. Ann Appl Stat. 2017; 11(1):93.CrossRefPubMedPubMedCentral Keller JP, Drton M, Larson T, Kaufman JD, Sandler DP, Szpiro AA. Covariate-adaptive clustering of exposures for air pollution epidemiology cohorts. Ann Appl Stat. 2017; 11(1):93.CrossRefPubMedPubMedCentral
45.
go back to reference Li D, Liu J, Zhang J, Gui H, Du P, Yu T, Wang J, Lu Y, Liu W, Cheng Y. Identification of long-range transport pathways and potential sources of pm2.5 and pm10 in beijing from 2014 to 2015. J Environ Sci. 2017; 56:214–29.CrossRef Li D, Liu J, Zhang J, Gui H, Du P, Yu T, Wang J, Lu Y, Liu W, Cheng Y. Identification of long-range transport pathways and potential sources of pm2.5 and pm10 in beijing from 2014 to 2015. J Environ Sci. 2017; 56:214–29.CrossRef
46.
go back to reference Stingone JA, Pandey OP, Claudio L, Pandey G. Using machine learning to identify air pollution exposure profiles associated with early cognitive skills among u.s. children. Environ Pollut. 2017; 230:730–40.CrossRefPubMed Stingone JA, Pandey OP, Claudio L, Pandey G. Using machine learning to identify air pollution exposure profiles associated with early cognitive skills among u.s. children. Environ Pollut. 2017; 230:730–40.CrossRefPubMed
47.
go back to reference Liu BC, Binaykia A, Chang PC, Tiwari MK, Tsao CC. Urban air quality forecasting based on multi-dimensional collaborative support vector regression (svr): A case study of beijing-tianjin-shijiazhuang. PLOS ONE. 2017; 12(7):1–17. Liu BC, Binaykia A, Chang PC, Tiwari MK, Tsao CC. Urban air quality forecasting based on multi-dimensional collaborative support vector regression (svr): A case study of beijing-tianjin-shijiazhuang. PLOS ONE. 2017; 12(7):1–17.
48.
go back to reference Mirto M, Fiore S, Conte L, Bruno LV, Aloisio G. A spatial data analysis infrastructure for environmental health research. In: Proceedings of the International Conference on High Performance Computing and Simulation (HPCS): 2016. p. 435–42. Mirto M, Fiore S, Conte L, Bruno LV, Aloisio G. A spatial data analysis infrastructure for environmental health research. In: Proceedings of the International Conference on High Performance Computing and Simulation (HPCS): 2016. p. 435–42.
49.
go back to reference Lewis AC, Lee JD, Edwards PM, Shaw MD, Evans MJ, Moller SJ, Smith KR, Buckley JW, Ellis M, Gillot SR, et al. Evaluating the performance of low cost chemical sensors for air pollution research. Faraday Discuss. 2016; 189:85–103.CrossRefPubMed Lewis AC, Lee JD, Edwards PM, Shaw MD, Evans MJ, Moller SJ, Smith KR, Buckley JW, Ellis M, Gillot SR, et al. Evaluating the performance of low cost chemical sensors for air pollution research. Faraday Discuss. 2016; 189:85–103.CrossRefPubMed
50.
go back to reference Reid CE, Jerrett M, Tager IB, Petersen ML, Mann JK, Balmes JR. Differential respiratory health effects from the 2008 northern california wildfires: a spatiotemporal approach. Environ Res. 2016; 150:227–35.CrossRefPubMed Reid CE, Jerrett M, Tager IB, Petersen ML, Mann JK, Balmes JR. Differential respiratory health effects from the 2008 northern california wildfires: a spatiotemporal approach. Environ Res. 2016; 150:227–35.CrossRefPubMed
51.
go back to reference Fernández-Camacho R, Cabeza IB, Aroba J, Gómez-Bravo F, Rodríguez S, de la Rosa J. Assessment of ultra-fine particles and noise measurements using fuzzy logic and data mining techniques. Sci Total Environ. 2015; 512:103–13.CrossRefPubMed Fernández-Camacho R, Cabeza IB, Aroba J, Gómez-Bravo F, Rodríguez S, de la Rosa J. Assessment of ultra-fine particles and noise measurements using fuzzy logic and data mining techniques. Sci Total Environ. 2015; 512:103–13.CrossRefPubMed
52.
go back to reference Lary D, Lary T, Sattler B. Using machine learning to estimate global pm2. 5 for environmental health studies. Environ Health Insights. 2015; 9(Suppl 1):41.PubMedPubMedCentral Lary D, Lary T, Sattler B. Using machine learning to estimate global pm2. 5 for environmental health studies. Environ Health Insights. 2015; 9(Suppl 1):41.PubMedPubMedCentral
53.
go back to reference Qin S, Liu F, Wang C, Song Y, Qu J. Spatial-temporal analysis and projection of extreme particulate matter (pm 10 and pm 2.5) levels using association rules: A case study of the jing-jin-ji region, china. Atmos Environ. 2015; 120:339–50.CrossRef Qin S, Liu F, Wang C, Song Y, Qu J. Spatial-temporal analysis and projection of extreme particulate matter (pm 10 and pm 2.5) levels using association rules: A case study of the jing-jin-ji region, china. Atmos Environ. 2015; 120:339–50.CrossRef
55.
go back to reference Dias D, Tchepel O. Modelling of human exposure to air pollution in the urban environment: a gps-based approach. Environ Sci Pollut Res. 2014; 21(5):3558–71.CrossRef Dias D, Tchepel O. Modelling of human exposure to air pollution in the urban environment: a gps-based approach. Environ Sci Pollut Res. 2014; 21(5):3558–71.CrossRef
56.
go back to reference Philibert A, Loyce C, Makowski D. Prediction of n 2 o emission from local information with random forest. Environ Pollut. 2013; 177:156–63.CrossRefPubMed Philibert A, Loyce C, Makowski D. Prediction of n 2 o emission from local information with random forest. Environ Pollut. 2013; 177:156–63.CrossRefPubMed
57.
go back to reference Padula AM, Mortimer K, Hubbard A, Lurmann F, Jerrett M, Tager IB. Exposure to traffic-related air pollution during pregnancy and term low birth weight: estimation of causal associations in a semiparametric model. Am J Epidemiol. 2012; 176(9):148.CrossRef Padula AM, Mortimer K, Hubbard A, Lurmann F, Jerrett M, Tager IB. Exposure to traffic-related air pollution during pregnancy and term low birth weight: estimation of causal associations in a semiparametric model. Am J Epidemiol. 2012; 176(9):148.CrossRef
58.
59.
go back to reference Niska H, Hiltunen T, Karppinen A, Ruuskanen J, Kolehmainen M. Evolving the neural network model for forecasting air pollution time series. Eng Appl Artif Intell. 2004; 17(2):159–67.CrossRef Niska H, Hiltunen T, Karppinen A, Ruuskanen J, Kolehmainen M. Evolving the neural network model for forecasting air pollution time series. Eng Appl Artif Intell. 2004; 17(2):159–67.CrossRef
60.
go back to reference Kolehmainen M, Martikainen H, Ruuskanen J. Neural networks and periodic components used in air quality forecasting. Atmos Environ. 2001; 35(5):815–25.CrossRef Kolehmainen M, Martikainen H, Ruuskanen J. Neural networks and periodic components used in air quality forecasting. Atmos Environ. 2001; 35(5):815–25.CrossRef
61.
go back to reference Challoner A, Pilla F, Gill L. Prediction of indoor air exposure from outdoor air quality using an artificial neural network model for inner city commercial buildings. Int J Environ Res Public health. 2015; 12(12):15233–53.CrossRefPubMedPubMedCentral Challoner A, Pilla F, Gill L. Prediction of indoor air exposure from outdoor air quality using an artificial neural network model for inner city commercial buildings. Int J Environ Res Public health. 2015; 12(12):15233–53.CrossRefPubMedPubMedCentral
62.
go back to reference Pegoretti S, Verdi L. Machine learning for the analysis of indoor radon distribution, compared with ordinary kriging. Radiat Prot Dosim. 2009; 137(3-4):324–8.CrossRef Pegoretti S, Verdi L. Machine learning for the analysis of indoor radon distribution, compared with ordinary kriging. Radiat Prot Dosim. 2009; 137(3-4):324–8.CrossRef
63.
go back to reference Bell SM, Edwards SW. Identification and prioritization of relationships between environmental stressors and adverse human health impacts. Environ Health Perspec. 2015; 123(11):1193–9.CrossRef Bell SM, Edwards SW. Identification and prioritization of relationships between environmental stressors and adverse human health impacts. Environ Health Perspec. 2015; 123(11):1193–9.CrossRef
64.
go back to reference Bell SM, Edwards SW. Building associations between markers of environmental stressors and adverse human health impacts using frequent itemset mining. In: Proceedings of the SIAM International Conference on Data Mining: 2014. p. 551–9. Bell SM, Edwards SW. Building associations between markers of environmental stressors and adverse human health impacts using frequent itemset mining. In: Proceedings of the SIAM International Conference on Data Mining: 2014. p. 551–9.
65.
go back to reference Hu M, Li W, Li L, Houston D, Wu J. Refining time-activity classification of human subjects using the global positioning system. PloS ONE. 2016; 11(2):0148875. Hu M, Li W, Li L, Houston D, Wu J. Refining time-activity classification of human subjects using the global positioning system. PloS ONE. 2016; 11(2):0148875.
66.
go back to reference Beckerman BS, Jerrett M, Martin RV, van Donkelaar A, Ross Z, Burnett RT. Application of the deletion/substitution/addition algorithm to selecting land use regression models for interpolating air pollution measurements in california. Atmospheric Environ. 2013; 77:172–7.CrossRef Beckerman BS, Jerrett M, Martin RV, van Donkelaar A, Ross Z, Burnett RT. Application of the deletion/substitution/addition algorithm to selecting land use regression models for interpolating air pollution measurements in california. Atmospheric Environ. 2013; 77:172–7.CrossRef
67.
go back to reference Dominici F, Peng RD, Bell ML, Pham L, McDermott A, Zeger SL, Samet JM. Fine particulate air pollution and hospital admission for cardiovascular and respiratory diseases. Jama. 2006; 295(10):1127–34.CrossRefPubMedPubMedCentral Dominici F, Peng RD, Bell ML, Pham L, McDermott A, Zeger SL, Samet JM. Fine particulate air pollution and hospital admission for cardiovascular and respiratory diseases. Jama. 2006; 295(10):1127–34.CrossRefPubMedPubMedCentral
68.
go back to reference Corani G. Air quality prediction in milan: feed-forward neural networks, pruned neural networks and lazy learning. Ecol Model. 2005; 185(2):513–29.CrossRef Corani G. Air quality prediction in milan: feed-forward neural networks, pruned neural networks and lazy learning. Ecol Model. 2005; 185(2):513–29.CrossRef
69.
go back to reference Ghanem M, Guo Y, Hassard J, Osmond M, Richards M. Sensor grids for air pollution monitoring. In: Proc. 3rd UK e-Science All Hands Meeting.Nottingham: 2004. Ghanem M, Guo Y, Hassard J, Osmond M, Richards M. Sensor grids for air pollution monitoring. In: Proc. 3rd UK e-Science All Hands Meeting.Nottingham: 2004.
Metadata
Title
A systematic review of data mining and machine learning for air pollution epidemiology
Authors
Colin Bellinger
Mohomed Shazan Mohomed Jabbar
Osmar Zaïane
Alvaro Osornio-Vargas
Publication date
01-12-2017
Publisher
BioMed Central
Published in
BMC Public Health / Issue 1/2017
Electronic ISSN: 1471-2458
DOI
https://doi.org/10.1186/s12889-017-4914-3

Other articles of this Issue 1/2017

BMC Public Health 1/2017 Go to the issue