Skip to main content
Top
Published in: Malaria Journal 1/2015

Open Access 01-12-2015 | Research

Re-examining environmental correlates of Plasmodium falciparum malaria endemicity: a data-intensive variable selection approach

Authors: Daniel J Weiss, Bonnie Mappin, Ursula Dalrymple, Samir Bhatt, Ewan Cameron, Simon I Hay, Peter W Gething

Published in: Malaria Journal | Issue 1/2015

Login to get access

Abstract

Background

Malaria risk maps play an increasingly important role in disease control planning, implementation, and evaluation. The construction of these maps using modern geospatial techniques relies on covariate grids: continuous surfaces quantifying environmental factors that partially explain spatial heterogeneity in malaria endemicity. Although crucial, past variable selection processes for this purpose have often been subjective and ad-hoc, with many covariates used in modeling with little quantitative justification.

Methods

This research consists of an extensive covariate construction and selection process for predicting Plasmodium falciparum parasite rates (PfPR) in Africa for years 2000-2012. First, a literature review was conducted to establish a comprehensive list of covariates used for malaria mapping. Second, a library of covariate data was assembled to reflect this list, a process that included the construction of multiple, temporally dynamic datasets. Third, the resulting set of covariates was leveraged to create more than 50 million possible covariate terms via factorial combinations of different spatial and temporal aggregations, transformations, and pairwise interactions. Fourth, the expanded set of covariates was reduced via successive selection criteria to yield a robust covariate subset that was assessed using an out-of-sample validation approach.

Results

The final covariate subset included predominately dynamic covariates and it substantially out-performed earlier sets used by the Malaria Atlas Project (MAP) for creating global malaria risk maps, with the pseudo-R2 value for the out-of-sample validation increasing from 0.43 to 0.52. Dynamic covariates improved the model, with 17 of the 20 new covariates consisting of monthly or annual products, but the selected covariates were typically interaction terms that included both dynamic and synoptic datasets. Thus the interplay between normal (i.e., long-term averages) and immediate conditions may be key for characterizing environmental controls on parasite rate.

Conclusions

This analysis represents the first effort to systematically audit covariate utility for malaria mapping and then derive an objective, empirically based set of environmental covariates for modeling PfPR. The new covariates produce more reliable representations of malaria risk patterns and how they are changing through time, and these covariates will be used to characterize spatially and temporally varying environmental conditions affecting PfPR within a geostatistical-modeling framework, thus building upon previous research by MAP that produced global malaria maps for 2007 and 2010.
Appendix
Available only for authorised users
Literature
1.
go back to reference Lysenko AJ, Semashko IN, editors. [Geography of malaria. A medico-geographic profile of an ancient disease](in Russian). Moscow: Academy of Sciences, USSR; 1968. Lysenko AJ, Semashko IN, editors. [Geography of malaria. A medico-geographic profile of an ancient disease](in Russian). Moscow: Academy of Sciences, USSR; 1968.
2.
go back to reference Diggle PJ, Tawn JA, Moyeed RA. Model-based geostatistics. J R Stat Soc Ser C Appl Stat. 1998;47:299–326.CrossRef Diggle PJ, Tawn JA, Moyeed RA. Model-based geostatistics. J R Stat Soc Ser C Appl Stat. 1998;47:299–326.CrossRef
3.
4.
go back to reference Hay SI, Guerra CA, Gething PW, Patil AP, Tatem AJ, Noor AM CWK, et al. A world malaria map: Plasmodium falciparum endemicity in 2007. PLoS Med. 2009;6:e1000048.CrossRefPubMedCentralPubMed Hay SI, Guerra CA, Gething PW, Patil AP, Tatem AJ, Noor AM CWK, et al. A world malaria map: Plasmodium falciparum endemicity in 2007. PLoS Med. 2009;6:e1000048.CrossRefPubMedCentralPubMed
5.
go back to reference Gething PW, Patil A, Smith DL, Guerra C, Elyazar IRF GJ, Tatem AJ, et al. A new world malaria map: Plasmodium falciparum endemicity in 2010. Malar J. 2011;10:378.CrossRefPubMedCentralPubMed Gething PW, Patil A, Smith DL, Guerra C, Elyazar IRF GJ, Tatem AJ, et al. A new world malaria map: Plasmodium falciparum endemicity in 2010. Malar J. 2011;10:378.CrossRefPubMedCentralPubMed
6.
go back to reference Gething PW, Elyazar IRF, Moyes CM, Smith DL, Battle KE, Guerra CA, et al. A long neglected world malaria map: Plasmodium vivax endemicity in 2010. PLoS Negl Trop Dis. 2012;6:e1814.CrossRefPubMedCentralPubMed Gething PW, Elyazar IRF, Moyes CM, Smith DL, Battle KE, Guerra CA, et al. A long neglected world malaria map: Plasmodium vivax endemicity in 2010. PLoS Negl Trop Dis. 2012;6:e1814.CrossRefPubMedCentralPubMed
7.
go back to reference WHO. World Malaria Report 2013. Geneva: World Health Organization; 2013. WHO. World Malaria Report 2013. Geneva: World Health Organization; 2013.
8.
go back to reference Gething PW, Battle KE, Bhatt S, Smith DL, Eisele TP, Cibulskis RE, et al. Declining malaria in Africa: improving the measurement of progress. Malar J. 2013;13:39.CrossRef Gething PW, Battle KE, Bhatt S, Smith DL, Eisele TP, Cibulskis RE, et al. Declining malaria in Africa: improving the measurement of progress. Malar J. 2013;13:39.CrossRef
9.
go back to reference Gething PW, Patil AP, Hay SI. Quantifying aggregated uncertainty in Plasmodium falciparum malaria prevalence and populations at risk via efficient space-time geostatistical joint simulation. PLoS Comput Biol. 2010;6:e1000724.CrossRefPubMedCentralPubMed Gething PW, Patil AP, Hay SI. Quantifying aggregated uncertainty in Plasmodium falciparum malaria prevalence and populations at risk via efficient space-time geostatistical joint simulation. PLoS Comput Biol. 2010;6:e1000724.CrossRefPubMedCentralPubMed
10.
go back to reference Gelfand AE, Vounatsou P. Proper multivariate conditional autoregressive models for spatial data analysis. Biostatistics. 2003;4:11–5.CrossRefPubMed Gelfand AE, Vounatsou P. Proper multivariate conditional autoregressive models for spatial data analysis. Biostatistics. 2003;4:11–5.CrossRefPubMed
11.
go back to reference Hugh-Jones M. Applications of remote sensing to the identification of the habitats of parasites and disease vectors. Parasitol Today. 1989;5:244–51.CrossRefPubMed Hugh-Jones M. Applications of remote sensing to the identification of the habitats of parasites and disease vectors. Parasitol Today. 1989;5:244–51.CrossRefPubMed
12.
go back to reference Bennett A, Yukich J, Miller J, Vounatsou P, Hamainza B, Ingwe M, et al. A methodological framework for the improved use of routine health system data to evaluate national malaria control programs: evidence from Zambia. Popul Health Metr. 2014;12:30.CrossRefPubMedCentralPubMed Bennett A, Yukich J, Miller J, Vounatsou P, Hamainza B, Ingwe M, et al. A methodological framework for the improved use of routine health system data to evaluate national malaria control programs: evidence from Zambia. Popul Health Metr. 2014;12:30.CrossRefPubMedCentralPubMed
14.
go back to reference Nikolaev BP. [The influence of temperature on the development of the malaria parasite in the mosquito](in Russian). Trans Pasteur Inst Epidem Bakt (Leningr). 1935;2:108. Nikolaev BP. [The influence of temperature on the development of the malaria parasite in the mosquito](in Russian). Trans Pasteur Inst Epidem Bakt (Leningr). 1935;2:108.
15.
go back to reference Clements AN, Paterson GD. The analysis of mortality and survival rates in wild populations of mosquitoes. J Appl Ecol. 1981;18:373–99.CrossRef Clements AN, Paterson GD. The analysis of mortality and survival rates in wild populations of mosquitoes. J Appl Ecol. 1981;18:373–99.CrossRef
16.
go back to reference Wan Z, Zhang Y, Zhang Q, Li Z-l. Validation of the land-surface temperature products retrieved from Terra Moderate Resolution Imaging Spectroradiometer data. Remote Sens Environ. 2002;83:163–80.CrossRef Wan Z, Zhang Y, Zhang Q, Li Z-l. Validation of the land-surface temperature products retrieved from Terra Moderate Resolution Imaging Spectroradiometer data. Remote Sens Environ. 2002;83:163–80.CrossRef
17.
go back to reference Weiss DJ, Atkinson PM, Bhatt S, Mappin B, Hay SI, Gething PW. An effective approach for gap-filling continental scale remotely sensed time-series. ISPRS J Photogramm Remote Sens. 2014;98:106–18.CrossRefPubMedCentralPubMed Weiss DJ, Atkinson PM, Bhatt S, Mappin B, Hay SI, Gething PW. An effective approach for gap-filling continental scale remotely sensed time-series. ISPRS J Photogramm Remote Sens. 2014;98:106–18.CrossRefPubMedCentralPubMed
18.
go back to reference Weiss DJ, Bhatt S, Mappin B, Van Boeckel T, Smith DL, Hay SI, et al. Air temperature suitability for Plasmodium falciparum malaria transmission in Africa 2000-2012: a high-resolution spatiotemporal prediction. Malar J. 2014;13:171.CrossRefPubMedCentralPubMed Weiss DJ, Bhatt S, Mappin B, Van Boeckel T, Smith DL, Hay SI, et al. Air temperature suitability for Plasmodium falciparum malaria transmission in Africa 2000-2012: a high-resolution spatiotemporal prediction. Malar J. 2014;13:171.CrossRefPubMedCentralPubMed
19.
go back to reference Hijmans RJ, Cameron SE, Parra JL, Jones PG, Jarvis A. Very high resolution interpolated climate surfaces for global land areas. Int J Climatol. 2005;25:1965–78.CrossRef Hijmans RJ, Cameron SE, Parra JL, Jones PG, Jarvis A. Very high resolution interpolated climate surfaces for global land areas. Int J Climatol. 2005;25:1965–78.CrossRef
20.
go back to reference Hay SI, Tatem AJ, Graham AJ, Goetz SJ, Rogers DJ. Global environmental data for mapping infectious disease distribution. Adv Parasitol. 2006;62:37–77.CrossRefPubMedCentralPubMed Hay SI, Tatem AJ, Graham AJ, Goetz SJ, Rogers DJ. Global environmental data for mapping infectious disease distribution. Adv Parasitol. 2006;62:37–77.CrossRefPubMedCentralPubMed
21.
go back to reference Friedl MA, Sulla-Menashe D, Tan B, Schneider A, Ramankutty N, Sibley A, et al. MODIS Collection 5 global land cover: Algorithm refinements and characterization of new datasets. Remote Sens Environ. 2010;114:168–82.CrossRef Friedl MA, Sulla-Menashe D, Tan B, Schneider A, Ramankutty N, Sibley A, et al. MODIS Collection 5 global land cover: Algorithm refinements and characterization of new datasets. Remote Sens Environ. 2010;114:168–82.CrossRef
22.
go back to reference Gao B. NDWI—A normalized difference water index for remote sensing of vegetation liquid water from space. Remote Sens Environ. 1996;58:257–66.CrossRef Gao B. NDWI—A normalized difference water index for remote sensing of vegetation liquid water from space. Remote Sens Environ. 1996;58:257–66.CrossRef
23.
go back to reference Kauth RJ, Thomas GS. The tasseled cap — A graphic description of the spectral-temporal development of agricultural crops as seen by Landsat. In: Proceedings of the Symposium on Machine Processing of Remotely Sensed Data; Purdue University, West Lafayette, Indiana. 1976. p. 4B-41-44B-50. Kauth RJ, Thomas GS. The tasseled cap — A graphic description of the spectral-temporal development of agricultural crops as seen by Landsat. In: Proceedings of the Symposium on Machine Processing of Remotely Sensed Data; Purdue University, West Lafayette, Indiana. 1976. p. 4B-41-44B-50.
24.
go back to reference Lobser SE, Cohen WB. MODIS tasselled cap: land cover characteristics expressed through transformed MODIS data. Int J Remote Sens. 2007;28:5079–101.CrossRef Lobser SE, Cohen WB. MODIS tasselled cap: land cover characteristics expressed through transformed MODIS data. Int J Remote Sens. 2007;28:5079–101.CrossRef
25.
go back to reference Weiss DJ, Crabtree RL. Percent surface water estimation from MODIS BRDF 16-day image composites. Remote Sens Environ. 2011;115:2035–46.CrossRef Weiss DJ, Crabtree RL. Percent surface water estimation from MODIS BRDF 16-day image composites. Remote Sens Environ. 2011;115:2035–46.CrossRef
26.
go back to reference Trabucco A, Zomer RJ. “Global aridity index (global-aridity) and global potential evapo-transpiration (global-PET) geospatial database”. CGIAR Consortium for Spatial Information. Published online, available from the CGIAR-CSI GeoPortal at: http://www.cgiar-csi.org/. Global Aridity Index (Global-Aridity) and Global Potential Evapo-Transpiration (Global-PET) Geospatial Database. In. CGIAR Consortium for Spatial Information (2009). Trabucco A, Zomer RJ. “Global aridity index (global-aridity) and global potential evapo-transpiration (global-PET) geospatial database”. CGIAR Consortium for Spatial Information. Published online, available from the CGIAR-CSI GeoPortal at: http://​www.​cgiar-csi.​org/​. Global Aridity Index (Global-Aridity) and Global Potential Evapo-Transpiration (Global-PET) Geospatial Database. In. CGIAR Consortium for Spatial Information (2009).
28.
go back to reference Huete A, Justice C, Van Leeuwen W. MODIS vegetation index (MOD13). Algorithm theoretical basis document. NASA Goddard Space Flight Center. 1999. Huete A, Justice C, Van Leeuwen W. MODIS vegetation index (MOD13). Algorithm theoretical basis document. NASA Goddard Space Flight Center. 1999.
29.
go back to reference Farr TG, Rosen PA, Caro E, Crippen R, Duren R, Hensley S, et al. The Shuttle Radar Topography Mission. Rev Geophys. 2007;45:RG2004.CrossRef Farr TG, Rosen PA, Caro E, Crippen R, Duren R, Hensley S, et al. The Shuttle Radar Topography Mission. Rev Geophys. 2007;45:RG2004.CrossRef
30.
go back to reference Beven KJ, Kirkby MJ. A physically based, variable contributing area model of basin hydrology. Hydrol Sci Bull. 1979;24:43–69.CrossRef Beven KJ, Kirkby MJ. A physically based, variable contributing area model of basin hydrology. Hydrol Sci Bull. 1979;24:43–69.CrossRef
31.
go back to reference Tatem AJ, Guerra CA, Kabaria CW, Noor AM, Hay SI. Human population, urban settlement patterns and their impact on Plasmodium falciparum malaria endemicity. Malar J. 2008;7:218.CrossRefPubMedCentralPubMed Tatem AJ, Guerra CA, Kabaria CW, Noor AM, Hay SI. Human population, urban settlement patterns and their impact on Plasmodium falciparum malaria endemicity. Malar J. 2008;7:218.CrossRefPubMedCentralPubMed
32.
go back to reference Tatem AJ, Noor AM, Von Hagen C, Di Gregorio A, Hay SI. High resolution population maps for low income nations: combining land cover and census in East Africa. PLoS ONE. 2007;2:e1298.CrossRefPubMedCentralPubMed Tatem AJ, Noor AM, Von Hagen C, Di Gregorio A, Hay SI. High resolution population maps for low income nations: combining land cover and census in East Africa. PLoS ONE. 2007;2:e1298.CrossRefPubMedCentralPubMed
33.
go back to reference Center for International Earth Science Information Network - CIESIN - Columbia University and Centro Internacional de Agricultura Tropical - CIAT. Gridded Population of the World, Version 3 (GPWv3): Population Density Grid. Palisades, NY: NASA Socioeconomic Data and Applications Center (SEDAC); 2005. Center for International Earth Science Information Network - CIESIN - Columbia University and Centro Internacional de Agricultura Tropical - CIAT. Gridded Population of the World, Version 3 (GPWv3): Population Density Grid. Palisades, NY: NASA Socioeconomic Data and Applications Center (SEDAC); 2005.
34.
go back to reference Noor AM, Alegana VA, Gething PW, Tatem AJ, Snow RW. Using remotely sensed night-time light as a proxy for poverty in Africa. Popul Health Metr. 2008;6:5.CrossRefPubMedCentralPubMed Noor AM, Alegana VA, Gething PW, Tatem AJ, Snow RW. Using remotely sensed night-time light as a proxy for poverty in Africa. Popul Health Metr. 2008;6:5.CrossRefPubMedCentralPubMed
35.
go back to reference Nelson A. Travel time to major cities: A global map of Accessibility. Ispra Italy: Global Environment Monitoring Unit - Joint Research Centre of the European Commission; 2008. Nelson A. Travel time to major cities: A global map of Accessibility. Ispra Italy: Global Environment Monitoring Unit - Joint Research Centre of the European Commission; 2008.
36.
go back to reference Akaike H. A new look at the statistical model identification. IEEE Trans Automat Contr. 1974;19:716–23.CrossRef Akaike H. A new look at the statistical model identification. IEEE Trans Automat Contr. 1974;19:716–23.CrossRef
37.
go back to reference Olson DM, Dinerstein E. The Global 200: Priority ecoregions for global conservation. Ann Mo Bot Gard 2002:199-224 Olson DM, Dinerstein E. The Global 200: Priority ecoregions for global conservation. Ann Mo Bot Gard 2002:199-224
38.
go back to reference Dormann CF, Elith J, Bacher S, Buchmann C, Carl G, Carré G, et al. Collinearity: a review of methods to deal with it and a simulation study evaluating their performance. Ecography. 2013;36:27–46.CrossRef Dormann CF, Elith J, Bacher S, Buchmann C, Carl G, Carré G, et al. Collinearity: a review of methods to deal with it and a simulation study evaluating their performance. Ecography. 2013;36:27–46.CrossRef
39.
go back to reference George EI, McCulloch RE. Variable selection via Gibbs sampling. J Am Stat Assoc. 1993;88:881–9.CrossRef George EI, McCulloch RE. Variable selection via Gibbs sampling. J Am Stat Assoc. 1993;88:881–9.CrossRef
40.
go back to reference McFadden D. Conditional logit analysis of qualitative choice behavior. In Frontiers in econometrics. Edited by Zarembka P. New York 1974:105-142. McFadden D. Conditional logit analysis of qualitative choice behavior. In Frontiers in econometrics. Edited by Zarembka P. New York 1974:105-142.
Metadata
Title
Re-examining environmental correlates of Plasmodium falciparum malaria endemicity: a data-intensive variable selection approach
Authors
Daniel J Weiss
Bonnie Mappin
Ursula Dalrymple
Samir Bhatt
Ewan Cameron
Simon I Hay
Peter W Gething
Publication date
01-12-2015
Publisher
BioMed Central
Published in
Malaria Journal / Issue 1/2015
Electronic ISSN: 1475-2875
DOI
https://doi.org/10.1186/s12936-015-0574-x

Other articles of this Issue 1/2015

Malaria Journal 1/2015 Go to the issue
Obesity Clinical Trial Summary

At a glance: The STEP trials

A round-up of the STEP phase 3 clinical trials evaluating semaglutide for weight loss in people with overweight or obesity.

Developed by: Springer Medicine

Highlights from the ACC 2024 Congress

Year in Review: Pediatric cardiology

Watch Dr. Anne Marie Valente present the last year's highlights in pediatric and congenital heart disease in the official ACC.24 Year in Review session.

Year in Review: Pulmonary vascular disease

The last year's highlights in pulmonary vascular disease are presented by Dr. Jane Leopold in this official video from ACC.24.

Year in Review: Valvular heart disease

Watch Prof. William Zoghbi present the last year's highlights in valvular heart disease from the official ACC.24 Year in Review session.

Year in Review: Heart failure and cardiomyopathies

Watch this official video from ACC.24. Dr. Biykem Bozkurt discuss last year's major advances in heart failure and cardiomyopathies.