Skip to main content
Top
Published in: BMC Medical Research Methodology 1/2013

Open Access 01-12-2013 | Research article

A semi-parametric approach to estimate risk functions associated with multi-dimensional exposure profiles: application to smoking and lung cancer

Authors: David I Hastie, Silvia Liverani, Lamiae Azizi, Sylvia Richardson, Isabelle Stücker

Published in: BMC Medical Research Methodology | Issue 1/2013

Login to get access

Abstract

Background

A common characteristic of environmental epidemiology is the multi-dimensional aspect of exposure patterns, frequently reduced to a cumulative exposure for simplicity of analysis. By adopting a flexible Bayesian clustering approach, we explore the risk function linking exposure history to disease. This approach is applied here to study the relationship between different smoking characteristics and lung cancer in the framework of a population based case control study.

Methods

Our study includes 4658 males (1995 cases, 2663 controls) with full smoking history (intensity, duration, time since cessation, pack-years) from the ICARE multi-centre study conducted from 2001-2007. We extend Bayesian clustering techniques to explore predictive risk surfaces for covariate profiles of interest.

Results

We were able to partition the population into 12 clusters with different smoking profiles and lung cancer risk. Our results confirm that when compared to intensity, duration is the predominant driver of risk. On the other hand, using pack-years of cigarette smoking as a single summary leads to a considerable loss of information.

Conclusions

Our method estimates a disease risk associated to a specific exposure profile by robustly accounting for the different dimensions of exposure and will be helpful in general to give further insight into the effect of exposures that are accumulated through different time patterns.
Appendix
Available only for authorised users
Literature
1.
go back to reference Thurston SW, Liu G, Miller DP, Christiani DC: Modeling lung cancer risk in case-control studies using a new dose metric of smoking. Cancer Epidemiol Biomarkers Prev. 2005, 14 (10): 296-302.CrossRef Thurston SW, Liu G, Miller DP, Christiani DC: Modeling lung cancer risk in case-control studies using a new dose metric of smoking. Cancer Epidemiol Biomarkers Prev. 2005, 14 (10): 296-302.CrossRef
2.
go back to reference Lubin JH, Alavanja MC, Caporaso N, et al: Cigarette smoking and cancer risk: modeling total exposure and intensity. Am J Epidemiol. 2007, 166 (4): 479-489. 10.1093/aje/kwm089.CrossRefPubMed Lubin JH, Alavanja MC, Caporaso N, et al: Cigarette smoking and cancer risk: modeling total exposure and intensity. Am J Epidemiol. 2007, 166 (4): 479-489. 10.1093/aje/kwm089.CrossRefPubMed
3.
go back to reference Lubin JH, Caporaso NE: Cigarette smoking and cancer risk: modeling total exposure and intensity. Am J Epidemiol. 2007, 166 (4): 479-489. 10.1093/aje/kwm089.CrossRefPubMed Lubin JH, Caporaso NE: Cigarette smoking and cancer risk: modeling total exposure and intensity. Am J Epidemiol. 2007, 166 (4): 479-489. 10.1093/aje/kwm089.CrossRefPubMed
4.
go back to reference Abrahamowicz M, Siemiatycki J, Rachet B, Leffondré K: Modeling smoking history: a comparison of different approaches. Am J Epidemiol. 2002, 156 (9): 813-823. 10.1093/aje/kwf122.CrossRefPubMed Abrahamowicz M, Siemiatycki J, Rachet B, Leffondré K: Modeling smoking history: a comparison of different approaches. Am J Epidemiol. 2002, 156 (9): 813-823. 10.1093/aje/kwf122.CrossRefPubMed
5.
go back to reference Lacourt A, Gramond C, Leffondré K, et al: Temporal patterns of occupational asbestos exposure and risk of pleural mesothelioma. Eur Respir J. 2012, 39 (6): 1304-1312. 10.1183/09031936.00005111.CrossRefPubMed Lacourt A, Gramond C, Leffondré K, et al: Temporal patterns of occupational asbestos exposure and risk of pleural mesothelioma. Eur Respir J. 2012, 39 (6): 1304-1312. 10.1183/09031936.00005111.CrossRefPubMed
6.
7.
go back to reference Peto R: Influence of dose and duration of smoking on lung cancer rates. IARC Sci Publ. 1986, 74: 23-33.PubMed Peto R: Influence of dose and duration of smoking on lung cancer rates. IARC Sci Publ. 1986, 74: 23-33.PubMed
8.
go back to reference Breiman L, Friedman JH, Olshen RA, Stone C: Classification and Regression Trees. 1984, Monterey, CA: Wadsworth & Brooks/Cole Advanced Books & Software Breiman L, Friedman JH, Olshen RA, Stone C: Classification and Regression Trees. 1984, Monterey, CA: Wadsworth & Brooks/Cole Advanced Books & Software
9.
go back to reference Ritchie MD, Hahn LW, Roodi N, et al: Multifactor-dimensionality reduction reveals high-order interactions among estrogen-metabolism genes in sporadic breast cancer. Am J Hum Genet. 2001, 69 (1): 138-147. 10.1086/321276.CrossRefPubMedPubMedCentral Ritchie MD, Hahn LW, Roodi N, et al: Multifactor-dimensionality reduction reveals high-order interactions among estrogen-metabolism genes in sporadic breast cancer. Am J Hum Genet. 2001, 69 (1): 138-147. 10.1086/321276.CrossRefPubMedPubMedCentral
10.
go back to reference Molitor J, Papathomas M, Jerrett M, Richardson S: Bayesian profile regression with an application to the National Survey of Children’s Health. Biostatistics. 2010, 11 (3): 484-498. 10.1093/biostatistics/kxq013.CrossRefPubMed Molitor J, Papathomas M, Jerrett M, Richardson S: Bayesian profile regression with an application to the National Survey of Children’s Health. Biostatistics. 2010, 11 (3): 484-498. 10.1093/biostatistics/kxq013.CrossRefPubMed
11.
go back to reference Papathomas M, Molitor J, Richardson S, Riboli E, Vineis P: Examining the joint effect of multiple risk factors using exposure risk profiles: lung cancer in nonsmokers. Environ Health Perspect. 2011, 119 (1): 84-91.CrossRefPubMed Papathomas M, Molitor J, Richardson S, Riboli E, Vineis P: Examining the joint effect of multiple risk factors using exposure risk profiles: lung cancer in nonsmokers. Environ Health Perspect. 2011, 119 (1): 84-91.CrossRefPubMed
12.
go back to reference Papathomas M, Molitor J, Hoggart C, Hastie D, Richardson S: Exploring data from genetic association studies using Bayesian variable selection and the Dirichlet process: application to searching for gene x gene patterns. Genet Epidemiol. 2012, 36: 663-674. 10.1002/gepi.21661.CrossRefPubMed Papathomas M, Molitor J, Hoggart C, Hastie D, Richardson S: Exploring data from genetic association studies using Bayesian variable selection and the Dirichlet process: application to searching for gene x gene patterns. Genet Epidemiol. 2012, 36: 663-674. 10.1002/gepi.21661.CrossRefPubMed
13.
go back to reference Luce D, Stücker I, Group IS: Investigation of occupational and environmental causes of respiratory cancers (ICARE): a multicenter, population-based case-control study in France. BMC Public Health. 2011, 11: 928-10.1186/1471-2458-11-928.CrossRefPubMedPubMedCentral Luce D, Stücker I, Group IS: Investigation of occupational and environmental causes of respiratory cancers (ICARE): a multicenter, population-based case-control study in France. BMC Public Health. 2011, 11: 928-10.1186/1471-2458-11-928.CrossRefPubMedPubMedCentral
14.
go back to reference Consonni D, De Matteis S, Lubin JH, et al: Lung cancer and occupation in a population-based case-control study. Am J Epidemiol. 2010, 171 (3): 323-333. 10.1093/aje/kwp391.CrossRefPubMedPubMedCentral Consonni D, De Matteis S, Lubin JH, et al: Lung cancer and occupation in a population-based case-control study. Am J Epidemiol. 2010, 171 (3): 323-333. 10.1093/aje/kwp391.CrossRefPubMedPubMedCentral
15.
go back to reference Green PJ, Richardson S: Modelling heterogeneity with and without the Dirichlet process. Scandinavian Journal of Statistics. 2001, 28: 355-375. 10.1111/1467-9469.00242.CrossRef Green PJ, Richardson S: Modelling heterogeneity with and without the Dirichlet process. Scandinavian Journal of Statistics. 2001, 28: 355-375. 10.1111/1467-9469.00242.CrossRef
16.
go back to reference Antoniak CE: Mixtures of Dirichlet processes with applications to Bayesian nonparametric problems. Ann Stat. 1974, 2 (6): 1152-1174. 10.1214/aos/1176342871.CrossRef Antoniak CE: Mixtures of Dirichlet processes with applications to Bayesian nonparametric problems. Ann Stat. 1974, 2 (6): 1152-1174. 10.1214/aos/1176342871.CrossRef
17.
go back to reference Ishwaran H, James L: Gibbs sampling methods for stick-breaking priors. J Am Statist Assoc. 2011, 96: 161-173.CrossRef Ishwaran H, James L: Gibbs sampling methods for stick-breaking priors. J Am Statist Assoc. 2011, 96: 161-173.CrossRef
18.
go back to reference Ishwaran H, James L: Markov chain Monte Carlo in approximate Dirichlet and beta two parameter process hierarchical models. Biometrika. 2000, 83: 371-390.CrossRef Ishwaran H, James L: Markov chain Monte Carlo in approximate Dirichlet and beta two parameter process hierarchical models. Biometrika. 2000, 83: 371-390.CrossRef
19.
go back to reference Ishwaran H: Inference for the random effects in Bayesian generalized linear mixed models. ASA Proc Bayesian Stat Sci Sect. 2000, 11: 371-390. Ishwaran H: Inference for the random effects in Bayesian generalized linear mixed models. ASA Proc Bayesian Stat Sci Sect. 2000, 11: 371-390.
20.
go back to reference Gelfand AE, Kottas A: A computational approach for full nonparametric Bayesian inference under Dirichlet process mixture models. J Comput Graphical Stat. 2002, 11: 289-305. 10.1198/106186002760180518.CrossRef Gelfand AE, Kottas A: A computational approach for full nonparametric Bayesian inference under Dirichlet process mixture models. J Comput Graphical Stat. 2002, 11: 289-305. 10.1198/106186002760180518.CrossRef
21.
go back to reference Walker SG: Sampling the Dirichlet mixture model with slices. Commun Stat-Simul C. 2007, 36: 45-54. 10.1080/03610910601096262.CrossRef Walker SG: Sampling the Dirichlet mixture model with slices. Commun Stat-Simul C. 2007, 36: 45-54. 10.1080/03610910601096262.CrossRef
22.
go back to reference Papaspiliopoulos P, Roberts GO: Retrospective Markov chain Monte Carlo methods for Dirichlet process hierarchical models. Biometrika. 2008, 65 (1): 169-186.CrossRef Papaspiliopoulos P, Roberts GO: Retrospective Markov chain Monte Carlo methods for Dirichlet process hierarchical models. Biometrika. 2008, 65 (1): 169-186.CrossRef
23.
go back to reference Kalli M, Griffin JE, Walker SG: Slice sampling mixture models. Stat Comput. 2011, 21: 93-105. 10.1007/s11222-009-9150-y.CrossRef Kalli M, Griffin JE, Walker SG: Slice sampling mixture models. Stat Comput. 2011, 21: 93-105. 10.1007/s11222-009-9150-y.CrossRef
24.
go back to reference Kaufman L, Rousseeuw PJ: Finding groups in data: an introduction to cluster analysis. 2005, Hoboken NJ: Wiley-Interscience Kaufman L, Rousseeuw PJ: Finding groups in data: an introduction to cluster analysis. 2005, Hoboken NJ: Wiley-Interscience
25.
go back to reference Dahl D: Model-based clustering for expression data via a Dirichlet process mixture model. : Kim-Anh D, Muller P and Vannuci M (Eds.) Bayesian Inference for Gene Expression and Proteomics. 2006, Cambridge: Cambridge University Press, 201-218.CrossRef Dahl D: Model-based clustering for expression data via a Dirichlet process mixture model. : Kim-Anh D, Muller P and Vannuci M (Eds.) Bayesian Inference for Gene Expression and Proteomics. 2006, Cambridge: Cambridge University Press, 201-218.CrossRef
26.
go back to reference Ohlssen DI, Sharples LD, Spiegelhalter DJ: Flexible random-effects models using Bayesian semi-parametric models: applications to institutional comparisons. Stat Med. 2007, 26: 2088-2112. 10.1002/sim.2666.CrossRefPubMed Ohlssen DI, Sharples LD, Spiegelhalter DJ: Flexible random-effects models using Bayesian semi-parametric models: applications to institutional comparisons. Stat Med. 2007, 26: 2088-2112. 10.1002/sim.2666.CrossRefPubMed
27.
go back to reference Liu JS: Nonparametric hierarchical Bayes via sequential imputation. Ann Stat. 1996, 24: 911-930.CrossRef Liu JS: Nonparametric hierarchical Bayes via sequential imputation. Ann Stat. 1996, 24: 911-930.CrossRef
28.
go back to reference Zhang H, Bonney G: Use of Classification Trees for Association Studies. Genet Epidemiol. 2000, 19: 323-332. 10.1002/1098-2272(200012)19:4<323::AID-GEPI4>3.0.CO;2-5.CrossRefPubMed Zhang H, Bonney G: Use of Classification Trees for Association Studies. Genet Epidemiol. 2000, 19: 323-332. 10.1002/1098-2272(200012)19:4<323::AID-GEPI4>3.0.CO;2-5.CrossRefPubMed
29.
go back to reference Lemon SC, Roy J, Clark MA, Friedmann PD, Rakowski W: Classification and regression tree analysis in public health: methodological review and comparison with logistic regression. Ann Behav Med. 2003, 26 (3): 172-181. 10.1207/S15324796ABM2603_02.CrossRefPubMed Lemon SC, Roy J, Clark MA, Friedmann PD, Rakowski W: Classification and regression tree analysis in public health: methodological review and comparison with logistic regression. Ann Behav Med. 2003, 26 (3): 172-181. 10.1207/S15324796ABM2603_02.CrossRefPubMed
30.
go back to reference Wang W, Spitz MR, Yang H, Lu C, Stewart DJ, Wu X: Genetic variants in cell cycle control pathway confer susceptibility to lung cancer. Clin Cancer Res. 2007, 13 (19): 5974-5981. 10.1158/1078-0432.CCR-07-0113.CrossRefPubMed Wang W, Spitz MR, Yang H, Lu C, Stewart DJ, Wu X: Genetic variants in cell cycle control pathway confer susceptibility to lung cancer. Clin Cancer Res. 2007, 13 (19): 5974-5981. 10.1158/1078-0432.CCR-07-0113.CrossRefPubMed
31.
go back to reference Goel R, Misra A, Kondal D, Vikram NK, Wasirt JS, Pandey RM, et al: Identification of insulin resistance in Asian Indian adolescents: classification and regression tree (CART) and logistic regression based classification rules. Clin Endocrinol. 2009, 70: 717-724. 10.1111/j.1365-2265.2008.03409.x.CrossRef Goel R, Misra A, Kondal D, Vikram NK, Wasirt JS, Pandey RM, et al: Identification of insulin resistance in Asian Indian adolescents: classification and regression tree (CART) and logistic regression based classification rules. Clin Endocrinol. 2009, 70: 717-724. 10.1111/j.1365-2265.2008.03409.x.CrossRef
32.
go back to reference Pesch B, Kenzia B, Gustavsson P, et al: Cigarette smoking and lung cancer- relative risk estimates for the major histological types from a pooled analysis of case-control studies. Int J Cancer. 2012, 131: 1210-1219. 10.1002/ijc.27339.CrossRefPubMed Pesch B, Kenzia B, Gustavsson P, et al: Cigarette smoking and lung cancer- relative risk estimates for the major histological types from a pooled analysis of case-control studies. Int J Cancer. 2012, 131: 1210-1219. 10.1002/ijc.27339.CrossRefPubMed
33.
go back to reference Tarnaud C, Guida F, Papadopoulos A, et al: Body mass index and lung cancer risk: results from the, ICARE study, a large, population-based case-control study. Cancer Causes Control. 2012, 23 (7): 1113-1126. 10.1007/s10552-012-9980-3.CrossRefPubMed Tarnaud C, Guida F, Papadopoulos A, et al: Body mass index and lung cancer risk: results from the, ICARE study, a large, population-based case-control study. Cancer Causes Control. 2012, 23 (7): 1113-1126. 10.1007/s10552-012-9980-3.CrossRefPubMed
34.
go back to reference MacLehose RF, Dunson DB, Herring AH, Hoppin J: Bayesian methods for highly correlated exposure data. Epidemiology. 2007, 18: 199-207. 10.1097/01.ede.0000256320.30737.c0.CrossRefPubMed MacLehose RF, Dunson DB, Herring AH, Hoppin J: Bayesian methods for highly correlated exposure data. Epidemiology. 2007, 18: 199-207. 10.1097/01.ede.0000256320.30737.c0.CrossRefPubMed
Metadata
Title
A semi-parametric approach to estimate risk functions associated with multi-dimensional exposure profiles: application to smoking and lung cancer
Authors
David I Hastie
Silvia Liverani
Lamiae Azizi
Sylvia Richardson
Isabelle Stücker
Publication date
01-12-2013
Publisher
BioMed Central
Published in
BMC Medical Research Methodology / Issue 1/2013
Electronic ISSN: 1471-2288
DOI
https://doi.org/10.1186/1471-2288-13-129

Other articles of this Issue 1/2013

BMC Medical Research Methodology 1/2013 Go to the issue