Skip to main content
Top
Published in: BMC Medical Research Methodology 1/2015

Open Access 01-12-2015 | Research article

Bayesian estimation of a cancer population by capture-recapture with individual capture heterogeneity and small sample

Authors: Laurent Bailly, Jean Pierre Daurès, Brigitte Dunais, Christian Pradier

Published in: BMC Medical Research Methodology | Issue 1/2015

Login to get access

Abstract

Background

Cancer incidence and prevalence estimates are necessary to inform health policy, to predict public health impact and to identify etiological factors. Registers have been used to estimate the number of cancer cases. To be reliable and useful, cancer registry data should be complete. Capture-recapture is a method for estimating the number of cases missed, originally developed in ecology to estimate the size of animal populations. Capture recapture methods in cancer epidemiology involve modelling the overlap between lists of individuals using log-linear models. These models rely on assumption of independence of sources and equal catchability between individuals, unlikely to be satisfied in cancer population as severe cases are more likely to be captured than simple cases.

Methods

To estimate cancer population and completeness of cancer registry, we applied Mth models that rely on parameters that influence capture as time of capture (t) and individual heterogeneity (h) and compared results to the ones obtained with classical log-linear models and sample coverage approach. For three sources collecting breast and colorectal cancer cases (Histopathological cancer registry, hospital Multidisciplinary Team Meetings, and cancer screening programmes), individual heterogeneity is suspected in cancer population due to age, gender, screening history or presence of metastases. Individual heterogeneity is hardly analysed as classical log-linear models usually pool it with between-“list” dependence. We applied Bayesian Model Averaging which can be applied with small sample without asymptotic assumption, contrary to the maximum likelihood estimate procedure.

Results

Cancer population estimates were based on the results of the Mh model, with an averaged estimate of 803 cases of breast cancer and 521 cases of colorectal cancer. In the log-linear model, estimates were of 791 cases of breast cancer and 527 cases of colorectal cancer according to the retained models (729 and 481 histological cases, respectively).

Conclusions

We applied Mth models and Bayesian population estimation to small sample of a cancer population. Advantage of Mth models applied to cancer datasets, is the ability to explore individual factors associated with capture heterogeneity, as equal capture probability assumption is unlikely. Mth models and Bayesian population estimation are well-suited for capture-recapture in a heterogeneous cancer population.
Appendix
Available only for authorised users
Literature
1.
go back to reference Belot A, Grosclaude P, Bossard N, et al. Cancer incidence and mortality in France over the period 1980-2005. Rev Epidemiol Sante Publique. 2008;56(3):159–75.CrossRefPubMed Belot A, Grosclaude P, Bossard N, et al. Cancer incidence and mortality in France over the period 1980-2005. Rev Epidemiol Sante Publique. 2008;56(3):159–75.CrossRefPubMed
2.
go back to reference Bray F, Parkin DM. Evaluation of data quality in the cancer registry: Principles and methods. Part I: Comparability, validity and timeliness. Eur J Cancer. 2009;45:747–55.CrossRefPubMed Bray F, Parkin DM. Evaluation of data quality in the cancer registry: Principles and methods. Part I: Comparability, validity and timeliness. Eur J Cancer. 2009;45:747–55.CrossRefPubMed
3.
go back to reference Chapman DG. The estimation of biological populations. Ann Math Stat. 1954;25:1–15.CrossRef Chapman DG. The estimation of biological populations. Ann Math Stat. 1954;25:1–15.CrossRef
4.
go back to reference Cormack RM. The statistics of capture-recapture methods. Oceanogr Mar Biol Ann Rev. 1968;6:455–506. Cormack RM. The statistics of capture-recapture methods. Oceanogr Mar Biol Ann Rev. 1968;6:455–506.
5.
go back to reference Wittes JT, Sidel VW. A generalization of the simple capture recapture model with applications to epidemiological research. J Chronic Dis. 1968;21:287–301.CrossRefPubMed Wittes JT, Sidel VW. A generalization of the simple capture recapture model with applications to epidemiological research. J Chronic Dis. 1968;21:287–301.CrossRefPubMed
6.
go back to reference Wittes JT. Applications of a multinomial capture-recapture model to epidemiological data. J Am Stat. 1974;69:93–7.CrossRef Wittes JT. Applications of a multinomial capture-recapture model to epidemiological data. J Am Stat. 1974;69:93–7.CrossRef
7.
go back to reference Sekar CC, Deming WE. On a method of estimating birth and death rates and the extent of registration. American Stat Assoc J. 1949;44:101–15.CrossRef Sekar CC, Deming WE. On a method of estimating birth and death rates and the extent of registration. American Stat Assoc J. 1949;44:101–15.CrossRef
8.
go back to reference Himes CL, Clogg CC. An overview of demographic analysis as a method for evaluating census coverage in the US Population. Index. 1992;58:587–607.CrossRef Himes CL, Clogg CC. An overview of demographic analysis as a method for evaluating census coverage in the US Population. Index. 1992;58:587–607.CrossRef
9.
go back to reference Hook EB, Regal RR. Internal validity analysis: a method for adjusting capture-recapture estimates of prevalence. Am J Epidemiol. 1995;142(9):48–52.CrossRef Hook EB, Regal RR. Internal validity analysis: a method for adjusting capture-recapture estimates of prevalence. Am J Epidemiol. 1995;142(9):48–52.CrossRef
10.
go back to reference Crocetti E, Miccinesi G, Paci E, Zappa M. An application of the two-source capture-recapture method to estimate the completeness of the Tuscany Cancer Registry. Italy Eur J Cancer Prev. 2001;10(5):417–23.CrossRefPubMed Crocetti E, Miccinesi G, Paci E, Zappa M. An application of the two-source capture-recapture method to estimate the completeness of the Tuscany Cancer Registry. Italy Eur J Cancer Prev. 2001;10(5):417–23.CrossRefPubMed
11.
go back to reference Ballivet S, Rachid Salmi L, Dubourdieu D. Capture-recapture method to determine the best design of a surveillance system. Application to a thyroid cancer registry. Eur J Epidemiol. 2000;16:147–53.CrossRefPubMed Ballivet S, Rachid Salmi L, Dubourdieu D. Capture-recapture method to determine the best design of a surveillance system. Application to a thyroid cancer registry. Eur J Epidemiol. 2000;16:147–53.CrossRefPubMed
12.
go back to reference Seddon DJ, Williams EM. Data quality in population-based cancer registration: an assessment of the Merseyside and Cheshire Cancer Registry. Br J Cancer. 1997;76(5):667–74.CrossRefPubMedPubMedCentral Seddon DJ, Williams EM. Data quality in population-based cancer registration: an assessment of the Merseyside and Cheshire Cancer Registry. Br J Cancer. 1997;76(5):667–74.CrossRefPubMedPubMedCentral
13.
go back to reference International Working Group for Disease Monitoring and Forecasting. Capture-recapture and multiple-record systems estimation I: history and development. Am J Epidemiol. 1995;142(10):1047–58. International Working Group for Disease Monitoring and Forecasting. Capture-recapture and multiple-record systems estimation I: history and development. Am J Epidemiol. 1995;142(10):1047–58.
14.
go back to reference International Working Group for Disease Monitoring and Forecasting. Capture-recapture and multiple-record systems estimation II: applications in human diseases. Am J Epidemiol. 1995;142(10):1059–68. International Working Group for Disease Monitoring and Forecasting. Capture-recapture and multiple-record systems estimation II: applications in human diseases. Am J Epidemiol. 1995;142(10):1059–68.
16.
go back to reference Goodman LA. A general model for the analysis of surveys. American J Socio. 1972;77(6):1035–86.CrossRef Goodman LA. A general model for the analysis of surveys. American J Socio. 1972;77(6):1035–86.CrossRef
17.
go back to reference Bishop YMM, Fienberg SE, Holland PW. Discrete multivariate Analysis: Theory and practice. Cambridge. MIT press, 1975, chapter 5-6, ISBN 978-0-387-72805-6 © 2007 Springer Science+Business Media, LLC Bishop YMM, Fienberg SE, Holland PW. Discrete multivariate Analysis: Theory and practice. Cambridge. MIT press, 1975, chapter 5-6, ISBN 978-0-387-72805-6 © 2007 Springer Science+Business Media, LLC
18.
go back to reference Tilling K, Sterne JAC. Capture-recapture models including covariate effects. Am J Epidemiol. 1999;149(4):392–400.CrossRefPubMed Tilling K, Sterne JAC. Capture-recapture models including covariate effects. Am J Epidemiol. 1999;149(4):392–400.CrossRefPubMed
19.
go back to reference Chao A, Tsay PK, Lin SH, Shau WY, Chao DY. The applications of capture-recapture models to epidemiological data. Stat Med. 2001;20:3123–57.CrossRefPubMed Chao A, Tsay PK, Lin SH, Shau WY, Chao DY. The applications of capture-recapture models to epidemiological data. Stat Med. 2001;20:3123–57.CrossRefPubMed
20.
go back to reference King R, Bird SM, Hay G, Hutchinson SJ. Estimating current injectors in Scotland and their drug-related death rate by sex, region and age-group via Bayesian capture-recapture methods. Stat Methods Med Res. 2009;18(4):341–59.CrossRefPubMed King R, Bird SM, Hay G, Hutchinson SJ. Estimating current injectors in Scotland and their drug-related death rate by sex, region and age-group via Bayesian capture-recapture methods. Stat Methods Med Res. 2009;18(4):341–59.CrossRefPubMed
21.
go back to reference Schmidtmann I. Estimating completeness in cancer registries --comparing capture-recapture methods in a simulation study. Biom J. 2008;6(50):1077–92.CrossRef Schmidtmann I. Estimating completeness in cancer registries --comparing capture-recapture methods in a simulation study. Biom J. 2008;6(50):1077–92.CrossRef
22.
go back to reference Silcocks PB, Robinson D. Completeness of ascertainment by cancer registries: putting bounds on the number of missing cases. J Public Health (Oxf). 2004;26(2):161–7.CrossRef Silcocks PB, Robinson D. Completeness of ascertainment by cancer registries: putting bounds on the number of missing cases. J Public Health (Oxf). 2004;26(2):161–7.CrossRef
23.
go back to reference Chao A, Pan HY, Chiang SC. The Petersen–Lincoln Estimator and its extension to estimate the size of a shared population. Biom J. 2008;6(50):957–70.CrossRef Chao A, Pan HY, Chiang SC. The Petersen–Lincoln Estimator and its extension to estimate the size of a shared population. Biom J. 2008;6(50):957–70.CrossRef
24.
go back to reference Mao CX. Computing an NPMLE for a mixing distribution in two closed heterogeneous population size models. Biom J. 2008;6(50):983–92.CrossRef Mao CX. Computing an NPMLE for a mixing distribution in two closed heterogeneous population size models. Biom J. 2008;6(50):983–92.CrossRef
25.
go back to reference Manrique-Vallier D, Fienberg SE. Population size estimation using individual level mixture models. Biom J. 2008;6(50):1051–63.CrossRef Manrique-Vallier D, Fienberg SE. Population size estimation using individual level mixture models. Biom J. 2008;6(50):1051–63.CrossRef
26.
go back to reference Otis DL, Burnham KP, White GC, Anderson DR. Statistical inference from capture data on closed animal populations. Wildlife Monographs. 1978;62:1–135. Otis DL, Burnham KP, White GC, Anderson DR. Statistical inference from capture data on closed animal populations. Wildlife Monographs. 1978;62:1–135.
27.
go back to reference King R, Brooks SP. On the Bayesian estimation of a closed population size in the presence of heterogeneity and model uncertainty. Biometrics. 2008;64(3):816–24.CrossRefPubMed King R, Brooks SP. On the Bayesian estimation of a closed population size in the presence of heterogeneity and model uncertainty. Biometrics. 2008;64(3):816–24.CrossRefPubMed
28.
go back to reference Bailly L, Daurès JP, Pradier C. Investigating the completeness of a histopathological cancer registry: estimation by capture-recapture analysis in a French geographical unit Alpes-Maritimes, 2008. Cancer Epidemiol. 2011;35(6):62–8.CrossRef Bailly L, Daurès JP, Pradier C. Investigating the completeness of a histopathological cancer registry: estimation by capture-recapture analysis in a French geographical unit Alpes-Maritimes, 2008. Cancer Epidemiol. 2011;35(6):62–8.CrossRef
29.
go back to reference Chao DY, Shau WY, Lu CWK, Chen KT, Chu CL, Shu HM, et al. A large outbreak of hepatitis A in a college school in Taiwan: associated with contaminated food and water dissemination. Taiwan Government: Epidemiology Bulletin, Department of Health, Executive Yuan; 1997. Chao DY, Shau WY, Lu CWK, Chen KT, Chu CL, Shu HM, et al. A large outbreak of hepatitis A in a college school in Taiwan: associated with contaminated food and water dissemination. Taiwan Government: Epidemiology Bulletin, Department of Health, Executive Yuan; 1997.
30.
go back to reference Bruno GB, Biggeri A, LaPorte RE, McCarty D, Merletti F, Pagono G. Application of capture-recapture to count diabetes. Diabetes Care. 1994;17:548–56.CrossRefPubMed Bruno GB, Biggeri A, LaPorte RE, McCarty D, Merletti F, Pagono G. Application of capture-recapture to count diabetes. Diabetes Care. 1994;17:548–56.CrossRefPubMed
31.
go back to reference Wittes JT, Colton T, Sidel VW. Capture-recapture methods for assessing the completeness of cases ascertainment when using multiple information sources. J Chronic Dis. 1974;27:25–36.CrossRefPubMed Wittes JT, Colton T, Sidel VW. Capture-recapture methods for assessing the completeness of cases ascertainment when using multiple information sources. J Chronic Dis. 1974;27:25–36.CrossRefPubMed
32.
go back to reference Fienberg SE. The multiple recapture census for closed populations and incomplete 2 k contingency tables. Biometrika. 1972;59:591–603. Fienberg SE. The multiple recapture census for closed populations and incomplete 2 k contingency tables. Biometrika. 1972;59:591–603.
33.
go back to reference Pledger S. Unified maximum likelihood estimates for closed capture-recapture models using mixtures. Biometrics. 2000;56(2):434–42.CrossRefPubMed Pledger S. Unified maximum likelihood estimates for closed capture-recapture models using mixtures. Biometrics. 2000;56(2):434–42.CrossRefPubMed
34.
go back to reference Hoeting JA, Madigan D, Raftery AE, Kronmal RA. Bayesian model averaging: a tutorial. Stat Sci. 1999;14(4):382–417.CrossRef Hoeting JA, Madigan D, Raftery AE, Kronmal RA. Bayesian model averaging: a tutorial. Stat Sci. 1999;14(4):382–417.CrossRef
35.
go back to reference Gelfand AE, Smith AFM. Sampling-based approaches to calculating marginal densities. J Am Stat Assoc. 1990;85(410):398–409.CrossRef Gelfand AE, Smith AFM. Sampling-based approaches to calculating marginal densities. J Am Stat Assoc. 1990;85(410):398–409.CrossRef
36.
go back to reference Lunn DJ, Thomas A, Best N, Spiegelhalter D. WinBUGS – a Bayesian modelling framework: concepts, structure, and extensibility. Stat Com. 2000;10:325–37.CrossRef Lunn DJ, Thomas A, Best N, Spiegelhalter D. WinBUGS – a Bayesian modelling framework: concepts, structure, and extensibility. Stat Com. 2000;10:325–37.CrossRef
37.
go back to reference Link WA, Barker RJ. Bayesian Inference with ecological applications. Elsevier, London: Academic; 2010. p. 201–24. Link WA, Barker RJ. Bayesian Inference with ecological applications. Elsevier, London: Academic; 2010. p. 201–24.
38.
go back to reference Tilling K. Capture–recapture methods–useful or misleading ? Int J Epidemiol. 2001;30(1):12–4.CrossRefPubMed Tilling K. Capture–recapture methods–useful or misleading ? Int J Epidemiol. 2001;30(1):12–4.CrossRefPubMed
39.
go back to reference Brenner H, Stegmaier C, Ziegler H. Estimating completeness of cancer registration: an empirical evaluation of the two source capture-recapture approach in Germany. J Epidemiol Community Health. 1995;49(4):426–30.CrossRefPubMedPubMedCentral Brenner H, Stegmaier C, Ziegler H. Estimating completeness of cancer registration: an empirical evaluation of the two source capture-recapture approach in Germany. J Epidemiol Community Health. 1995;49(4):426–30.CrossRefPubMedPubMedCentral
40.
go back to reference Coull BA, Agresti A. The use of mixed logit models to reflect heterogeneity in capture-recapture studies. Biometrics. 1999;55:294–301.CrossRefPubMed Coull BA, Agresti A. The use of mixed logit models to reflect heterogeneity in capture-recapture studies. Biometrics. 1999;55:294–301.CrossRefPubMed
Metadata
Title
Bayesian estimation of a cancer population by capture-recapture with individual capture heterogeneity and small sample
Authors
Laurent Bailly
Jean Pierre Daurès
Brigitte Dunais
Christian Pradier
Publication date
01-12-2015
Publisher
BioMed Central
Published in
BMC Medical Research Methodology / Issue 1/2015
Electronic ISSN: 1471-2288
DOI
https://doi.org/10.1186/s12874-015-0029-7

Other articles of this Issue 1/2015

BMC Medical Research Methodology 1/2015 Go to the issue