Skip to main content
Top
Published in: BMC Medical Informatics and Decision Making 1/2005

Open Access 01-12-2005 | Research article

Exploring cancer register data to find risk factors for recurrence of breast cancer – application of Canonical Correlation Analysis

Authors: Amir R Razavi, Hans Gill, Olle Stål, Marie Sundquist, Sten Thorstenson, Hans Åhlfeldt, Nosrat Shahsavar, the South-East Swedish Breast Cancer Study Group

Published in: BMC Medical Informatics and Decision Making | Issue 1/2005

Login to get access

Abstract

Background

A common approach in exploring register data is to find relationships between outcomes and predictors by using multiple regression analysis (MRA). If there is more than one outcome variable, the analysis must then be repeated, and the results combined in some arbitrary fashion. In contrast, Canonical Correlation Analysis (CCA) has the ability to analyze multiple outcomes at the same time.
One essential outcome after breast cancer treatment is recurrence of the disease. It is important to understand the relationship between different predictors and recurrence, including the time interval until recurrence. This study describes the application of CCA to find important predictors for two different outcomes for breast cancer patients, loco-regional recurrence and occurrence of distant metastasis and to decrease the number of variables in the sets of predictors and outcomes without decreasing the predictive strength of the model.

Methods

Data for 637 malignant breast cancer patients admitted in the south-east region of Sweden were analyzed. By using CCA and looking at the structure coefficients (loadings), relationships between tumor specifications and the two outcomes during different time intervals were analyzed and a correlation model was built.

Results

The analysis successfully detected known predictors for breast cancer recurrence during the first two years and distant metastasis 2–4 years after diagnosis. Nottingham Histologic Grading (NHG) was the most important predictor, while age of the patient at the time of diagnosis was not an important predictor.

Conclusion

In cancer registers with high dimensionality, CCA can be used for identifying the importance of risk factors for breast cancer recurrence. This technique can result in a model ready for further processing by data mining methods through reducing the number of variables to important ones.
Appendix
Available only for authorised users
Literature
1.
go back to reference Tejler G, Norberg B, Dufmats M, Nordenskjold B: Survival after treatment for breast cancer in a geographically defined population. The British journal of surgery. 2004, 91 (10): 1307-1312. 10.1002/bjs.4697.CrossRefPubMed Tejler G, Norberg B, Dufmats M, Nordenskjold B: Survival after treatment for breast cancer in a geographically defined population. The British journal of surgery. 2004, 91 (10): 1307-1312. 10.1002/bjs.4697.CrossRefPubMed
2.
go back to reference Berg P, Wennberg AM, Tuominen R, Sander B, Rozell BL, Platz A, Hansson J: Germline CDKN2A mutations are rare in child and adolescent cutaneous melanoma. Melanoma Research. 2004, 14 (4): 251-255. 10.1097/01.cmr.0000131014.79262.bf.CrossRefPubMed Berg P, Wennberg AM, Tuominen R, Sander B, Rozell BL, Platz A, Hansson J: Germline CDKN2A mutations are rare in child and adolescent cutaneous melanoma. Melanoma Research. 2004, 14 (4): 251-255. 10.1097/01.cmr.0000131014.79262.bf.CrossRefPubMed
3.
go back to reference Bull A, Mountney L, Sanderson H: Use of a cancer register for monitoring and evaluating a breast cancer screening programme. Community Medicine. 1989, 11 (3): 220-224.PubMed Bull A, Mountney L, Sanderson H: Use of a cancer register for monitoring and evaluating a breast cancer screening programme. Community Medicine. 1989, 11 (3): 220-224.PubMed
4.
go back to reference Feltelius N, Ekbom A, Blomqvist P: Cancer incidence among patients with ankylosing spondylitis in Sweden 1965-95: a population based cohort study. Annals of the rheumatic diseases. 2003, 62 (12): 1185-1188. 10.1136/ard.2002.004721.CrossRefPubMedPubMedCentral Feltelius N, Ekbom A, Blomqvist P: Cancer incidence among patients with ankylosing spondylitis in Sweden 1965-95: a population based cohort study. Annals of the rheumatic diseases. 2003, 62 (12): 1185-1188. 10.1136/ard.2002.004721.CrossRefPubMedPubMedCentral
5.
go back to reference Vogel RL, Ackermann RJ: Is primary care physician supply correlated with health outcomes?. International journal of health services. 1998, 28 (1): 183-196.CrossRefPubMed Vogel RL, Ackermann RJ: Is primary care physician supply correlated with health outcomes?. International journal of health services. 1998, 28 (1): 183-196.CrossRefPubMed
6.
go back to reference Levine MS: Canonical Analysis and Factor Comparison. Quantitative Applications in the Social Sciences series. 1977, Sage Publications Inc, 6: Levine MS: Canonical Analysis and Factor Comparison. Quantitative Applications in the Social Sciences series. 1977, Sage Publications Inc, 6:
7.
go back to reference Cox DR: Regression models and life tables (with discussion). Journal of the Royal Statistical Society: Series B. 1972, 34: 187–220- Cox DR: Regression models and life tables (with discussion). Journal of the Royal Statistical Society: Series B. 1972, 34: 187–220-
8.
go back to reference Sundquist M, Thorstenson S, Brudin L, Nordenskjold B: Applying the Nottingham Prognostic Index to a Swedish breast cancer population. South East Swedish Breast Cancer Study Group. Breast cancer research and treatment. 1999, 53 (1): 1-8. 10.1023/A:1006052115874.CrossRefPubMed Sundquist M, Thorstenson S, Brudin L, Nordenskjold B: Applying the Nottingham Prognostic Index to a Swedish breast cancer population. South East Swedish Breast Cancer Study Group. Breast cancer research and treatment. 1999, 53 (1): 1-8. 10.1023/A:1006052115874.CrossRefPubMed
9.
go back to reference McLachlan GJ, Krishnan T: The EM algorithm and extensions. 1997, John Wiley & Sons McLachlan GJ, Krishnan T: The EM algorithm and extensions. 1997, John Wiley & Sons
10.
go back to reference McKeon JJ: Canonical analysis: Some relations between canonical correlation, factor analysis, discriminant function analysis, and scaling theory. Psychometric Monographs. 1965, Chicago , Univ. of Chicago Press, 13: McKeon JJ: Canonical analysis: Some relations between canonical correlation, factor analysis, discriminant function analysis, and scaling theory. Psychometric Monographs. 1965, Chicago , Univ. of Chicago Press, 13:
11.
go back to reference Lambert ZV, Durand RM: Some precautions in using canonical analysis. Journal of Marketing Research. 1975, 7: 468-475.CrossRef Lambert ZV, Durand RM: Some precautions in using canonical analysis. Journal of Marketing Research. 1975, 7: 468-475.CrossRef
12.
go back to reference Thompson B: Canonical correlation analysis: Uses and interpretation. Quantitative Applications in the Social Sciences. 1984 , Thousand Oaks, CA , Sage Publications, 47 (47): Thompson B: Canonical correlation analysis: Uses and interpretation. Quantitative Applications in the Social Sciences. 1984 , Thousand Oaks, CA , Sage Publications, 47 (47):
13.
go back to reference Ridderstolpe L: Canonical correlation analysis of risk factors and clinical outcomes in cardiac surgery. Dept of Biomedical Engineering. 2003, Linköping University, Sweden Ridderstolpe L: Canonical correlation analysis of risk factors and clinical outcomes in cardiac surgery. Dept of Biomedical Engineering. 2003, Linköping University, Sweden
14.
go back to reference SPSS Inc.: SPSS for Windows. 2001, Chicago , SPSS Inc., 11.0.1 SPSS Inc.: SPSS for Windows. 2001, Chicago , SPSS Inc., 11.0.1
15.
16.
go back to reference Cooley WW, Lohnes PR: Multivariate data analysis. 1971, New York , Wiley Cooley WW, Lohnes PR: Multivariate data analysis. 1971, New York , Wiley
17.
go back to reference Dunlap WP, Landis RS: Interpretations of multiple regression borrowed from factor analysis and canonical correlation. Journal of General Psychology. 1998, 125 (4): 397-407.CrossRef Dunlap WP, Landis RS: Interpretations of multiple regression borrowed from factor analysis and canonical correlation. Journal of General Psychology. 1998, 125 (4): 397-407.CrossRef
18.
go back to reference Barcikowski R, Stevens JP: A Monte Carlo study of the stability of canonical correlations, canonical weights, and canonical variate-variable correlations. Multivariate behavioral research. 1975, 10: 353-364.CrossRefPubMed Barcikowski R, Stevens JP: A Monte Carlo study of the stability of canonical correlations, canonical weights, and canonical variate-variable correlations. Multivariate behavioral research. 1975, 10: 353-364.CrossRefPubMed
19.
go back to reference Pugh RC, Hu Y: Use and interpretation of canonical correlation analyses in Journal of Educational Research articles: 1978-1989. Journal of Educational Research. 1991, 84 (3): 147-152.CrossRef Pugh RC, Hu Y: Use and interpretation of canonical correlation analyses in Journal of Educational Research articles: 1978-1989. Journal of Educational Research. 1991, 84 (3): 147-152.CrossRef
20.
go back to reference Elston CW, Ellis IO: Pathological prognostic factors in breast cancer. I. The value of histological grade in breast cancer: experience from a large study with long-term follow-up. Histopathology. 1991, 19 (5): 403-410.CrossRefPubMed Elston CW, Ellis IO: Pathological prognostic factors in breast cancer. I. The value of histological grade in breast cancer: experience from a large study with long-term follow-up. Histopathology. 1991, 19 (5): 403-410.CrossRefPubMed
21.
go back to reference Wenger CR, Clark GM: S-phase fraction and breast cancer--a decade of experience. Breast Cancer Res Treat. 1998, 51 (3): 255-265. 10.1023/A:1006188512927.CrossRefPubMed Wenger CR, Clark GM: S-phase fraction and breast cancer--a decade of experience. Breast Cancer Res Treat. 1998, 51 (3): 255-265. 10.1023/A:1006188512927.CrossRefPubMed
22.
go back to reference Sundquist M, Thorstenson S, Brudin L, Wingren S, Nordenskjold B: Incidence and prognosis in early onset breast cancer. Breast. 2002, 11 (1): 30-35. 10.1054/brst.2001.0358.CrossRefPubMed Sundquist M, Thorstenson S, Brudin L, Wingren S, Nordenskjold B: Incidence and prognosis in early onset breast cancer. Breast. 2002, 11 (1): 30-35. 10.1054/brst.2001.0358.CrossRefPubMed
23.
go back to reference Adami HO, Graffman S, Johansson H, Rimsten A: Survival and recurrences five years after selective treatment for breast carcinoma. British Journal of Cancer. 1978, 38 (5): 624-630.CrossRefPubMedPubMedCentral Adami HO, Graffman S, Johansson H, Rimsten A: Survival and recurrences five years after selective treatment for breast carcinoma. British Journal of Cancer. 1978, 38 (5): 624-630.CrossRefPubMedPubMedCentral
24.
go back to reference Ciocca DR, Elledge R: Molecular markers for predicting response to tamoxifen in breast cancer patients. Endocrine. 2000, 13 (1): 1-10. 10.1385/ENDO:13:1:1.CrossRefPubMed Ciocca DR, Elledge R: Molecular markers for predicting response to tamoxifen in breast cancer patients. Endocrine. 2000, 13 (1): 1-10. 10.1385/ENDO:13:1:1.CrossRefPubMed
25.
go back to reference Rzymowska J, Skierski J, Kurylcio L, Dyrda Z: DNA index as prognostic factor in breast cancer. Neoplasma. 1995, 42 (5): 239-242.PubMed Rzymowska J, Skierski J, Kurylcio L, Dyrda Z: DNA index as prognostic factor in breast cancer. Neoplasma. 1995, 42 (5): 239-242.PubMed
26.
go back to reference Wenger CR, Beardslee S, Owens MA, Pounds G, Oldaker T, Vendely P, Pandian MR, Harrington D, Clark GM, McGuire WL: DNA ploidy, S-phase, and steroid receptors in more than 127,000 breast cancer patients. Breast Cancer Res Treat. 1993, 28 (1): 9-20. 10.1007/BF00666351.CrossRefPubMed Wenger CR, Beardslee S, Owens MA, Pounds G, Oldaker T, Vendely P, Pandian MR, Harrington D, Clark GM, McGuire WL: DNA ploidy, S-phase, and steroid receptors in more than 127,000 breast cancer patients. Breast Cancer Res Treat. 1993, 28 (1): 9-20. 10.1007/BF00666351.CrossRefPubMed
27.
go back to reference Lyman GH, Lyman S, Balducci L, Kuderer N, Reintgen D, Cox C, Baekey P, Greenberg H, Horton J: Age and the Risk of Breast Cancer Recurrence. Cancer Control. 1996, 3 (5): 421-427.PubMed Lyman GH, Lyman S, Balducci L, Kuderer N, Reintgen D, Cox C, Baekey P, Greenberg H, Horton J: Age and the Risk of Breast Cancer Recurrence. Cancer Control. 1996, 3 (5): 421-427.PubMed
Metadata
Title
Exploring cancer register data to find risk factors for recurrence of breast cancer – application of Canonical Correlation Analysis
Authors
Amir R Razavi
Hans Gill
Olle Stål
Marie Sundquist
Sten Thorstenson
Hans Åhlfeldt
Nosrat Shahsavar
the South-East Swedish Breast Cancer Study Group
Publication date
01-12-2005
Publisher
BioMed Central
Published in
BMC Medical Informatics and Decision Making / Issue 1/2005
Electronic ISSN: 1472-6947
DOI
https://doi.org/10.1186/1472-6947-5-29

Other articles of this Issue 1/2005

BMC Medical Informatics and Decision Making 1/2005 Go to the issue