Skip to main content
Top
Published in: Journal of Medical Systems 10/2014

01-10-2014 | Patient Facing Systems

Construction the Model on the Breast Cancer Survival Analysis Use Support Vector Machine, Logistic Regression and Decision Tree

Authors: Cheng-Min Chao, Ya-Wen Yu, Bor-Wen Cheng, Yao-Lung Kuo

Published in: Journal of Medical Systems | Issue 10/2014

Login to get access

Abstract

The aim of the paper is to use data mining technology to establish a classification of breast cancer survival patterns, and offers a treatment decision-making reference for the survival ability of women diagnosed with breast cancer in Taiwan. We studied patients with breast cancer in a specific hospital in Central Taiwan to obtain 1,340 data sets. We employed a support vector machine, logistic regression, and a C5.0 decision tree to construct a classification model of breast cancer patients’ survival rates, and used a 10-fold cross-validation approach to identify the model. The results show that the establishment of classification tools for the classification of the models yielded an average accuracy rate of more than 90 % for both; the SVM provided the best method for constructing the three categories of the classification system for the survival mode. The results of the experiment show that the three methods used to create the classification system, established a high accuracy rate, predicted a more accurate survival ability of women diagnosed with breast cancer, and could be used as a reference when creating a medical decision-making frame.
Literature
4.
go back to reference Jerez-Aragonés, J. M., Gomez-Ruiz, J. A., Ramos-Jimenez, G., Munoz-Perez, J., and Alba-Conejo, E., A combined neural network and decision trees model for prognosis of breast cancer relapse. Artif. Intell. Med. 27(1):45–63, 2003. doi:10.1016/S0933-3657(02)00086-6.CrossRef Jerez-Aragonés, J. M., Gomez-Ruiz, J. A., Ramos-Jimenez, G., Munoz-Perez, J., and Alba-Conejo, E., A combined neural network and decision trees model for prognosis of breast cancer relapse. Artif. Intell. Med. 27(1):45–63, 2003. doi:10.​1016/​S0933-3657(02)00086-6.CrossRef
5.
go back to reference O’Malley, C. D., Le, G. M., Glaser, S. L., Shema, S. J., and West, D. W., Socioeconomic status and breast carcinoma survival in four racial/ethnic groups: A population-based study. Am. Cancer Soc. 97(5):1303–1311, 2003. doi:10.1002/cncr.11160. O’Malley, C. D., Le, G. M., Glaser, S. L., Shema, S. J., and West, D. W., Socioeconomic status and breast carcinoma survival in four racial/ethnic groups: A population-based study. Am. Cancer Soc. 97(5):1303–1311, 2003. doi:10.​1002/​cncr.​11160.
6.
go back to reference Nahar, J., Imam, T., Tickle, K. S., Ali, A. B. M. S., and Chen, Y.-P. P., Computational intelligence for microarray data and biomedical image analysis for the early diagnosis of breast cancer. Expert Syst. Appl. 39(16):12371–12377, 2012. doi:10.1016/j.eswa.2012.04.045.CrossRef Nahar, J., Imam, T., Tickle, K. S., Ali, A. B. M. S., and Chen, Y.-P. P., Computational intelligence for microarray data and biomedical image analysis for the early diagnosis of breast cancer. Expert Syst. Appl. 39(16):12371–12377, 2012. doi:10.​1016/​j.​eswa.​2012.​04.​045.CrossRef
9.
10.
go back to reference D’Eredita, G., Giardina, C., Martellotta, M., Natale, T., and Ferrarese, F., Prognostic factors in breast cancer: the predictive value of the Nottingham Prognostic Index in patients with a long-term follow-up that were treated in a single institution. Eur. J. Cancer 37(1):591–596, 2001. doi:10.1016/s0959-8049(00)00435-4.CrossRef D’Eredita, G., Giardina, C., Martellotta, M., Natale, T., and Ferrarese, F., Prognostic factors in breast cancer: the predictive value of the Nottingham Prognostic Index in patients with a long-term follow-up that were treated in a single institution. Eur. J. Cancer 37(1):591–596, 2001. doi:10.​1016/​s0959-8049(00)00435-4.CrossRef
12.
go back to reference Chhatwal, J., Alagoz, O., Lindstrom, M. J., Kahn, C. E., Jr., Shaffer, K. A., and Burnside, E. S., A logistic regression model based on the national mammography database format to aid breast cancer diagnosis. Am. J. Roentgenol. 192(4):1117–1127, 2009. doi:10.2214/AJR.07.3345.CrossRef Chhatwal, J., Alagoz, O., Lindstrom, M. J., Kahn, C. E., Jr., Shaffer, K. A., and Burnside, E. S., A logistic regression model based on the national mammography database format to aid breast cancer diagnosis. Am. J. Roentgenol. 192(4):1117–1127, 2009. doi:10.​2214/​AJR.​07.​3345.CrossRef
13.
go back to reference Richards, G., Rayward-Smith, V. J., Sonksen, P. H., Carey, S., and Weng, C., Data mining for indicators of early mortality in a database of clinical records. Artif. Intell. Med. 22(3):215–231, 2001. doi:10.1016/S0933-3657(00)00110-X.CrossRef Richards, G., Rayward-Smith, V. J., Sonksen, P. H., Carey, S., and Weng, C., Data mining for indicators of early mortality in a database of clinical records. Artif. Intell. Med. 22(3):215–231, 2001. doi:10.​1016/​S0933-3657(00)00110-X.CrossRef
14.
go back to reference Pendharkar, P. C., Rodger, J. A., Yaverbaum, G., Herman, N., and Benner, M., Association, statistical, mathematical and neural approaches for mining breast cancer patterns. Expert Syst. Appl. 17(3):223–232, 1999. doi:10.1016/S0957-4174(99)00036-6.CrossRef Pendharkar, P. C., Rodger, J. A., Yaverbaum, G., Herman, N., and Benner, M., Association, statistical, mathematical and neural approaches for mining breast cancer patterns. Expert Syst. Appl. 17(3):223–232, 1999. doi:10.​1016/​S0957-4174(99)00036-6.CrossRef
15.
go back to reference Acharya, U. R., Ng, E. Y., Tan, J. H., and Sree, S. V., Thermography based breast cancer detection using texture features and Support Vector Machine. J. Med. Syst. 36(3):1503–1510, 2012. doi:10.1007/s10916-010-9611-z.CrossRef Acharya, U. R., Ng, E. Y., Tan, J. H., and Sree, S. V., Thermography based breast cancer detection using texture features and Support Vector Machine. J. Med. Syst. 36(3):1503–1510, 2012. doi:10.​1007/​s10916-010-9611-z.CrossRef
18.
go back to reference Huang, M. L., Hung, Y. H., et al., Usage of case-based reasoning, neural network and adaptive neuro-fuzzy inference system classification techniques in breast cancer dataset classification diagnosis. J. Med. Syst. 36(2):407–414, 2012.MathSciNetCrossRef Huang, M. L., Hung, Y. H., et al., Usage of case-based reasoning, neural network and adaptive neuro-fuzzy inference system classification techniques in breast cancer dataset classification diagnosis. J. Med. Syst. 36(2):407–414, 2012.MathSciNetCrossRef
20.
23.
24.
go back to reference Stoean, R., Stoean, C., et al., Evolutionary-driven support vector machines for determining the degree of liver fibrosis in chronic hepatitis C. Artif. Intell. Med. 51(1):53–65, 2011.MathSciNetCrossRef Stoean, R., Stoean, C., et al., Evolutionary-driven support vector machines for determining the degree of liver fibrosis in chronic hepatitis C. Artif. Intell. Med. 51(1):53–65, 2011.MathSciNetCrossRef
25.
go back to reference Cristianini, N., and Taylor, J., An introduction to support vector machines. Cambridge University Press, Cambridge, UK, 2000. Cristianini, N., and Taylor, J., An introduction to support vector machines. Cambridge University Press, Cambridge, UK, 2000.
26.
go back to reference Quinlan, J. R., C4.5: Programs for machine learning. Morgan Kaufmann Publishers, San Mateo, 1993. Quinlan, J. R., C4.5: Programs for machine learning. Morgan Kaufmann Publishers, San Mateo, 1993.
28.
go back to reference Pradhan, B., A comparative study on the predictive ability of the decision tree, support vector machine and neuro-fuzzy models in landslide susceptibility mapping using GIS. Comput. Geosci. 51(1):350–365, 2013.CrossRef Pradhan, B., A comparative study on the predictive ability of the decision tree, support vector machine and neuro-fuzzy models in landslide susceptibility mapping using GIS. Comput. Geosci. 51(1):350–365, 2013.CrossRef
30.
go back to reference Biggs, D., et al., A method of choosing multiway partitions for classification and decision trees. J. Appl. Stat. 18(1):49–62, 1991.CrossRef Biggs, D., et al., A method of choosing multiway partitions for classification and decision trees. J. Appl. Stat. 18(1):49–62, 1991.CrossRef
31.
go back to reference Breiman, L., Friedman, J. H., Olshen, R. A., and Stone, C. J., Classification and regression trees. Wadsworth & Brooks/Cole Advanced Books & Software, Monterey, CA, 1984.MATH Breiman, L., Friedman, J. H., Olshen, R. A., and Stone, C. J., Classification and regression trees. Wadsworth & Brooks/Cole Advanced Books & Software, Monterey, CA, 1984.MATH
33.
go back to reference Szalay, A., and Gray, J., Science in an exponential world. Nature 440(1):413–414, 2006.CrossRef Szalay, A., and Gray, J., Science in an exponential world. Nature 440(1):413–414, 2006.CrossRef
Metadata
Title
Construction the Model on the Breast Cancer Survival Analysis Use Support Vector Machine, Logistic Regression and Decision Tree
Authors
Cheng-Min Chao
Ya-Wen Yu
Bor-Wen Cheng
Yao-Lung Kuo
Publication date
01-10-2014
Publisher
Springer US
Published in
Journal of Medical Systems / Issue 10/2014
Print ISSN: 0148-5598
Electronic ISSN: 1573-689X
DOI
https://doi.org/10.1007/s10916-014-0106-1

Other articles of this Issue 10/2014

Journal of Medical Systems 10/2014 Go to the issue