Skip to main content
Top
Published in: Journal of Medical Systems 4/2012

01-08-2012 | ORIGINAL PAPER

SVM Feature Selection Based Rotation Forest Ensemble Classifiers to Improve Computer-Aided Diagnosis of Parkinson Disease

Author: Akin Ozcift

Published in: Journal of Medical Systems | Issue 4/2012

Login to get access

Abstract

Parkinson disease (PD) is an age-related deterioration of certain nerve systems, which affects movement, balance, and muscle control of clients. PD is one of the common diseases which affect 1% of people older than 60 years. A new classification scheme based on support vector machine (SVM) selected features to train rotation forest (RF) ensemble classifiers is presented for improving diagnosis of PD. The dataset contains records of voice measurements from 31 people, 23 with PD and each record in the dataset is defined with 22 features. The diagnosis model first makes use of a linear SVM to select ten most relevant features from 22. As a second step of the classification model, six different classifiers are trained with the subset of features. Subsequently, at the third step, the accuracies of classifiers are improved by the utilization of RF ensemble classification strategy. The results of the experiments are evaluated using three metrics; classification accuracy (ACC), Kappa Error (KE) and Area under the Receiver Operating Characteristic (ROC) Curve (AUC). Performance measures of two base classifiers, i.e. KStar and IBk, demonstrated an apparent increase in PD diagnosis accuracy compared to similar studies in literature. After all, application of RF ensemble classification scheme improved PD diagnosis in 5 of 6 classifiers significantly. We, numerically, obtained about 97% accuracy in RF ensemble of IBk (a K-Nearest Neighbor variant) algorithm, which is a quite high performance for Parkinson disease diagnosis.
Literature
1.
go back to reference Wahed, M., and Wahba, K., Data mining based-assistant tools for physicians to diagnose diseases, Micro-nano mechatronics and human science, 2003 IEEE Int. Symp. 388–391, 2003. Wahed, M., and Wahba, K., Data mining based-assistant tools for physicians to diagnose diseases, Micro-nano mechatronics and human science, 2003 IEEE Int. Symp. 388–391, 2003.
2.
go back to reference Ozcift, A., and Gulten, A., Assessing effects of preprocessing mass spectrometry data on classification performance. Eur. J. Mass Spectrom. 267-273, 2008. Ozcift, A., and Gulten, A., Assessing effects of preprocessing mass spectrometry data on classification performance. Eur. J. Mass Spectrom. 267-273, 2008.
3.
go back to reference Kononenko, I., Machine learning for medical diagnosis: History, state of the art and perspective. Artif. Intell. Med. 23(1):89–109, 2001.MathSciNetCrossRef Kononenko, I., Machine learning for medical diagnosis: History, state of the art and perspective. Artif. Intell. Med. 23(1):89–109, 2001.MathSciNetCrossRef
4.
go back to reference Djebbari, A., An ensemble machine learning approach to predict survival in breast cancer. Int. J. Comput. Biol. Drug Des. 1(3):275–294, 2008.CrossRef Djebbari, A., An ensemble machine learning approach to predict survival in breast cancer. Int. J. Comput. Biol. Drug Des. 1(3):275–294, 2008.CrossRef
5.
go back to reference Das, R., and Sengur, A., Evaluation of ensemble methods for diagnosing of valvular heart disease. Expert Syst. Appl. 2010. Das, R., and Sengur, A., Evaluation of ensemble methods for diagnosing of valvular heart disease. Expert Syst. Appl. 2010.
6.
go back to reference Duangsoithong, R., and Windeatt, T., Relevant and redundant feature analysis with ensemble classification. Seventh International Conference on Advances in Pattern Recognition. 247-250, 2009. Duangsoithong, R., and Windeatt, T., Relevant and redundant feature analysis with ensemble classification. Seventh International Conference on Advances in Pattern Recognition. 247-250, 2009.
7.
go back to reference Shadabi, F., Sharma, D., and Cox, R., Learning from ensembles: Using artificial neural network ensemble for medical outcomes prediction. Innovations in Information Technology, IEEE. 1-5, 2006. Shadabi, F., Sharma, D., and Cox, R., Learning from ensembles: Using artificial neural network ensemble for medical outcomes prediction. Innovations in Information Technology, IEEE. 1-5, 2006.
8.
go back to reference Bruha, I., Meta-learner for unknown attribute values processing: Dealing with inconsistency of meta-databases. J.I I.S. 71-87, 2004. Bruha, I., Meta-learner for unknown attribute values processing: Dealing with inconsistency of meta-databases. J.I I.S. 71-87, 2004.
9.
go back to reference Polikar, R., An ensemble based data fusion approach for early diagnosis of Alzheimer’s disease. Inf. Fusion. 83-95, 2008. Polikar, R., An ensemble based data fusion approach for early diagnosis of Alzheimer’s disease. Inf. Fusion. 83-95, 2008.
10.
go back to reference Das, R., A comparison of multiple classification methods for diagnosis of Parkinson disease. Expert Syst. Appl. 1568-1572, 2010. Das, R., A comparison of multiple classification methods for diagnosis of Parkinson disease. Expert Syst. Appl. 1568-1572, 2010.
11.
go back to reference Little, M., and McSharry, P., Suitability of dysphonia measurements for telemonitoring of Parkinson’s disease. Nature Precedings. 1-27, 2008. Little, M., and McSharry, P., Suitability of dysphonia measurements for telemonitoring of Parkinson’s disease. Nature Precedings. 1-27, 2008.
12.
go back to reference Chaudhuri, K., and Healy, D. G., Non-motor symptoms of Parkinson’s disease: Diagnosis and management. Lancet Neurol. 235–245, 2006. Chaudhuri, K., and Healy, D. G., Non-motor symptoms of Parkinson’s disease: Diagnosis and management. Lancet Neurol. 235–245, 2006.
13.
go back to reference Rosen, K., and Kent R. D., Parametric quantitative acoustic analysis of conversation produced by speakers with dysarthria and healthy speakers. J. Speech Lang. Hear. Res. 395–411, 2006. Rosen, K., and Kent R. D., Parametric quantitative acoustic analysis of conversation produced by speakers with dysarthria and healthy speakers. J. Speech Lang. Hear. Res. 395–411, 2006.
14.
go back to reference Little, M., and McSharry, P., Suitability of dysphonia measurements for telemonitoring of Parkinson’s disease. IEEE Trans. Biomed. Eng. 1-8, 2008. Little, M., and McSharry, P., Suitability of dysphonia measurements for telemonitoring of Parkinson’s disease. IEEE Trans. Biomed. Eng. 1-8, 2008.
15.
go back to reference Lee, M., and Boroczky, L., A two-step approach for feature selection and classifier ensemble construction in computer-aided diagnosis. Computer-Based Med. Syst. 548–553, 2008. Lee, M., and Boroczky, L., A two-step approach for feature selection and classifier ensemble construction in computer-aided diagnosis. Computer-Based Med. Syst. 548–553, 2008.
16.
go back to reference Martinez, A. M., and Zhu, M., Where are linear feature extraction methods applicable? IEEE Transaction on Pattern Analysis and Machine Intelligence, pp. 1934–1944, 2005. Martinez, A. M., and Zhu, M., Where are linear feature extraction methods applicable? IEEE Transaction on Pattern Analysis and Machine Intelligence, pp. 1934–1944, 2005.
17.
go back to reference Fodor, I. K., A survey of dimension reduction techniques, Department of Energy by the University of California. 1-7, 2002. Fodor, I. K., A survey of dimension reduction techniques, Department of Energy by the University of California. 1-7, 2002.
18.
go back to reference Saeys, Y., A review of feature selection techniques in bioinformatics. Bioinformatics. Review. 2507–2517, 2007. Saeys, Y., A review of feature selection techniques in bioinformatics. Bioinformatics. Review. 2507–2517, 2007.
19.
go back to reference Jong, K., Feature selection in proteomic pattern data with support vector machines, computational intelligence in bioinformatics and computational biology, Proceedings of the 2004 IEEE Symposium. 41–48, 2004. Jong, K., Feature selection in proteomic pattern data with support vector machines, computational intelligence in bioinformatics and computational biology, Proceedings of the 2004 IEEE Symposium. 41–48, 2004.
20.
go back to reference Chang, Y., Feature ranking using linear SVM, JMLR: workshop and conference proceedings. 53-64, 2008. Chang, Y., Feature ranking using linear SVM, JMLR: workshop and conference proceedings. 53-64, 2008.
21.
go back to reference Guyon, I., Gene selection for cancer classification using support vector machines. Mach. Learn. 389-422, 2002. Guyon, I., Gene selection for cancer classification using support vector machines. Mach. Learn. 389-422, 2002.
22.
go back to reference Kuncheva, L. I., Combining pattern classifiers: Methods and algorithms .Wiley. 251.267, 2004. Kuncheva, L. I., Combining pattern classifiers: Methods and algorithms .Wiley. 251.267, 2004.
23.
go back to reference Polikar, R., Ensemble based system in decision making. IEEE Circuits Syst. Mag. 21-44. 2006. Polikar, R., Ensemble based system in decision making. IEEE Circuits Syst. Mag. 21-44. 2006.
24.
go back to reference Zhang, C. and Zhang,J. S., RotBoost: a technique for combining rotation forest and adaboost. Pattern Recogn. Lett. 1524–1536, 2008. Zhang, C. and Zhang,J. S., RotBoost: a technique for combining rotation forest and adaboost. Pattern Recogn. Lett. 1524–1536, 2008.
25.
go back to reference Rodriguez, J., and Kuncheva L., Rotation forest: a new classifier ensemble method. IEEE Trans. Pattern Anal. Mach. Intell. 1619–1630, 2006. Rodriguez, J., and Kuncheva L., Rotation forest: a new classifier ensemble method. IEEE Trans. Pattern Anal. Mach. Intell. 1619–1630, 2006.
26.
go back to reference Kuncheva, L., and Rodriguez, J., An experimental study on rotation forest ensembles. Lect. Notes Comput. Sci. 459–468, 2007. Kuncheva, L., and Rodriguez, J., An experimental study on rotation forest ensembles. Lect. Notes Comput. Sci. 459–468, 2007.
27.
go back to reference Beale, R., and Jackson, T., Neural computing: an introduction. Institute of Physics Publishing, 1-22, 1990. Beale, R., and Jackson, T., Neural computing: an introduction. Institute of Physics Publishing, 1-22, 1990.
28.
go back to reference Aha, D. W., and Kibler, D., Instance-based learning algorithms. Mach. Learn. 37–66, 1991. Aha, D. W., and Kibler, D., Instance-based learning algorithms. Mach. Learn. 37–66, 1991.
29.
go back to reference Alpaydin, E., Introduction to machine learning. MIT Press, 173-197, 2004. Alpaydin, E., Introduction to machine learning. MIT Press, 173-197, 2004.
30.
go back to reference Witten, I. H. and Ian, H., Data mining : practical machine learning tools and techniques. Morgan Kaufmann Ser. Data Manage. Syst. 153-168, 2005. Witten, I. H. and Ian, H., Data mining : practical machine learning tools and techniques. Morgan Kaufmann Ser. Data Manage. Syst. 153-168, 2005.
31.
go back to reference Huang, J.,and Ling, C., Using AUC and accuracy in evaluating learning algorithms. IEEE Trans. Knowl. Data Eng. 299-310, 2005. Huang, J.,and Ling, C., Using AUC and accuracy in evaluating learning algorithms. IEEE Trans. Knowl. Data Eng. 299-310, 2005.
32.
go back to reference David, A., Comparison of classification accuracy using Cohen’s weighted kappa. Expert Syst. Appl. 825-832, 2008. David, A., Comparison of classification accuracy using Cohen’s weighted kappa. Expert Syst. Appl. 825-832, 2008.
33.
go back to reference Mangasarian, O., and Wolberg, W., Cancer diagnosis via linear programming. SIAM News. 1–18, 1990. Mangasarian, O., and Wolberg, W., Cancer diagnosis via linear programming. SIAM News. 1–18, 1990.
34.
go back to reference Kahn, M., Automated interpretation of diabetes patient data: Detecting temporal changes in insulin therapy. Proc Symp. Comp. Appl. Med. Care, IEEE Computer Society Press, 569–573, 1990. Kahn, M., Automated interpretation of diabetes patient data: Detecting temporal changes in insulin therapy. Proc Symp. Comp. Appl. Med. Care, IEEE Computer Society Press, 569–573, 1990.
Metadata
Title
SVM Feature Selection Based Rotation Forest Ensemble Classifiers to Improve Computer-Aided Diagnosis of Parkinson Disease
Author
Akin Ozcift
Publication date
01-08-2012
Publisher
Springer US
Published in
Journal of Medical Systems / Issue 4/2012
Print ISSN: 0148-5598
Electronic ISSN: 1573-689X
DOI
https://doi.org/10.1007/s10916-011-9678-1

Other articles of this Issue 4/2012

Journal of Medical Systems 4/2012 Go to the issue