Skip to main content
Top
Published in: Journal of Medical Systems 3/2012

01-06-2012 | ORIGINAL PAPER

Classification of Speech Dysfluencies Using LPC Based Parameterization Techniques

Authors: M. Hariharan, Lim Sin Chee, Ooi Chia Ai, Sazali Yaacob

Published in: Journal of Medical Systems | Issue 3/2012

Login to get access

Abstract

The goal of this paper is to discuss and compare three feature extraction methods: Linear Predictive Coefficients (LPC), Linear Prediction Cepstral Coefficients (LPCC) and Weighted Linear Prediction Cepstral Coefficients (WLPCC) for recognizing the stuttered events. Speech samples from the University College London Archive of Stuttered Speech (UCLASS) were used for our analysis. The stuttered events were identified through manual segmentation and were used for feature extraction. Two simple classifiers namely, k-nearest neighbour (kNN) and Linear Discriminant Analysis (LDA) were employed for speech dysfluencies classification. Conventional validation method was used for testing the reliability of the classifier results. The study on the effect of different frame length, percentage of overlapping, value of ã in a first order pre-emphasizer and different order p were discussed. The speech dysfluencies classification accuracy was found to be improved by applying statistical normalization before feature extraction. The experimental investigation elucidated LPC, LPCC and WLPCC features can be used for identifying the stuttered events and WLPCC features slightly outperforms LPCC features and LPC features.
Literature
1.
go back to reference Awad, S. S., The application of digital speech processing to stuttering therapy. In Instrumentation and Measurement Technology Conference, 1997. IMTC/97. Proceedings. 'Sensing, Processing, Networking'., IEEE, 1997, pp. 1361–1367. Awad, S. S., The application of digital speech processing to stuttering therapy. In Instrumentation and Measurement Technology Conference, 1997. IMTC/97. Proceedings. 'Sensing, Processing, Networking'., IEEE, 1997, pp. 1361–1367.
2.
go back to reference Van Borsel, J., Achten, E., Santens, P., Lahorte, P., and Voet, T., fMRI of developmental stuttering: a pilot study. Brain and Language 85:369–376, 2003.CrossRef Van Borsel, J., Achten, E., Santens, P., Lahorte, P., and Voet, T., fMRI of developmental stuttering: a pilot study. Brain and Language 85:369–376, 2003.CrossRef
3.
go back to reference Tian-Swee, T., Helbin, L., Ariff, A. K., Chee-Ming, T., and Salleh, S. H., Application of Malay speech technology in Malay Speech Therapy Assistance Tools. In Intelligent and Advanced Systems, 2007. ICIAS 2007. International Conference on, 2007, pp. 330–334. Tian-Swee, T., Helbin, L., Ariff, A. K., Chee-Ming, T., and Salleh, S. H., Application of Malay speech technology in Malay Speech Therapy Assistance Tools. In Intelligent and Advanced Systems, 2007. ICIAS 2007. International Conference on, 2007, pp. 330–334.
4.
go back to reference Ravikumar, K. M., Rajagopal, R., and Nagaraj, H. C., An approach for objective assessment of stuttered speech using MFCC features. ICGST International Journal on Digital Signal Processing DSP 9:19–24, 2009. Ravikumar, K. M., Rajagopal, R., and Nagaraj, H. C., An approach for objective assessment of stuttered speech using MFCC features. ICGST International Journal on Digital Signal Processing DSP 9:19–24, 2009.
5.
go back to reference Ravikumar, K., Reddy, B., Rajagopal, R., and Nagaraj, H., Automatic detection of syllable repetition in read speech for objective assessment of stuttered disfluencies. In Proceedings of World Academy Science, Engineering and Technology, 2008, pp. 270–273. Ravikumar, K., Reddy, B., Rajagopal, R., and Nagaraj, H., Automatic detection of syllable repetition in read speech for objective assessment of stuttered disfluencies. In Proceedings of World Academy Science, Engineering and Technology, 2008, pp. 270–273.
6.
go back to reference Howell, P., Sackin, S., and Glenn, K., Development of a two-stage procedure for the automatic recognition of dysfluencies in the speech of children who stutter: I. Psychometric procedures appropriate for selection of training material for lexical dysfluency classifiers. Journal of Speech, Language, and Hearing Research 40:1073, 1997. Howell, P., Sackin, S., and Glenn, K., Development of a two-stage procedure for the automatic recognition of dysfluencies in the speech of children who stutter: I. Psychometric procedures appropriate for selection of training material for lexical dysfluency classifiers. Journal of Speech, Language, and Hearing Research 40:1073, 1997.
7.
go back to reference Howell, P., Sackin, S., and Glenn, K., Development of a two-stage procedure for the automatic recognition of dysfluencies in the speech of children who stutter: II. ANN recognition of repetitions and prolongations with supplied word segment markers. Journal of Speech, Language, and Hearing Research 40:1085, 1997. Howell, P., Sackin, S., and Glenn, K., Development of a two-stage procedure for the automatic recognition of dysfluencies in the speech of children who stutter: II. ANN recognition of repetitions and prolongations with supplied word segment markers. Journal of Speech, Language, and Hearing Research 40:1085, 1997.
8.
go back to reference Nöth, E., Niemann, H., Haderlein, T., Decher, M., Eysholdt, U., Rosanowski, F., and Wittenberg, T., Automatic stuttering recognition using hidden Markov models. 2000. Nöth, E., Niemann, H., Haderlein, T., Decher, M., Eysholdt, U., Rosanowski, F., and Wittenberg, T., Automatic stuttering recognition using hidden Markov models. 2000.
9.
go back to reference Howell, P., and Sackin, S., Automatic recognition of repetitions and prolongations in stuttered speech. In Proceedings of the First World Congress on Fluency Disorders, 1995, pp. 372–374. Howell, P., and Sackin, S., Automatic recognition of repetitions and prolongations in stuttered speech. In Proceedings of the First World Congress on Fluency Disorders, 1995, pp. 372–374.
10.
go back to reference Geetha, Y. V., Pratibha, K., Ashok, R., and Ravindra, S. K., Classification of childhood disfluencies using neural networks. Journal of fluency disorders 25:99–117, 2000.CrossRef Geetha, Y. V., Pratibha, K., Ashok, R., and Ravindra, S. K., Classification of childhood disfluencies using neural networks. Journal of fluency disorders 25:99–117, 2000.CrossRef
11.
go back to reference Czyzewski, A., Kaczmarek, A., and Kostek, B., Intelligent processing of stuttered speech. Journal of Intelligent Information Systems 21:143–171, 2003.CrossRef Czyzewski, A., Kaczmarek, A., and Kostek, B., Intelligent processing of stuttered speech. Journal of Intelligent Information Systems 21:143–171, 2003.CrossRef
12.
go back to reference P. B., Acoustic Measures In The Speech of Children with Stuttering and Normal Non Fluency- A Key to Differential Diagnosis. in Workshop on Spoken Language Processing Mumbai, India, January 9–11, 2003. P. B., Acoustic Measures In The Speech of Children with Stuttering and Normal Non Fluency- A Key to Differential Diagnosis. in Workshop on Spoken Language Processing Mumbai, India, January 9–11, 2003.
13.
go back to reference Szczurowska, I., Kuniszyk-Jozkowiak, W., and Smolka, E., The application of Kohonen and Multilayer Perceptron Networks in the speech nonfluency analysis. Archives of Acoustics 31:205, 2006. Szczurowska, I., Kuniszyk-Jozkowiak, W., and Smolka, E., The application of Kohonen and Multilayer Perceptron Networks in the speech nonfluency analysis. Archives of Acoustics 31:205, 2006.
14.
go back to reference Wiśniewski, M., Kuniszyk-Jóźkowiak, W., Smołka, E., and Suszyński, W., Automatic detection of disorders in a continuous speech with the hidden markov models approach. In Computer Recognition Systems 2. vol. 45/2008: Springer Berlin/Heidelberg, October 18, 2007, pp. 445–453. Wiśniewski, M., Kuniszyk-Jóźkowiak, W., Smołka, E., and Suszyński, W., Automatic detection of disorders in a continuous speech with the hidden markov models approach. In Computer Recognition Systems 2. vol. 45/2008: Springer Berlin/Heidelberg, October 18, 2007, pp. 445–453.
15.
go back to reference Wiśniewski, M., Kuniszyk-Jóźkowiak, W., Smołka, E., and Suszyński, W., Automatic detection of prolonged fricative phonemes with the hidden markov models approach. Journal of Medical Informatics & Technologies 11:2007, 2007. Wiśniewski, M., Kuniszyk-Jóźkowiak, W., Smołka, E., and Suszyński, W., Automatic detection of prolonged fricative phonemes with the hidden markov models approach. Journal of Medical Informatics & Technologies 11:2007, 2007.
16.
go back to reference Świetlicka, I., Kuniszyk-Jóźkowiak, W., and Smołka, E., Artificial neural networks in the disabled speech analysis. In Computer Recognition System 3. vol. 57/2009: Springer Berlin/Heidelberg, May 12, 2009, pp. 347–354. Świetlicka, I., Kuniszyk-Jóźkowiak, W., and Smołka, E., Artificial neural networks in the disabled speech analysis. In Computer Recognition System 3. vol. 57/2009: Springer Berlin/Heidelberg, May 12, 2009, pp. 347–354.
17.
go back to reference Sin Chee, L., Chia Ai, O., Hariharan, M., and Yaacob, S., MFCC based recognition of repetitions and prolongations in stuttered speech using k-NN and LDA. In Student Conference on Research and Development (SCOReD 2009), 2009, pp. 146–149. Sin Chee, L., Chia Ai, O., Hariharan, M., and Yaacob, S., MFCC based recognition of repetitions and prolongations in stuttered speech using k-NN and LDA. In Student Conference on Research and Development (SCOReD 2009), 2009, pp. 146–149.
18.
go back to reference Sin Chee, L., Chia Ai, O., Hariharan, M., and Yaacob, S., Automatic detection of prolongations and repetitions using LPCC. In Technical Postgraduates (TECHPOS), 2009 International Conference for, 2009, pp. 1–4. Sin Chee, L., Chia Ai, O., Hariharan, M., and Yaacob, S., Automatic detection of prolongations and repetitions using LPCC. In Technical Postgraduates (TECHPOS), 2009 International Conference for, 2009, pp. 1–4.
19.
go back to reference Howell, P., Davis, S., and Bartrip, J., The UCLASS archive of stuttered speech. Journal of speech, language, and hearing research: JSLHR, 2009. Howell, P., Davis, S., and Bartrip, J., The UCLASS archive of stuttered speech. Journal of speech, language, and hearing research: JSLHR, 2009.
20.
go back to reference Howell, P., and Huckvale, M., Facilities to assist people to research into stammered speech. Stammering research: an on-line journal published by the British Stammering Association 1:130, 2004. Howell, P., and Huckvale, M., Facilities to assist people to research into stammered speech. Stammering research: an on-line journal published by the British Stammering Association 1:130, 2004.
21.
go back to reference Huang, X., Acero, A., and Hon, H., Spoken language processing: A guide to theory, algorithm, and system development. Prentice Hall PTR Upper Saddle River, USA, 2001. Huang, X., Acero, A., and Hon, H., Spoken language processing: A guide to theory, algorithm, and system development. Prentice Hall PTR Upper Saddle River, USA, 2001.
22.
go back to reference Rabiner, L., and Juang, B., Fundamentals of speech recognition: Prentice hall, 1993. Rabiner, L., and Juang, B., Fundamentals of speech recognition: Prentice hall, 1993.
23.
go back to reference Antoniol, G., Rollo, V. F., and Venturi, G., Linear predictive coding and cepstrum coefficients for mining time variant information from software repositories. In Proceedings of the 2005 international workshop on Mining software repositories St. Louis, Missouri: ACM, 2005. Antoniol, G., Rollo, V. F., and Venturi, G., Linear predictive coding and cepstrum coefficients for mining time variant information from software repositories. In Proceedings of the 2005 international workshop on Mining software repositories St. Louis, Missouri: ACM, 2005.
24.
go back to reference Hariharan, M., Sin Chee, L., and Yaacob, S., Analysis of infant cry through weighted linear prediction cepstral coefficients and probabilistic neural network. Journal of Medical Systems, pp. 1–7, 2010. Hariharan, M., Sin Chee, L., and Yaacob, S., Analysis of infant cry through weighted linear prediction cepstral coefficients and probabilistic neural network. Journal of Medical Systems, pp. 1–7, 2010.
25.
go back to reference Ting, H. N., Yunus, J., and Salleh, S., Speaker-independent malay syllable recognition using singular and modular neural networks. Jurnal Teknologi D (35D), pp. 65–76, 2001. Ting, H. N., Yunus, J., and Salleh, S., Speaker-independent malay syllable recognition using singular and modular neural networks. Jurnal Teknologi D (35D), pp. 65–76, 2001.
26.
go back to reference Mahadeva Prasanna, S. R., Gupta, C. S., and Yegnanarayana, B., Extraction of speaker-specific excitation information from linear prediction residual of speech. Speech Communication 48:1243–1261, 2006.CrossRef Mahadeva Prasanna, S. R., Gupta, C. S., and Yegnanarayana, B., Extraction of speaker-specific excitation information from linear prediction residual of speech. Speech Communication 48:1243–1261, 2006.CrossRef
27.
go back to reference Pallabi, P., and Bhavani, T., Face recognition using multiple classifiers. In Tools with Artificial Intelligence, 2006. ICTAI '06. 18th IEEE International Conference on, 2006, pp. 179–186. Pallabi, P., and Bhavani, T., Face recognition using multiple classifiers. In Tools with Artificial Intelligence, 2006. ICTAI '06. 18th IEEE International Conference on, 2006, pp. 179–186.
28.
go back to reference Liu, C.-L., Lee, C.-H., and Lin, P.-M., A fall detection system using k-nearest neighbor classifier. Expert Systems with Applications, vol. In Press, Corrected Proof, 2010. Liu, C.-L., Lee, C.-H., and Lin, P.-M., A fall detection system using k-nearest neighbor classifier. Expert Systems with Applications, vol. In Press, Corrected Proof, 2010.
29.
go back to reference Park, C. H., and Park, H., A comparison of generalized linear discriminant analysis algorithms. Pattern Recognition 41:1083–1097, 2008.MATHCrossRef Park, C. H., and Park, H., A comparison of generalized linear discriminant analysis algorithms. Pattern Recognition 41:1083–1097, 2008.MATHCrossRef
30.
go back to reference Gajsek, R., and Miheliˇc, F., Comparison of speech parameterization techniques for Slovenian language. In 9th International PhD Workshop on Systems and Control: Young Generation Viewpoint Izola, Slovenia, 2008. Gajsek, R., and Miheliˇc, F., Comparison of speech parameterization techniques for Slovenian language. In 9th International PhD Workshop on Systems and Control: Young Generation Viewpoint Izola, Slovenia, 2008.
31.
go back to reference Watts, D. M. G., Speaker identification-prototype development and performance. In Faculty of Engineering & Surveying. vol. Bachelor of Engineering (Computer Systems)/Bachelor of Information Technology (Applied Computer Science): University of Southern Queensland, 2006, p. 116. Watts, D. M. G., Speaker identification-prototype development and performance. In Faculty of Engineering & Surveying. vol. Bachelor of Engineering (Computer Systems)/Bachelor of Information Technology (Applied Computer Science): University of Southern Queensland, 2006, p. 116.
Metadata
Title
Classification of Speech Dysfluencies Using LPC Based Parameterization Techniques
Authors
M. Hariharan
Lim Sin Chee
Ooi Chia Ai
Sazali Yaacob
Publication date
01-06-2012
Publisher
Springer US
Published in
Journal of Medical Systems / Issue 3/2012
Print ISSN: 0148-5598
Electronic ISSN: 1573-689X
DOI
https://doi.org/10.1007/s10916-010-9641-6

Other articles of this Issue 3/2012

Journal of Medical Systems 3/2012 Go to the issue