Skip to main content
Top
Published in: European Archives of Oto-Rhino-Laryngology 10/2004

01-11-2004 | Laryngology

Objective evaluation of the quality of substitution voices

Authors: Mieke Moerman, Glenn Pieters, Jean-Pierre Martens, Marie-Jeanne Van der Borgt, Phillippe Dejonckere

Published in: European Archives of Oto-Rhino-Laryngology | Issue 10/2004

Login to get access

Abstract

This paper describes our first attempts to develop a method for the objective assessment of quality in substitution voices. The objective analysis deals with acoustic parameters characterising short voice and speech samples like a sequence of isolated vowels, a sequence of VCV and CVCVCV syllables, a short sentence, etc. A database of 113 registrations from 68 patients (53 total laryngectomy patients with tracheo-esophageal speech, 14 total laryngectomy patients with esophageal speech and 5 patients with partial frontolateral laryngectomy) and 6 registrations from healthy control persons was collected. Each registration consisted of seven speech utterances and was subjected to an acoustic analysis as well as to a perceptual evaluation, the latter involving eight parameters like “overall impression”, “tonicity”, etc. Since the goal of our work is to find out the best acoustical measurement for supporting perception and making it precise, it seemed logical to strive for a perceptually based acoustic analysis. We therefore performed the analysis by means of a peripheral auditory model with a built-in fundamental frequency (pitch) extractor. From the frame-level outputs (a frame is 10 ms) of the analyser, global objective parameters, such as (1) the percentage of voiced frames, (2) the average voicing evidence, (3) the voicing length distribution and (4) the fundamental frequency jitter, were computed for the different speech utterances. So as to reduce the parameter variability arising from the nature of the speech utterances (e.g., the presence of pauses in the signal, errors caused by the pitch extractor, etc.), the objective parameters were computed using non-standard averaging schemes involving energy weighting and frame selection. A statistical analysis of the objective parameters confirms that the quality of tracheo-esophageal speech is superior to that of esophageal speech, but inferior to that of normal speech and speech with the preservation of one vocal fold. Correlations between the objective parameters and the perceptual parameters are moderate.
Literature
1.
go back to reference De Bodt M (1997) Een onderzoeksmodel voor stemevaluatie. De relatie tussen subjectieve en objectieve parameters in de beoordeling van de normale en pathologische stemfunctie. Doctoral thesis, Universiteit Antwerpen De Bodt M (1997) Een onderzoeksmodel voor stemevaluatie. De relatie tussen subjectieve en objectieve parameters in de beoordeling van de normale en pathologische stemfunctie. Doctoral thesis, Universiteit Antwerpen
2.
go back to reference Dejonckere PH, Bradley P, Clemente P, Cornut G, Crevier-Buchman L, Friedrich G, Van De Heyning P, Remacle M, Woisard V (2001) A basic protocol for functional assessment of voice pathology, especially for investigating the efficacy of (phonosurgical) treatments and evaluating new assessments techniques. Guideline elaborated by the Committee on Phoniatrics of the European Laryngological Society (ELS). Eur Arch Otorhinolaryngol 258:77–82PubMed Dejonckere PH, Bradley P, Clemente P, Cornut G, Crevier-Buchman L, Friedrich G, Van De Heyning P, Remacle M, Woisard V (2001) A basic protocol for functional assessment of voice pathology, especially for investigating the efficacy of (phonosurgical) treatments and evaluating new assessments techniques. Guideline elaborated by the Committee on Phoniatrics of the European Laryngological Society (ELS). Eur Arch Otorhinolaryngol 258:77–82PubMed
3.
go back to reference Wuyts F, De Bodt M, Molenberghs G, Remacle M, Heylen L, Millet B, Van Lierde K, Raes J, Van Den Heyning P (2000) The Dysphonia Severity Index: an objective measure of vocal quality based on a multiparameter approach. J Speech Lang Hear Res 43:796–809PubMed Wuyts F, De Bodt M, Molenberghs G, Remacle M, Heylen L, Millet B, Van Lierde K, Raes J, Van Den Heyning P (2000) The Dysphonia Severity Index: an objective measure of vocal quality based on a multiparameter approach. J Speech Lang Hear Res 43:796–809PubMed
4.
go back to reference Van As CJ, Hilgers FJM, Verdonck-de Leeuw IM, Koopmans-van Beinum FJ (1998) Acoustical analysis and perceptual evaluation of tracheoesophageal prosthetic voice. J Voice 12:239–248PubMed Van As CJ, Hilgers FJM, Verdonck-de Leeuw IM, Koopmans-van Beinum FJ (1998) Acoustical analysis and perceptual evaluation of tracheoesophageal prosthetic voice. J Voice 12:239–248PubMed
5.
go back to reference Van As CJ. Tracheo-esophageal speech: a multidimensional assessment of voice quality. Doctoral thesis Nieuwegein, Budde-Elinkwijk Van As CJ. Tracheo-esophageal speech: a multidimensional assessment of voice quality. Doctoral thesis Nieuwegein, Budde-Elinkwijk
6.
go back to reference Boersma P, Weeninck D (1996) Praat: a system for doing phonetics by computer. Institute of Phonetic Sciences, University Amsterdam, report 132 Boersma P, Weeninck D (1996) Praat: a system for doing phonetics by computer. Institute of Phonetic Sciences, University Amsterdam, report 132
7.
go back to reference Crevier-Buchman L, Laccourreye O, Papon JF, Monfrais-Pfauwadel MC, Brasnu D (1996) Apports et limites de l’analyse acoustique de la voix et de la parole alaryngée au moyen d’un système informatique. Ann Otolaryngol Chir Cervicofac 113:61–68PubMed Crevier-Buchman L, Laccourreye O, Papon JF, Monfrais-Pfauwadel MC, Brasnu D (1996) Apports et limites de l’analyse acoustique de la voix et de la parole alaryngée au moyen d’un système informatique. Ann Otolaryngol Chir Cervicofac 113:61–68PubMed
8.
go back to reference Van Immerseel LM, Martens JP (1992) Pitch and voiced unvoiced determination with an auditory model. J Acoust Soc Am 91:3511–3526PubMed Van Immerseel LM, Martens JP (1992) Pitch and voiced unvoiced determination with an auditory model. J Acoust Soc Am 91:3511–3526PubMed
9.
go back to reference Buhmann J, Caspers J, van Heven V, Hoekstra H, Martens J.P, Swerts M (2002) Annotation of prominent words, prosodic boundaries and segmental lengthening by non-expert transcribers in the Spoken Dutch Corpus. Procs LREC-2002, Las Palmas, pp 779–785 Buhmann J, Caspers J, van Heven V, Hoekstra H, Martens J.P, Swerts M (2002) Annotation of prominent words, prosodic boundaries and segmental lengthening by non-expert transcribers in the Spoken Dutch Corpus. Procs LREC-2002, Las Palmas, pp 779–785
10.
go back to reference Debruyne F, Delaere P, Wouters J, Uwents P (1994) Acoustic analysis of tracheo-esophageal versus esophageal speech. J Laryngol Otol 108:325–328PubMed Debruyne F, Delaere P, Wouters J, Uwents P (1994) Acoustic analysis of tracheo-esophageal versus esophageal speech. J Laryngol Otol 108:325–328PubMed
11.
go back to reference Bertino G, Bellomo A, Miani C, Ferrero F, Staffieri A (1996) Spectrographic differences between tracheo-esophageal and esophageal voice. Folia Phoniatrica et Logop 48:255–261 Bertino G, Bellomo A, Miani C, Ferrero F, Staffieri A (1996) Spectrographic differences between tracheo-esophageal and esophageal voice. Folia Phoniatrica et Logop 48:255–261
12.
go back to reference Robbins J (1984) Acoustic differentiation of laryngeal, esophageal and tracheoesophageal speech. J Speech Hear Res 27:577–585PubMed Robbins J (1984) Acoustic differentiation of laryngeal, esophageal and tracheoesophageal speech. J Speech Hear Res 27:577–585PubMed
13.
go back to reference Kreiman J, Gerrat BR, Kempster GB, Erman A, Berke GS (1993) Perceptual evaluationof voice quality: review, tutorial, and a framework for future research. J Speech Hear Res 36:21–40PubMed Kreiman J, Gerrat BR, Kempster GB, Erman A, Berke GS (1993) Perceptual evaluationof voice quality: review, tutorial, and a framework for future research. J Speech Hear Res 36:21–40PubMed
14.
go back to reference Abe H, Yonekawa H, Ohta F, Imaizumi S (1986) Reproducablity of hoarse voice psychoacoustic evaluation. Jpn J Logoped Phoniat 27:168–177 Abe H, Yonekawa H, Ohta F, Imaizumi S (1986) Reproducablity of hoarse voice psychoacoustic evaluation. Jpn J Logoped Phoniat 27:168–177
Metadata
Title
Objective evaluation of the quality of substitution voices
Authors
Mieke Moerman
Glenn Pieters
Jean-Pierre Martens
Marie-Jeanne Van der Borgt
Phillippe Dejonckere
Publication date
01-11-2004
Publisher
Springer-Verlag
Published in
European Archives of Oto-Rhino-Laryngology / Issue 10/2004
Print ISSN: 0937-4477
Electronic ISSN: 1434-4726
DOI
https://doi.org/10.1007/s00405-003-0681-0

Other articles of this Issue 10/2004

European Archives of Oto-Rhino-Laryngology 10/2004 Go to the issue