skip to main content
article
Free Access

Jitter estimation algorithms for detection of pathological voices

Published:01 January 2009Publication History
Skip Abstract Section

Abstract

This work is focused on the evaluation of different methods to estimate the amount of jitter present in speech signals. The jitter value is a measure of the irregularity of a quasiperiodic signal and is a good indicator of the presence of pathologies in the larynx such as vocal fold nodules or a vocal fold polyp. Given the irregular nature of the speech signal, each jitter estimation algorithm relies on its own model making a direct comparison of the results very difficult. For this reason, the evaluation of the different jitter estimation methods was target on their ability to detect pathological voices. Two databases were used for this evaluation: a subset of the MEEI database and a smaller database acquired in the scope of this work. The results showed that there were significant differences in the performance of the algorithms being evaluated. Surprisingly, in the largest database the best results were not achieved with the commonly used relative jitter, measured as a percentage of the glottal cycle, but with absolute jitter values measured in microseconds. Also, the new proposed measure for jitter, LocJitt, performed in general is equal to or better than the commonly used tools of MDVP and Praat.

References

  1. J. P. Dworkin and R. J. Meleca, Vocal Pathologies: Diagnosis, Treatment & Case Studies, Singular, San Diego, Calif, USA, 1996.Google ScholarGoogle Scholar
  2. J. Kreiman, B. R. Gerratt, G. B. Kempster, A. Erman, and G. S. Berke, "Perceptual evaluation of voice quality: review, tutorial, and a framework for future research," Journal of Speech and Hearing Research, vol. 36, no. 1, pp. 21-40, 1993.Google ScholarGoogle ScholarCross RefCross Ref
  3. "Multi-Dimensional Voice Program, Model 5105".Google ScholarGoogle Scholar
  4. P. Boersma and D. Weenink, "Praat, a system for doing phonetics by computer," Glot International, vol. 5, pp. 341-345, 2001.Google ScholarGoogle Scholar
  5. J. Schoentgen, "Stochastic models of jitter," Journal of the Acoustical Society of America, vol. 109, no. 4, pp. 1631-1650, 2001.Google ScholarGoogle ScholarCross RefCross Ref
  6. O. Amir, M. Wolf, and N. Amir, "A clinical comparison between two acoustic analysis softwares: MDVP and Praat," Biomedical Signal Processing and Control, vol. 4, no. 3, pp. 202-205, 2009.Google ScholarGoogle ScholarCross RefCross Ref
  7. J. I. Godino-Llorente and P. Gomez-Vilda, "Automatic detection of voice impairments by means of short-term cepstral parameters and neural network based detectors," IEEE Transactions on Biomedical Engineering, vol. 51, no. 2, pp. 380-384, 2004.Google ScholarGoogle ScholarCross RefCross Ref
  8. R. J. Moran, R. B. Reilly, P. de Chazal, and P. D. Lacy, "Telephony-based voice pathology assessment using automated speech analysis," IEEE Transactions on Biomedical Engineering, vol. 53, no. 3, pp. 468-477, 2006.Google ScholarGoogle ScholarCross RefCross Ref
  9. P. Gómez-Vilda, R. Fernández-Baillo, V. Rodellar-Biarge, et al., "Glottal source biometrical signature for voice pathology detection," Speech Communication, vol. 50, no. 9, pp. 759-781, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. D. Wong, M. R. Ito, N. B. Cox, and I. R. Titze, "Observation of perturbations in a lumped-element model of the vocal folds with application to some pathological cases," The Journal of the Acoustical Society of America, vol. 89, no. 1, pp. 383-394, 1991.Google ScholarGoogle ScholarCross RefCross Ref
  11. L. Lehto, M. Airas, E. Björkner, J. Sundberg, and P. Alku, "Comparison of two inverse filtering methods in parameterization of the glottal closing phase characteristics in different phonation types," The Journal of Voice, vol. 21, no. 2, pp. 138-150, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  12. B. S. Atal and S. L. Hanauer, "Speech analysis and synthesis by linear prediction of the speech wave," The Journal of the Acoustical Society of America, vol. 50, no. 2B, pp. 637-655, 1971.Google ScholarGoogle ScholarCross RefCross Ref
  13. "Disordered Voice Database and Program, Model 4337," 1994.Google ScholarGoogle Scholar
  14. A. Kounoudes, P. Naylor, and M. Brookes, "The DYPSA algorithm for estimation of glottal closure instants in voiced speech," in Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'02), vol. 1, pp. 349-352, Orlando, Fla, USA, May 2002.Google ScholarGoogle Scholar
  15. M. Brookes, "VOICEBOX: Speech Processing Toolbox for MATLAB," 2003.Google ScholarGoogle Scholar
  16. M. Vasilakis and Y. Stylianou, "A mathematical model for accurate measurement of jitter," in Proceedings of the 5th International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, Firenze University Press, Firenze, Italy, December 2007.Google ScholarGoogle Scholar

Index Terms

  1. Jitter estimation algorithms for detection of pathological voices

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in

          Full Access

          • Published in

            cover image EURASIP Journal on Advances in Signal Processing
            EURASIP Journal on Advances in Signal Processing  Volume 2009, Issue
            Special issue on analysis and signal processing of oesophageal and pathological voices
            January 2009
            128 pages

            Publisher

            Hindawi Limited

            London, United Kingdom

            Publication History

            • Accepted: 30 June 2009
            • Revised: 15 April 2009
            • Published: 1 January 2009
            • Received: 27 November 2008

            Qualifiers

            • article

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader