DOI: 10.1145/1899503.1899552
short-paper

Acoustic modelling of Sepedi affricates for ASR

Published: 11 October 2010

ABSTRACT

Automatic speech recognition (ASR) systems are increasingly being developed for under-resourced languages, especially for use in multilingual spoken dialogue systems. We investigate different approaches to the acoustic modelling of Sepedi affricates for ASR. We find that several of these complex consonants can be modelled as sequences of much simpler sounds. This approach reduces the Sepedi phoneme inventory from 45 to 32 phonemes, which simplifies dictionary development and transcription and yields more accurate acoustic modelling.
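As a rough illustration of the idea in the abstract, the sketch below rewrites a toy pronunciation dictionary so that each affricate is replaced by a sequence of simpler phones, then compares the phone inventory before and after. Everything in it is an assumption made for illustration: the phone symbols, the `AFFRICATE_SPLITS` mapping, the toy lexicon, and the function names are hypothetical and are not taken from the paper or from any Sepedi phoneme set.

```python
from typing import Dict, List, Set

# Hypothetical mapping from affricate symbols to simpler phone sequences.
# The symbols are placeholders, not the paper's actual Sepedi inventory.
AFFRICATE_SPLITS: Dict[str, List[str]] = {
    "tS": ["t", "S"],
    "ts": ["t", "s"],
}


def split_affricates(phones: List[str],
                     splits: Dict[str, List[str]] = AFFRICATE_SPLITS) -> List[str]:
    """Replace each affricate in a pronunciation with its component phones."""
    result: List[str] = []
    for phone in phones:
        # Phones without an entry in the mapping are kept unchanged.
        result.extend(splits.get(phone, [phone]))
    return result


def rewrite_lexicon(lexicon: Dict[str, List[str]],
                    splits: Dict[str, List[str]] = AFFRICATE_SPLITS) -> Dict[str, List[str]]:
    """Apply the affricate split to every entry of a pronunciation dictionary."""
    return {word: split_affricates(pron, splits) for word, pron in lexicon.items()}


def inventory(lexicon: Dict[str, List[str]]) -> Set[str]:
    """Collect the set of distinct phones used across the dictionary."""
    return {phone for pron in lexicon.values() for phone in pron}


# Toy dictionary with made-up entries, used only to exercise the functions.
toy_lexicon = {
    "word_a": ["tS", "a", "t", "a"],
    "word_b": ["S", "a", "ts", "a"],
    "word_c": ["s", "a"],
}

simplified = rewrite_lexicon(toy_lexicon)
print(sorted(inventory(toy_lexicon)))
print(sorted(inventory(simplified)))
```

In this toy case the inventory shrinks because the component phones already occur elsewhere in the dictionary, so the split removes symbols without adding new ones. Whether such a split also improves recognition accuracy is an empirical question that the code does not address.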


Published in

SAICSIT '10: Proceedings of the 2010 Annual Research Conference of the South African Institute of Computer Scientists and Information Technologists
October 2010
447 pages
ISBN: 9781605589503
DOI: 10.1145/1899503

Copyright © 2010 ACM


Publisher

Association for Computing Machinery, New York, NY, United States


Acceptance Rates

Overall Acceptance Rate: 187 of 439 submissions, 43%
