short-paper

Acoustic modelling of Sepedi affricates for ASR

Authors:
Thipe Modipa

University of Pretoria

University of Pretoria
View Profile

,
Marelie Davel

Meraka Institute, CSIR

Meraka Institute, CSIR
View Profile

,
Febe de Wet

Meraka Institute, CSIR

Meraka Institute, CSIR
View Profile

SAICSIT '10: Proceedings of the 2010 Annual Research Conference of the South African Institute of Computer Scientists and Information TechnologistsOctober 2010Pages 394–398https://doi.org/10.1145/1899503.1899552

Published:11 October 2010Publication History

SAICSIT '10: Proceedings of the 2010 Annual Research Conference of the South African Institute of Computer Scientists and Information Technologists

Pages 394–398

ABSTRACT

Automatic speech recognition (ASR) systems are increasingly being developed for under-resourced languages, especially for use in multilingual spoken dialogue systems. We investigate different approaches to the acoustic modelling of Sepedi affricates for ASR. We determine that it is possible to model various of these complex consonants as a sequence of much simpler sounds. This approach reduces the Sepedi phoneme inventory from 45 to 32, resulting in simpler dictionary development and transcription processes, as well as more accurate acoustic modelling.

References

E. Barnard, M. Davel, and C. van Heerden. ASR corpus design for resource-scarce languages. In Proc. Interspeech, pages 2847--2850, Brighton, UK, Sept. 2009.Google Scholar
N. G. Clements and E. Hume. The handbook of phonological theory, chapter The internal organization of speech sounds, pages 245--306. Blackwell, 1995.Google Scholar
M. Davel and O. Martirosian. Pronunciation dictionary development in resource-scarce environments. In Proc. Interspeech, pages 2851--2854, Brighton, UK, Sept. 2009.Google Scholar
M. H. Davel and E. Barnard. A unified phoneme set for the south african languages. in prep.Google Scholar
P. Lehohla. Census 2001: Census in brief. Statistics South Africa, 2003.Google Scholar
T. M. Modiba. Aspects of automatic speech recognition with respect to Northern Sotho. Master's thesis, University of the North, South Africa, 2004.Google Scholar
J. Roux, E. Botha, and J. du Preez. Developing a multilingual telephone based information system in african languages. In Proc. LREC, pages 975--980, Athens, Greece, June 2000.Google Scholar
C. van Heerden, E. Barnard, and M. Davel. Basic speech recognition for spoken dialogues. In Proc. Interspeech, pages 3003--3006, Brighton, UK, Sept. 2009.Google Scholar
D. van Niekerk and E. Barnard. Phonetic alignment for speech synthesis in under-resourced languages. In Proc. Interspeech, pages 880--883, Brighton, UK, Sept. 2009.Google Scholar
S. Zerbian. Onset consonants in Tswana: CW-sequences and affricates. 2009.Google Scholar

Index Terms

Acoustic modelling of Sepedi affricates for ASR
1. Applied computing
  1. Arts and humanities
2. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing
      1. Speech recognition

Recommendations

Improving Acoustic Models with Captioned Multimedia Speech
ICMCS '99: Proceedings of the IEEE International Conference on Multimedia Computing and Systems - Volume 2

Speech recognition can be used to create searchable transcripts for audio indexing in digital video libraries. Large amounts of hand-transcribed speech training data are required to build or improve acoustic models of highly accurate speech recognition ...
Read More
Consonant gemination in Italian: The affricate and fricative case
Highlights
- Consonant duration is the primary acoustic cue of gemination in intervocalic italian fricatives.
Abstract
Consonant gemination in Italian affricates and fricatives was investigated, completing the overall study of gemination of Italian consonants. Results of the analysis of other consonant categories, i.e. stops, nasals, and liquids, ...
Read More
European Portuguese Accent in Acoustic Models for Non-native English Speakers
Progress in Pattern Recognition, Image Analysis and Applications
Abstract
The development of automatic speech recognition systems poses several known difficulties. One of them concerns the recognizer’s accuracy when dealing with non-native speakers of a given language. Normally a recognizer precision is lower for non-...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
SAICSIT '10: Proceedings of the 2010 Annual Research Conference of the South African Institute of Computer Scientists and Information Technologists
October 2010
447 pages
ISBN:9781605589503
DOI:10.1145/1899503
Conference Chair:
Paula Kotzé
CSIR Meraka Institute, Pretoria, South Africa
,
Program Chairs:
Alta van der Merwe
CSIR Meraka Institute, Pretoria, South Africa
,
Aurona Gerber
CSIR Meraka Institute, Pretoria, South Africa
Copyright © 2010 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 11 October 2010
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
Sepedi
acoustic models
affricates
phonemes
Qualifiers
- short-paper
Conference

Acceptance Rates
Overall Acceptance Rate187of439submissions,43%
Upcoming Conference
HT '24

Sponsor:

sigweb

35th ACM Conference on Hypertext and Social Media

September 10 - 13, 2024

Poznan , Poland
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 4
  Total Citations
  View Citations
- 101
  Total Downloads
- Downloads (Last 12 months)3
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Acoustic modelling of Sepedi affricates for ASR

SAICSIT '10: Proceedings of the 2010 Annual Research Conference of the South African Institute of Computer Scientists and Information Technologists

ABSTRACT

References

Cited By

Index Terms

Recommendations

Improving Acoustic Models with Captioned Multimedia Speech

Consonant gemination in Italian: The affricate and fricative case

European Portuguese Accent in Acoustic Models for Non-native English Speakers

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Upcoming Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media