Sphinx-4: a flexible open source framework for speech recognition
March 2004 Technical Report
Publisher:
  • Sun Microsystems, Inc.
  • An Imprint of Prentice Hall PTR, 2500 Garcia Avenue, Mountain View, CA
  • United States
Published: 01 March 2004
Pages: 18
Abstract

Sphinx-4 is a flexible, modular and pluggable framework to help foster new innovations in the core research of hidden Markov model (HMM) speech recognition systems. The design of Sphinx-4 is based on patterns that have emerged from the design of past systems as well as new requirements based on areas that researchers currently want to explore. To exercise this framework, and to provide researchers with a "research-ready" system, Sphinx-4 also includes several implementations of both simple and state-of-the-art techniques. The framework and the implementations are all freely available via open source.
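The pluggability the abstract describes is exercised through an XML configuration whose named components are wired together at runtime by a configuration manager. The Java sketch below, in the spirit of the framework's HelloWorld demo, shows roughly how such a recognizer is assembled and run; the class name, configuration file name, and component names ("recognizer", "microphone") are illustrative assumptions that depend on the configuration file, not fixed names from the report.

import java.net.URL;

import edu.cmu.sphinx.frontend.util.Microphone;
import edu.cmu.sphinx.recognizer.Recognizer;
import edu.cmu.sphinx.result.Result;
import edu.cmu.sphinx.util.props.ConfigurationManager;

public class HelloWorldSketch {
    public static void main(String[] args) throws Exception {
        // Hypothetical XML configuration wiring together the pluggable
        // components (front end, acoustic model, dictionary, language
        // model, search manager); the file name is an assumption.
        URL configUrl = HelloWorldSketch.class.getResource("helloworld.config.xml");
        ConfigurationManager cm = new ConfigurationManager(configUrl);

        // Component names must match the names declared in the
        // configuration file.
        Recognizer recognizer = (Recognizer) cm.lookup("recognizer");
        Microphone microphone = (Microphone) cm.lookup("microphone");

        // Allocate resources: loads the acoustic model, dictionary,
        // and language model declared in the configuration.
        recognizer.allocate();

        if (microphone.startRecording()) {
            while (true) {
                // Decode one utterance and print the best hypothesis,
                // with filler words (e.g. silence) stripped out.
                Result result = recognizer.recognize();
                if (result != null) {
                    System.out.println(result.getBestFinalResultNoFiller());
                }
            }
        }
    }
}

Because the components are looked up by name, swapping in a different front end, acoustic model, or search manager is a matter of editing the configuration file rather than recompiling the application.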

Cited By

  1. Jia W, Zhang J, Shan J and Ding X Making Dynamic Page Coalescing Effective on Virtualized Clouds Proceedings of the Eighteenth European Conference on Computer Systems, (298-313)
  2. Novoa J, Mahu R, Wuth J, Escudero J, Fredes J and Yoma N (2021). Automatic Speech Recognition for Indoor HRI Scenarios, ACM Transactions on Human-Robot Interaction, 10:2, (1-30), Online publication date: 1-May-2021.
  3. Preum S, Shu S, Hotaki M, Williams R, Stankovic J and Alemzadeh H (2019). CognitiveEMS, ACM SIGBED Review, 16:2, (51-60), Online publication date: 16-Aug-2019.
  4. Lee C, Lee H, Wu S, Liu C, Fang W, Hsu J and Tseng B (2019). Machine Comprehension of Spoken Content, IEEE/ACM Transactions on Audio, Speech and Language Processing, 27:9, (1469-1480), Online publication date: 1-Sep-2019.
  5. Baumann T, Köhn A and Hennig F (2019). The Spoken Wikipedia Corpus collection, Language Resources and Evaluation, 53:2, (303-329), Online publication date: 1-Jun-2019.
  6. Lee H, Chung P, Wu Y, Lin T and Wen T (2018). Interactive Spoken Content Retrieval by Deep Reinforcement Learning, IEEE/ACM Transactions on Audio, Speech and Language Processing, 26:12, (2447-2459), Online publication date: 1-Dec-2018.
  7. Ramunyisi N, Badenhorst J, Moors C and Gumede T Rapid development of a command and control interface for smart office environments Proceedings of the Annual Conference of the South African Institute of Computer Scientists and Information Technologists, (188-194)
  8. Kuppusamy K and Aghila G (2018). HuMan, Universal Access in the Information Society, 17:4, (841-864), Online publication date: 1-Nov-2018.
  9. Shrivastav S, Kumar S and Kumar K (2017). Towards an ontology based framework for searching multimedia contents on the web, Multimedia Tools and Applications, 76:18, (18657-18686), Online publication date: 1-Sep-2017.
  10. Yazdani R, Arnau J and González A UNFOLD Proceedings of the 50th Annual IEEE/ACM International Symposium on Microarchitecture, (69-81)
  11. Noronha B, Dziemian S, Zito G, Konnaris C and Faisal A “Wink to grasp” — comparing eye, voice & EMG gesture control of grasp with soft-robotic gloves 2017 International Conference on Rehabilitation Robotics (ICORR), (1043-1048)
  12. Necibi K, Frihia H and Bahi H On The Use of Decision Trees for Arabic Pronunciation Assessment Proceedings of the International Conference on Intelligent Information Processing, Security and Advanced Communication, (1-6)
  13. Limerick H, Moore J and Coyle D Empirical Evidence for a Diminished Sense of Agency in Speech Interfaces Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems, (3967-3970)
  14. Odriozola I, Serrano L, Hernaez I and Navas E The AhoSR Automatic Speech Recognition System Proceedings of the Second International Conference on Advances in Speech and Language Technologies for Iberian Languages - Volume 8854, (279-288)
  15. Michalevsky Y, Boneh D and Nakibly G Gyrophone Proceedings of the 23rd USENIX conference on Security Symposium, (1053-1067)
  16. Reindl K, Zheng Y, Schwarz A, Meier S, Maas R, Sehr A and Kellermann W (2013). A stereophonic acoustic signal extraction scheme for noisy and reverberant environments, Computer Speech and Language, 27:3, (726-745), Online publication date: 1-May-2013.
  17. Nirjon S, Dickerson R, Asare P, Li Q, Hong D, Stankovic J, Hu P, Shen G and Jiang X Auditeur Proceeding of the 11th annual international conference on Mobile systems, applications, and services, (403-416)
  18. Yu M, Vajda P, Chen D, Tsai S, Daneshi M, Araujo A, Chen H and Girod B EigenNews Proceedings of the 21st ACM international conference on Multimedia, (463-464)
  19. Wiebusch D, Fischbach M, Latoschik M and Tramberend H Evaluating scala, actors, & ontologies for intelligent realtime interactive systems Proceedings of the 18th ACM symposium on Virtual reality software and technology, (153-160)
  20. Hoste L, Dumas B and Signer B SpeeG Proceedings of the International Working Conference on Advanced Visual Interfaces, (156-163)
  21. Ward N, Vega A and Baumann T (2012). Prosodic and temporal features for language modeling for dialog, Speech Communication, 54:2, (161-174), Online publication date: 1-Feb-2012.
  22. Baumann T and Schlangen D The InproTK 2012 release NAACL-HLT Workshop on Future Directions and Needs in the Spoken Dialog Community: Tools and Data, (29-32)
  23. Prylipko D, Schnelle-Walka D, Lord S and Wendemuth A Zanzibar OpenIVR Proceedings of the 14th international conference on Text, speech and dialogue, (372-379)
  24. Gürkök H, Hakvoort G and Poel M Modality switching and performance in a thought and speech controlled computer game Proceedings of the 13th international conference on multimodal interfaces, (41-48)
  25. Novak J, Minematsu N and Hirose K Open source WFST tools for LVCSR cascade development Proceedings of the 9th International Workshop on Finite State Methods and Natural Language Processing, (65-73)
  26. Baumann T and Schlangen D Predicting the micro-timing of user input for an incremental spoken dialogue system that completes a user's ongoing turn Proceedings of the SIGDIAL 2011 Conference, (120-129)
  27. Soupionis Y and Gritzalis D (2010). Audio CAPTCHA, Computers and Security, 29:5, (603-618), Online publication date: 1-Jul-2010.
  28. Hamidi F, Baljko M, Livingston N and Spalteholz L CanSpeak Proceedings of the 12th international conference on Computers helping people with special needs: Part I, (605-612)
  29. Wang C, Liu Z and Fels S Everyone can do magic Proceedings of the 9th international conference on Entertainment computing, (32-42)
  30. Buß O, Baumann T and Schlangen D Collaborating on utterances with a spoken dialogue system using an ISU-based approach to incremental dialogue management Proceedings of the 11th Annual Meeting of the Special Interest Group on Discourse and Dialogue, (233-236)
  31. Bahrani M and Sameti H (2010). A new bigram-PLSA language model for speech recognition, EURASIP Journal on Advances in Signal Processing, 2010, (1-8), Online publication date: 1-Feb-2010.
  32. Bursztein E and Bethard S Decaptcha Proceedings of the 3rd USENIX conference on Offensive technologies, (8-8)
  33. Mendonça H, Lawson J, Vybornova O, Macq B and Vanderdonckt J A fusion framework for multimodal interactive applications Proceedings of the 2009 international conference on Multimodal interfaces, (161-168)
  34. Baumann T, Atterer M and Schlangen D Assessing and improving the performance of speech recognition for incremental systems Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, (380-388)
  35. Hoffman G and Breazeal C Anticipatory perceptual simulation for human-robot joint practice Proceedings of the 23rd national conference on Artificial intelligence - Volume 3, (1357-1362)
  36. Qu S and Chai J Beyond attention Proceedings of the 13th international conference on Intelligent user interfaces, (237-246)
  37. Hoffman G and Breazeal C Achieving fluency through perceptual-symbol practice in human-robot collaboration Proceedings of the 3rd ACM/IEEE international conference on Human robot interaction, (1-8)
  38. Dumas B, Lalanne D, Guinard D, Koenig R and Ingold R Strengths and weaknesses of software architectures for the rapid creation of tangible and multimodal interfaces Proceedings of the 2nd international conference on Tangible and embedded interaction, (47-54)
  39. Domont X, Heckmann M, Wersing H, Joublin F, Menzel S, Sendhoff B and Goerick C Word recognition with a hierarchical neural network Proceedings of the 2007 international conference on Advances in nonlinear speech processing, (142-151)
  40. Denkowski M, Hannon C and Sanchez A Spoken commands in a smart home Proceedings of the artificial intelligence 6th Mexican international conference on Advances in artificial intelligence, (1025-1034)
  41. Kubat R, DeCamp P and Roy B Totalrecall Proceedings of the 9th international conference on Multimodal interfaces, (208-215)
  42. Qu S and Chai J Salience modeling based on non-verbal modalities for spoken language understanding Proceedings of the 8th international conference on Multimodal interfaces, (193-200)
  43. Gold K and Scassellati B Using context and sensory data to learn first and second person pronouns Proceedings of the 1st ACM SIGCHI/SIGART conference on Human-robot interaction, (110-117)
  44. Verstraeten D, Schrauwen B, Stroobandt D and Van Campenhout J (2005). Isolated word recognition with the Liquid State Machine, Information Processing Letters, 95:6, (521-528), Online publication date: 30-Sep-2005.
Contributors
  • Sun Microsystems
  • Spotify USA Inc
  • Sun Microsystems
  • Carnegie Mellon University
  • Carnegie Mellon University
  • Technical University of Darmstadt
  • University of Montana
  • Mitsubishi Electric Research Laboratories
