Error-responsive feedback mechanisms for speech recognizers

January 1997

Author:
Lin Lawrance Chase

Publisher:

Carnegie Mellon University
Schenley Park Pittsburgh, PA
United States

ISBN:978-0-591-91865-6

Order Number:AAI9838206

Pages:

287

Purchase on ProQuest

Bibliometrics

Abstract

This thesis is about modeling, analyzing, and predicting errorful behavior in large vocabulary continuous speech recognition systems. Because today's state-of-the-art recognizers are not designed to be situated naturally in an error feedback loop, they are ill-positioned for inclusion in multi-modal interfaces, multi-media databases, and other interesting applications. I make improvements to the current approach to predicting and analyzing error behaviors, which is currently based only on the measurement of word error rate.

The speech recognizer's functionality is extended to include confidence annotations, which are "meta-level" markings that indicate how certain the recognizer is that it has decoded its input correctly. This is accomplished by feeding externally defined error conditions back to the recoginizer. Error feedback enables the construction of statistical models that map measurements of the recognizer's internal states and behaviors to externally defined error conditions.

The measuring and modeling techniques used for confidence annotation are extended to create a blame assignment system for utterances whose actual transcripts are known. Errors are classified into a set of categories, some of which are directly useful in automatic adaptation schemes while others are more suited for human interpretation.

This classification approach is enhanced when used in conjunction with a visual error analysis tool that was developed during the thesis project.

Cited By

Contributors

Lin Lawrance Chase
Carnegie Mellon University
- Publication Years1997 - 1997
- Publication counts1
- Citation count10
- Available for Download0
- Downloads (cumulative)0
- Downloads (12 months)0
- Downloads (6 weeks)0
- Average Downloads per Article0
- Average Citation per Article10
View Full Profile

Recommendations

Artificial neural networks as speech recognisers for dysarthric speech: Identifying the best-performing set of MFCC parameters and studying a speaker-independent approach

Dysarthria is a neurological impairment of controlling the motor speech articulators that compromises the speech signal. Automatic Speech Recognition (ASR) can be very helpful for speakers with dysarthria because the disabled persons are often ...
Read More
Improving the fine phonetic performance of automatic speech recognizers
Read More
Can continuous speech recognizers handle isolated speech?
Read More

Comments

Browse Theses

Sections

Cited By

Artificial neural networks as speech recognisers for dysarthric speech: Identifying the best-performing set of MFCC parameters and studying a speaker-independent approach

Improving the fine phonetic performance of automatic speech recognizers

Can continuous speech recognizers handle isolated speech?

Sections

Cited By

Save to Binder

Recommendations

Artificial neural networks as speech recognisers for dysarthric speech: Identifying the best-performing set of MFCC parameters and studying a speaker-independent approach

Improving the fine phonetic performance of automatic speech recognizers

Can continuous speech recognizers handle isolated speech?