Corrected Tandem Features for Acoustic Model Training

TitleCorrected Tandem Features for Acoustic Model Training
Publication TypeConference Paper
Year of Publication2008
AuthorsFaria, A., & Morgan N.
Conference NameICASSP 2008
Date Published04/2008
Conference LocationLas Vegas, Nevada

This paper describes a simple method for significantly improving Tandem features used to train acoustic models for large-vocabulary speech recognition. The linear activations at the outputs of an MLP classifier were modified according to known reference labels: where necessary, the activation of the output unit corresponding to the correct phone label was increased in order to make an accurate classification. This technique was inspired by another experiment that determined a lower error bound on ASR performance within the Tandem framework. By simulating an idealized classifier with forward-backward phone posterior probabilities, we observed a best-case scenario in which nearly all errors were eliminated. Although this performance is not practically achievable, the experiment demonstrated the validity of the Tandem processing approach and suggested that considerable gains are possible by improving the MLP phone classifier.

Corrected Tandem Features for Acoustic Model Training.pdf118.97 KB