This paper is a study on the composition of Real-Time Continuous Speech Recognition System for Man-Machine Interface and it examines the posibility that applies to automatic system.
HMM model can be classified into Continuous Distribution HMM and Dis...
This paper is a study on the composition of Real-Time Continuous Speech Recognition System for Man-Machine Interface and it examines the posibility that applies to automatic system.
HMM model can be classified into Continuous Distribution HMM and Discrete Duration Control HMM, and the recognition algorithm can be classified into O(n)DP method and One Pass DP method in order to choose HMM model and recognition algorithm.
The simulation is implemented for 35 continuous speech samples of four connected spoken digits in two cases which are divided into two submodels according to whether the regression coefficients are included or not. As a result of the simulation, the average recognition rates show 93.0% and 80.5% respectively for two cases; the one is Continuous Distribution HMM model which includes regression coefficients and the other does not include when O(n)DP method is used.
Average recognition rates show 93.4% and 84.4% respectively for two cases the one is Discrete Duration Control HMM model which includes regression coefficients and the other does not include when O(n)DP method is used.
When HMM model does not include regression coefficients, the average recognition rate of One Pass DP method is better improved than that of O(n)DP method by 12%.
The Continuous Speech Recognition System is composed of Continuous Distribution HMM model and algorithm of One Pass DP method which are chosen by the consideration of computing time and recognition rate according to the result of simulation.
Continuous Speech Recognition System is composed so that it may detect start point and end point of speech data which are converted into samples by 10 KHz, 8 bit A/D within real time, then so that it may recognize them by One Pass DP method, display the result of recognition on PC monitor and at same time send control data to Interface.
HMM models are created by training for continuous speech samples which are control words, area names and digital sounds.
In the result of experiment by Continuous Speech Recognition System, there are some kind of errors which are insertion, replacement and deletion of one syllable, but it examined the posibility that can be applied to Man-Machine Interface on automatic system if post-process is performed for recognition.