experimental results
The above-mentioned speech command recognition algorithms for recognizing Uzbek speech commands were implemented in the EM3288 module. These words are: “talaba, malika, samalyot, lola, bola, ona, masala, mamlakat, olti, oltin, lochin, lobar, kitob, maktab, archa, chelak, gul, yulduz, sayyora, viloyat, tegirmon, dala, vagon, bekobod, angren, yangiyo’l, yunusobod, umid”.
Recognition of Uzbek speech commands in the EM3288 module showed the following accuracy (Fig.4).
Figure 4. Accuracy of speech command recognition using various algorithms.
From the figure above, it can be seen that the spectral values of the speech signal and the MFCC parameters showed 86%, 76% and 94% accuracy, respectively, when obtained by VQ, DTW and HMM algorithms. The signal parameters obtained by the STFT + SVD algorithms proposed by us showed 98% accuracy when recognized by the HMM algorithm.
conclusion and future work
The built-in EM3288 system comes in handy when implementing speech control systems through speech commands in different environments. Because through this device it is possible to combine algorithms and programs for processing complex speech signals under a single operating system. The HMM model has shown high results in recognizing limited speech commands in embedded systems. Recognition of speech commands in systems installed using the proposed STFT+SVD+HMM algorithm showed 98% accuracy.
References
M. Mazo, F. J. Rodriguez, J. L. Lazaro, J. Urena, J. C. Garcia, E. Santiso, P. Revenga, J. J. Garcia, «Wheelchair for Physically Ultrasonic and Infrared Sensor Control» Autonomous Robots, № Vol. 2., pp. pp. 203-224, 1995.
V. V. Ngoc, J. Whittington, J. Devlin, «Real-time Hardware Feature Extraction with Embedded Signal Enhancement for Automatic Speech Recognition» Speech Technologies, Intech, pp. pp. 29-54, 2011.
M. Bahoura, H. Ezzaidi, «Hardware implementation of MFCC feature extraction for respiratory sounds analysis» M. Bahoura, H. Ezzaidi, Hardware implementation of MFCC feature ext8th Workshop on Systems, Signal Processing and their Applications, pp. pp. 226-229, 2013.
Mosleh M., Setayeshi S., Mehdi Lotfinejad M., Mirshekari A., «FPGA implementation of a linear systolic array for speech recognition based on HMM» The 2nd International Conference on Computer and Automation Engineering (ICCAE), № Vol. 3, pp. pp. 75-78., 2010.
Veitch R., Aubert L.-M., Woods R., Fischaber S., «FPGA Implementation of a Pipelined Gaussian Calculation for HMM-Based Large Vocabulary Speech Recognition» International Journal of Reconfigurable Computing, 2011.
Stephen J. Melnikoff, Steven F. Quigley, Martin J. Russell, «Implementing a Hidden Markov Model Speech Recognition System in Programmable Logic» 11-th International Conference on Field Programmable Logic and Applications, Lecture Notes in Computer Scienc, № Vol. 2147, pp. pp. 81-90., 2001.
J.J. Rodríguez-Andina, R.D.R. Fagundes, D.B. Junior, «A FPGA-based Viterbi algorithm implementation for speech recognition systems,» A FPGA-based Viterbi algorAcoustics, Speech, and Signal Processing. Proceedings. (ICASSP '01). 2001 IEEE International Conference, т. Vol. 2, pp. pp. 1217-1220, 2001.
G. Tamulevicius, V. Arminas, E. Ivanovas, D. Navakauskas, «Hardware Accelerated FPGA Implementation of Lithuanian Isolated Word Recognition System» Electronics & Electrical Engineering, т. 3, № 99, pp. pp. 57-62., 2010.
Sujuan Ke, Yibin Hou, Zhangqin Huang, Hui Li, «A HMM Speech Recognition System Based on FPGA» Image and Signal Processing, CISP '08, т. vol. 5, pp. pp. 305-309., 2008.
Мусаев М. М., «Современные методы цифровой обработки речевых сигналов.,» Вестник ТУИТ, т. 2, № 42, pp. стр. 2-13, 2017[In Russian].
Мусаев М.М., Хужаяров И.Ш., Очилов М.М., «Машинали ўқитиш алгоритмлари асосида ўзбек тили фонемаларини таниб олиш.,» Информатика ва энергетика муаммолари., т. 6, 2019[In Uzbek].
Алимурадов А.К.,Чураков П.П., «Обзор и классификация методов обработки речевых сигналов в системах распознавания речи» Измерение. Мониторинг. Управление. Контроль, т. №2, № 12, pp. 27-35, 2015[In Russian].
Musaev M., Khujayorov I. and Ochilov M., «The Use of Neural Networks to Improve the Recognition Accuracy of Explosive and Unvoiced Phonemes in Uzbek Language» в Information Communication Technologies Conference (ICTC), Nanjing, China, 2020.
Мусаев М. М., Рахимов М. Ф. «Алгоритмы параллельной обработки речевых сигналов,» Вестник ТУИТ, т. 2, № 46, pp. 2-13 стр., 2018[In Russian].
M.M. Musaev, U.A. Berdanov, K.E. Shukurov, «Hardware and software solution signal compression algorithms based on the Chebyshev polynomial» International Journal of Information and Electronics Engineering, т. Vol. 4, № No. 5, pp. pp. 380-383, 2014.
Мусаев M.М., Кардашев М.С., «Спектральный анализ сигналов на многоядерных процессорах.,» Журнал «Цифровая обработка сигналов», № № 3, pp. стр.82-86., 2014[In Russian].
Do'stlaringiz bilan baham: |