全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

An Overview of Basics Speech Recognition and Autonomous Approach for Smart Home IOT Low Power Devices

DOI: 10.4236/jsip.2018.94015, PP. 239-257

Keywords: Voice Recognition, Speech Processing, Voice Command, Embedded Device

Full-Text   Cite this paper   Add to My Lib

Abstract:

Automatic speech recognition, often incorrectly called voice recognition, is a computer based software technique that analyzes audio signals captured by a microphone and translates them into machine interpreted text. Speech processing is based on techniques that need local CPU or cloud computing with an Internet link. An activation word starts the uplink; “OK google”, “Alexa”, … and voice analysis is not usually suitable for autonomous limited CPU system (16 bits microcontroller) with low energy. To achieve this realization, this paper presents specific techniques and details an efficiency voice command method compatible with an embedded IOT low-power device.

References

[1]  Kamarudin, M.R., et al. (2013) Low Cost Smart Home Automation via Microsoft Speech Recognition. International Journal of Engineering & Computer Science IJECS, IJECS-IJENS, 13.
[2]  Vacher, M., Istrate, D., Portet, F. Joubert, T. (2011) The Sweet-Home Project: Audio Technologyin Smart Homes to Improve Well-Being and Reliance. 2011 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.
[3]  Georgoulas, C., Raza, A., Güttler, J., Linner, T. and Bock, T. (2014) Proceedings of the International Symposium on Automation and Robotics in Construction, Vilnius, Vol. 31, 1-9.
[4]  Rabiner, L. and Jaung, B. (1993) Fundamentals of Speech Recognition. Prentice-Hall, Englewood Cliffs.
[5]  Becchetti, C. and Ricotti, K.P. (2008) Speech Recognition: Theory and C++ Implementation (With CD). John Wiley & Sons.
[6]  Thiang, D.W. (2009) Limited Speech Recognition for Controlling Movement of Mobile Robot Implemented on ATmega162 Microcontroller. International Conference on Computer and Automation Engineering.
[7]  Amodei, D., Ananthanarayanan, S. and Anubhai, R. Deep Speech 2: End-to-End Speech Recognition in English and Mandarin. Proceedings of Machine Learning Research, 48.
[8]  Verma, A., Kumar, A. and Kaur, I. (2018) Automatic Speech Recognition Using Mel-Frequency Cepstrum Coefficient (MFCC) and Vector Quantization (VQ) Techniques for Continuous Speech. International Journal of Advanced and Applied Sciences, 5, 73-78.
[9]  Kumar, S. (2014) Ubiquitous Smart Home System Using Android Application. International Journal of Computer Networks & Communications, 6, 33-43.
[10]  Sonia, B. and Sridhar, D.S. (2018) Implementation of Voice Recognition Technology in Hospitals Using Dragon NaturallySpeaking Software. Biometrics and Bioinformatics, 10.
[11]  Vojtas, P., Stepan, J., Sec, D., Cimler, R. and Krejcar, O. (2018) Voice Recognition Software on Embedded Devices. In: Nguyen, N.T., Hoang, D.H., Hong, T.-P., Pham, H. and Trawiński, B., Eds., Intelligent Information and Database Systems, Vol. 10751, Springer International Publishing, Cham, 642-650.
[12]  Lai, C.-H. and Hwang, Y.-S. (2018) The Voice Controlled Internet of Things System. 1-3.
[13]  Basyal, L., Kaushal, S. and Singh, G. (2018) Voice Recognition Robot with Real Time Surveillance and Automation. International Journal of Creative Research Thoughts (IJCRT), 6, 11-16.
[14]  Newman, M.J. et al. (2015) Embedded System for Construction of Small Footprint Speech Recognition with User-Definable Constraints. Patent US9117449B2, 2015-08-25.
[15]  Perez-Cortes, J.C. and Guardiola, J.L. (2009) Pattern Recognition with Embedded Systems Technology: A Survey. 20th International Workshop on Database and Expert Systems Application.
[16]  Jiang, H., et al. (2014) Convolutional Neural Networks for Speech Recognition. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 22, 1533-1545.
[17]  Deng, L. and Li, X. (2013) Machine Learning Paradigms for Speech Recognition: An Overview. IEEE Transactions on Audio, Speech, and Language Processing, 21, 1060-1089.
https://doi.org/10.1109/TASL.2013.2244083
[18]  IEEE (2012) IEEE Systems, Man, and Cybernetics Society. IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews), 42, C2.
[19]  Sinha, P. (2009) CPU Architectures for Speech Processing. In: Speech Processing in Embedded Systems, Springer International Publishing, Boston, MA, 55-74
[20]  Nedevschi, S., Patra, R.K. and Brewer, E.A. (2005) Hardware Speech Recognition for User Interfaces in Low Cost, Low Power Devices. Proceedings of 42nd Design Automation Conference, Anaheim, CA, USA, 13-17 June 2005.
[21]  Schafer, E.C., et al. (2017) Speech Recognition in Noise in Adults and Children Who Speak English or Chinese as Their First Language. Journal of the American Academy of Audiology.
https://doi.org/10.3766/jaaa.17066
[22]  Li, K., Mao, S., Li, X., Wu, Z. and Meng, H. (2018) Automatic Lexical Stress and Pitch Accent Detection for L2 English Speech Using Multi-Distribution Deep Neural Networks. Speech Communication, 96, 28-36.
https://doi.org/10.1016/j.specom.2017.11.003

Full-Text

Contact Us

[email protected]

QQ:3279437679

WhatsApp +8615387084133