Recurrent Neural Networks for Decoding Lip Read Speech
Conference paper
Fenghour, S, Chen, D and Xiao, P (2019). Recurrent Neural Networks for Decoding Lip Read Speech. 2019 8th International Conference on Software and Information Engineering (ICSIE 2019). Cairo 09 - 12 Apr 2019
Authors | Fenghour, S, Chen, D and Xiao, P |
---|---|
Type | Conference paper |
Abstract | The success of automated lip reading has been constrained by the inability to distinguish between homopheme words, which are words have different characters and produce the same lip movements (e.g. ”time” and ”some”), despite being intrinsically different. One word can often have different phonemes (units of sound) producing exactly the viseme or visual equivalent of phoneme for a unit of sound. Through the use of a Long-Short Term Memory Network with word embeddings, we can distinguish between homopheme words or words that produce identical lip movements. The neural network architecture achieved a character accuracy rate of 77.1% and a word accuracy rate of 72.2%. |
Year | 2019 |
Accepted author manuscript | License CC BY 4.0 |
Publication dates | |
09 Apr 2019 | |
Publication process dates | |
Deposited | 20 Mar 2019 |
Accepted | 18 Mar 2019 |
Permalink -
https://openresearch.lsbu.ac.uk/item/866z8
42
total views30
total downloads4
views this month0
downloads this month
Related outputs
Distributed deep networks based on Bagging-Down SGD algorithm
Qin, C, Gao, X and Chen, D (2019). Distributed deep networks based on Bagging-Down SGD algorithm. Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics. 41 (5), pp. 1021-1027.Design of a voice control 6DoF grasping robotic arm based on ultrasonic sensor, computer vision and Alexa voice assistance
Wang, Z, Chen, D and Xiao, P (2019). Design of a voice control 6DoF grasping robotic arm based on ultrasonic sensor, computer vision and Alexa voice assistance. International Conference on Information Technology in Medicine and Education. Qingdao, China 23 - 25 Aug 2019Towards automated cost analysis, benchmarking and estimating in construction: a machine learning approach
Chen, D, Hajderanj, L and Fiske, J (2019). Towards automated cost analysis, benchmarking and estimating in construction: a machine learning approach. 13th Multi Conference on Computer Science and Information Systems (MCCSIS). Porto, Portugal 16 - 18 Jul 2019In-Vivo Skin Capacitive Image Classification Using AlexNet Convolution Neural Network
Zhang, X, Pan, W and Xiao, P (2018). In-Vivo Skin Capacitive Image Classification Using AlexNet Convolution Neural Network. 2018 3rd International Conference on Image, Vision and Computing (ICIVC 2018). Chongqing, China 27 - 29 Jun 2018 pp. 439-443 doi:10.1109/ICIVC.2018.8492860
Educational Network Bandwidth Analysis and Prediction
Oumar, O A, Dyllon, S and Xiao, P (2018). Educational Network Bandwidth Analysis and Prediction. the 14th International Conference on Machine Learning and Data Mining (MLDM'2018) July 14-19, 2018. New York, USA 14 - 19 Jul 2018Educational Bandwidth Traffic Prediction using Non-Linear Autoregressive Neural Networks
Oumar, O A, Dyllon, S, Xiao, P and Hong, T (2018). Educational Bandwidth Traffic Prediction using Non-Linear Autoregressive Neural Networks. The 21st International Conference on Climbing and Walking Robots and the Support Technologies for Mobile Machines - CLAWAR 2018. Panama 10 - 12 Sep 2018