Loading...
Search for: lip-reading
0.009 seconds

    Automated Lip-Reading robotic system based on convolutional neural network and long short-term memory

    , Article 13th International Conference on Social Robotics, ICSR 2021, 10 November 2021 through 13 November 2021 ; Volume 13086 LNAI , 2021 , Pages 73-84 ; 03029743 (ISSN) ; 9783030905248 (ISBN) Gholipour, A ; Taheri, A ; Mohammadzade, H ; Sharif University of Technology
    Springer Science and Business Media Deutschland GmbH  2021
    Abstract
    In Iranian Sign Language (ISL), alongside the movement of fingers/arms, the dynamic movement of lips is also essential to perform/recognize a sign completely and correctly. In a follow up of our previous studies in empowering the RASA social robot to interact with individuals with hearing problems via sign language, we have proposed two automated lip-reading systems based on DNN architectures, a CNN-LSTM and a 3D-CNN, on the robotic system to recognize OuluVS2 database words. In the first network, CNN was used to extract static features, and LSTM was used to model temporal dynamics. In the second one, a 3D-CNN network was used to extract appropriate visual and temporal features from the... 

    SFAVD: Sharif farsi audio visual database

    , Article IKT 2013 - 2013 5th Conference on Information and Knowledge Technology, Shiraz, Iran ; 2013 , Pages 417-421 ; 9781467364904 (ISBN) Naraghi, Z ; Jamzad, M ; Sharif University of Technology
    2013
    Abstract
    With increasing use of computers in everyday life, improved communication between machines and human is needed. To make a right communication and understand a humankind face which is made in a graphical environment, implementing the audio and visual projects like lip reading, audio and visual speech recognition and lip making are needed. Lack of a complete audio and visual database for this application in Farsi language made us provide a new complete Farsi database for this project that is called SFAVD. It is a unique audio and visual database which in addition to considering Farsi conceptual and speech structure, it considers influence of speech on lip changes. This database is created for... 

    Designing an Automatic Lip-reading System for Persian Words Using Deep Neural Networks and Implementing it on Rasa Social Robot

    , M.Sc. Thesis Sharif University of Technology Gholipour, Amir (Author) ; Taheri, Alireza (Supervisor) ; Mohammadzadeh, Hoda (Supervisor)
    Abstract
    In Iranian Sign Language (ISL), alongside the movement of fingers, the movement of the lips is also essential for to perform words completely and correctly. The purpose of current study is to provide an automated lip-reading system using deep neural networks and implement it on Rasa social robot; So that the robot can recognize a limited number of specified Persian words. To do this, we propose an automated lip-reading system based on convolutional neural networks and long short-term memories. Convolutional neural networks in extracting features from images and long short-term memories in modeling temporal dynamics have achieved good results. We have also recorded a database in Persian...