Ask a Question

Prefer a chat interface with context about you and your work?

End-to-end visual speech recognition with LSTMS

End-to-end visual speech recognition with LSTMS

Traditional visual speech recognition systems consist of two stages, feature extraction and classification. Recently, several deep learning approaches have been presented which automatically extract features from the mouth images and aim to replace the feature extraction stage. However, research on joint learning of features and classification is very limited. In …