1. Deep audio-visual speech recognition;Afouras;IEEE Transactions on Pattern Analysis and Machine Intelligence,2018
2. LRS3-TED: A large-scale dataset for visual speech recognition;Afouras,2018
3. ASR is all you need: Cross-modal distillation for lip reading;Afouras,2020
4. Sub-word level lip reading with visual attention;Afouras,2021
5. Clustering Persian viseme using phoneme subspace for developing visual speech application;Aghaahmadi;Multimedia Tools and Applications,2013