1. Contextual deep learning-based audio-visual switching for speech enhancement in real-world environments;Adeel;Information Fusion,2020
2. Deep audio-visual speech recognition;Afouras;IEEE Transactions on Pattern Analysis and Machine Intelligence,2018
3. LRS3-TED: A large-scale dataset for visual speech recognition;Afouras,2018
4. Voice interfaced vehicle user help;Alvarez,2010
5. MuAViC: A multilingual audio-visual corpus for robust speech recognition and robust speech-to-text translation;Anwar,2023