1. Classification of visemes using visual cues;Alothmany,2010
2. Deep speech 2: End-to-end speech recog- nition in English and mandarin;Amodei,2016
3. Assael, Yannis, Shillingford, Brendan, Whiteson, Shimon, & Freitas, Nando. (2016). LipNet: Sentence-level lip-reading.
4. Listen, attend and spell: A neural network for large vocabulary conversational speech recognition;Chan,2016
5. Collobert, Ronan, Puhrsch, Christian, & Synnaeve, Gabriel. (2016). Wav2Letter: An end-to-end ConvNet-based speech recognition system.