1. Rosana Ardila, Megan Branson, Kelly Davis, Michael Henretty, Michael Kohler, Josh Meyer, Reuben Morais, Lindsay Saunders, Francis M. Tyers, and Gregor Weber. 2019. Common Voice: A Massively-Multilingual Speech Corpus. CoRR, Vol. abs/1912.06670 (2019). arxiv: 1912.06670 http://arxiv.org/abs/1912.06670
2. Bryant Chen, Wilka Carvalho, Nathalie Baracaldo, Heiko Ludwig, Benjamin Edwards, Taesung Lee, Ian M. Molloy, and Biplav Srivastava. 2018. Detecting Backdoor Attacks on Deep Neural Networks by Activation Clustering. CoRR, Vol. abs/1811.03728 (2018). arxiv: 1811.03728 http://arxiv.org/abs/1811.03728
3. Douglas Coimbra de Andrade, Sabato Leo, Martin Loesener Da Silva Viana, and Christoph Bernkopf. 2018. A neural attention model for speech command recognition. arxiv: eess.AS/1808.08929
4. STRIP
5. Google. 2022. Speech-to-Text basics. https://cloud.google.com/speech-to-text/docs/basics. [Online Accessed: today].