1. Aksënova, Alëna, Zhehuai Chen, Chung-Cheng Chiu, Daan van Esch, Pavel Golik, Wei Han, Bhuvana Ramabhadran, Levi King, Andrew Rosenberg, Susan Schwartz & Gary Wang. 2022. Accented speech recognition: Benchmarking, pre-training, and diverse data. arXiv preprint arXiv:2205.08014.
2. Aksënova, Alëna, Daan van Esch, James Flynn & Pavel Golik. 2021. How might we create better benchmarks for speech recognition? In Proceedings of the 1st workshop on benchmarking: Past, present and future, 22–34.
3. Ardila, Rosana, Megan Branson, Kelly Davis, Michael Kohler, Josh Meyer, Michael Henretty, Reuben Morais, Lindsay Saunders, Francis Tyers & Gregor Weber. 2019. Common voice: A massively-multilingual speech corpus. Proceedings of the twelfth language resources and evaluation conference, 4218–4222. European Language Resources Association.
4. Augustyniak, Łukasz, Kamil Tagowski, Albert Sawczyn, Denis Janiak, Roman Bartusiak, Adrian Szymczak, Marcin Wątroba, Arkadiusz Janz, Piotr Szymański, Mikołaj Morzy, Tomasz Kajdanowicz & Maciej Piasecki. 2022. This is the way: Designing and compiling LEPISZCZE, a comprehensive NLP benchmark for Polish. Arxiv:2211.13112.
5. Bender, Emily M. & Batya Friedman. 2018. Data statements for natural language processing: Toward mitigating system bias and enabling better science. Transactions of the Association for Computational Linguistics 6. 587–604. https://doi.org/10.1162/tacl_a_00041.