1. Adafre S, de Rijke M (2006) Finding Similar Sentences across Multiple Languages in Wikipedia, In: Proceedings of the 11th conference of the European chapter of the association for computational linguistics (EACL), pp 62–69
2. Aker A, Kanoulas E, Gaizauskas R (2012) A light way to collect comparable corpora from the Web, In: Calzolari N, Choukri K, Declerck T, Dogan M, Maegaard B, Mariani J, Odijk J and Piperidis S (eds) Proceedings of the eighth international conference on language resources and evaluation (LREC), European Language Resources Association (ELRA), Istanbul, Turkey, pp 15–20
3. Artetxe M, Schwenk H (2019) Massively multilingual sentence embeddings for zero-shot cross-lingual transfer and beyond. Trans Assoc Comput Linguist (TACL) 7:597–610
4. Aspert N, Miz V, Ricaud B, Vandergheynst P (2019) A Graph-Structured Dataset for Wikipedia Research, In: Companion Proceedings of The 2019 World Wide Web conference (WWW), Association for Computing Machinery (ACM), New York, NY, USA, pp 1188–1193
5. Barrón-Cedeño A, España-Bonet C, Boldoba J, Màrquez L (2015) A Factory of Comparable Corpora from Wikipedia, In: Proceedings of the 8th Workshop on Building and Using Comparable Corpora (BUCC), Beijing, China, pp 3–13. http://www.aclweb.org/anthology/W15-3402