1. Baroni M, Dinu G, Kruszewski G (2014) In: Proceedings of the 52nd annual meeting of the association for computational linguistics, pp 238–247
2. Bengio Y, Ducharme R, Vincent P, Jauvin C (2003) A neural probabilistic language model. J Mach Learn Res 3:1137
3. Cao S, Lu W, Zhou J, Li X (2018) In: Proceedings of the 32nd AAAI conference on artificial intelligence, pp 5053–5061
4. Chen X, Xu L, Liu Z, Sun M, Luan H (2015) In: Proceedings of the 24th international joint conference on artificial intelligence, pp 1236–1242
5. Chen HY, Yu SH, Lin SD (2020) In: Proceedings of the 58th annual meeting of the association for computational linguistics, pp 2865–2871