1. Cao, S., Yang, N., Liu, Z.: Online news recommender based on stacked auto-encoder. In: 2017 IEEE/ACIS 16th International Conference on Computer and Information Science (ICIS), pp. 721–726. IEEE (2017)
2. Chen, X., Liu, D., Lei, C., Li, R., Zha, Z.J., Xiong, Z.: BERT4SessRec: content-based video relevance prediction with bidirectional encoder representations from transformer. In: Proceedings of the 27th ACM International Conference on Multimedia, pp. 2597–2601 (2019)
3. Collins, J., Sohl-Dickstein, J., Sussillo, D.: Capacity and trainability in recurrent neural networks. In: Proceedings of the 2017 International Conference on Learning Representations (2017)
4. de Souza Pereira Moreira, G., Lee, S.R.J.M., Ak, R., Oldridge, E.: Transformers4Rec: bridging the gap between NLP and sequential/session-based recommendation. In: RecSys '21: Fifteenth ACM Conference on Recommender Systems, pp. 143–153. Association for Computing Machinery, New York (2021)
5. Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pp. 4171–4186. Association for Computational Linguistics, Minneapolis, Minnesota (2019)