1. Yoshua Bengio Réjean Ducharme and Pascal Vincent. 2000. A neural probabilistic language model. Advances in Neural Information Processing Systems 13. Yoshua Bengio Réjean Ducharme and Pascal Vincent. 2000. A neural probabilistic language model. Advances in Neural Information Processing Systems 13.
2. Antoine Bordes Nicolas Usunier Alberto Garc?a-Durán JasonWeston and Oksana Yakhnenko. 2013. Translating embeddings for modeling multi-relational data. In NIPS 2787--2795. Antoine Bordes Nicolas Usunier Alberto Garc?a-Durán JasonWeston and Oksana Yakhnenko. 2013. Translating embeddings for modeling multi-relational data. In NIPS 2787--2795.
3. Samuel Broscheit . 2019. Investigating entity knowledge in BERT with simple neural end-to-end entity linking . In CoNLL. Association for Computational Linguistics , 677--685. Samuel Broscheit. 2019. Investigating entity knowledge in BERT with simple neural end-to-end entity linking. In CoNLL. Association for Computational Linguistics, 677--685.
4. Kevin Clark , Urvashi Khandelwal , Omer Levy , and Christopher D . Manning . 2019 . What does BERT look at? an analysis of bert's attention. In BlackboxNLP@ACL. Association for Computational Linguistics , 276--286. Kevin Clark, Urvashi Khandelwal, Omer Levy, and Christopher D. Manning. 2019. What does BERT look at? an analysis of bert's attention. In BlackboxNLP@ACL. Association for Computational Linguistics, 276--286.
5. Fahim Dalvi , Abdul Rafae Khan , Firoj Alam, Nadir Durrani, Jia Xu, and Hassan Sajjad. 2022 . Discovering latent concepts learned in BERT. CoRR , abs/2205.07237. Fahim Dalvi, Abdul Rafae Khan, Firoj Alam, Nadir Durrani, Jia Xu, and Hassan Sajjad. 2022. Discovering latent concepts learned in BERT. CoRR, abs/2205.07237.