1. Ashish Vaswani , Noam Shazeer , Niki Parmar , Jakob Uszkoreit , Llion Jones , Aidan N Gomez , Lukasz Kaiser , and Illia Polosukhin . Attention is all you need. Advances in neural information processing systems, 30, 2017.
2. Bertology meets biology: interpreting attention in protein language models;arXiv preprint,2020
3. Learning the protein language: Evolution, structure, and function;Cell systems,2021
4. Learning meaningful representations of protein sequences;Nature communications,2022
5. Lucrezia Valeriani , Diego Doimo , Francesca Cuturello , Alessandro Laio , Alessio Ansuini , and Alberto Cazzaniga . The geometry of hidden representations of large transformer models. Advances in Neural Information Processing Systems, 36, 2024.