1. Attention is all you need;A Vaswani;Advances in neural information processing systems,2017
2. XLNet: Generalized autoregressive pretraining for language understanding;Z Yang;Advances in neural information processing systems,2019
3. Large language models in medicine;A J Thirunavukarasu;Nature medicine,2023
4. PAL: Program-aided language models;L Gao;International Conference on Machine Learning,2023