1. Vaswani A , Shazeer N , Parmar N , Uszkoreit J , Jones L , Gomez AN , et al. Attention is all you need. Adv Neural Inf Process Syst. 2017;30.
2. Touvron H , Lavril T , Izacard G , Martinet X , Lachaux M-A , Lacroix T , et al. Llama: Open and efficient foundation language models. ArXiv Prepr ArXiv230213971. 2023;
3. Touvron H , Martin L , Stone K , Albert P , Almahairi A , Babaei Y , et al. Llama 2: Open foundation and fine-tuned chat models. ArXiv Prepr ArXiv230709288. 2023;
4. Bommasani R , Hudson DA , Adeli E , Altman R , Arora S , von Arx S , et al. On the opportunities and risks of foundation models. ArXiv Prepr ArXiv210807258. 2021;
5. Language models are unsupervised multitask learners;OpenAI Blog,2019