1. BommasaniR HudsonDA AdeliE AltmanR AroraS vonArxS et al.On the opportunities and risks of foundation models.2021. Preprint at arXiv: 2108.07258.
2. ZhaoWX ZhouK LiJ TangT WangX HouY et al.A survey of large language models.2023. Preprint at arXiv: 2303.18223.
3. VaswaniA ShazeerN ParmarN UszkoreitJ JonesL GomezAN et al.Attention is all you need.2017. Preprint at arXiv: 1706.03762.
4. UszkoreitJ.Transformer: a novel neural network architecture for language understanding. Google Research Blog.2017.
5. BahdanauD ChoK BengioY.Neural machine translation by jointly learning to align and translate.2014. Preprint at arXiv: 1409.0473.