1. Achanta, S., Antony, A., Golipour, L., Li, J., Raitio, T., Rasipuram, R., Rossi, F., Shi, J., Upadhyay, J., Winarsky, D., & Zhang, H. (2021). On-device neural speech synthesis. In ASRU (pp 1155–1161). IEEE.
2. Bengio, Y., Ducharme, R., & Vincent, P. (2000). A neural probabilistic language model. NeurIPS.
3. Brown, T., Mann, B., Ryder, N., et al., ... Amodei, D. (2020). Language models are few-shot learners. Advances in Neural Information Processing Systems, 33, 1877–1901.
4. Christiano, P. F., Leike, J., Brown, T., Martic, M., Legg, S., & Amodei, D. (2017). Deep reinforcement learning from human preferences. Advances in Neural Information Processing Systems, 30
5. Devlin, J., Chang, M. W., Lee, K., et al. (2019). BERT: Pre-training of deep bidirectional transformers for language understanding (pp. 4171–4186). NAACL.