1. Artificial hallucinations in chatgpt: implications in scientific writing;Alkaissi;Cureus,2023
2. Fine-tuning language models to find agreement among humans with diverse preferences;Bakker;Adv. Neural Inform. Proc. Syst,2022
3. “A systematic review of reproducibility research in natural language processing,”;Belz,2021
4. Chatgpt's one-year anniversary: are open-source large language models catching up?;Chen;arXiv,2023
5. A survey on evaluation of large language models;Chang;arXiv,2023