1. Evaluation Methodologies in Information Retrieval (Dagstuhl Seminar 13441);Agosti Maristella;Dagstuhl Reports,2014
2. Can Generative LLMs Create Query Variants for Test Collections? An Exploratory Study
3. Hussam Alkaissi and Samy I. McFarlane. 2023. Artificial Hallucinations in ChatGPT: Implications in Scientific Writing. Cureus, Vol. 15, 2 (2023), bibinfonumpages4 pages.
4. Negar Arabzadeh, Amin Bigdeli, and Charles L. A. Clarke. 2024. Adapting Standard Retrieval Benchmarks to Evaluate Generated Answers. In Advances in Information Retrieval - 46th European Conference on Information Retrieval, ECIR 2024, Glasgow, UK, March 24--28, 2024, Proceedings, Part II (Lecture Notes in Computer Science, Vol. 14609), Nazli Goharian, Nicola Tonellotto, Yulan He, Aldo Lipani, Graham McDonald, Craig Macdonald, and Iadh Ounis (Eds.). Springer, 399--414.
5. Negar Arabzadeh and Charles L. A. Clarke. 2024. A Comparison of Methods for Evaluating Generative IR. arXiv 2404.04044.