1. P. Bajaj , D. Campos , N. Craswell , L. Deng , J. Gao , X. Liu , R. Majumder , A. Mc- Namara , B. Mitra , T. Nguyen , M. Rosenberg , X. Song , A. Stoica , S. Tiwary , and T. Wang . MS MARCO: A Human Generated MAchine Reading COmprehension Dataset. arXiv:1611.09268v3 , 2018 . P. Bajaj, D. Campos, N. Craswell, L. Deng, J. Gao, X. Liu, R. Majumder, A. Mc- Namara, B. Mitra, T. Nguyen, M. Rosenberg, X. Song, A. Stoica, S. Tiwary, and T. Wang. MS MARCO: A Human Generated MAchine Reading COmprehension Dataset. arXiv:1611.09268v3, 2018.
2. H. Bast and M. Celikik . Efficient index-based snippet generation. ACM Transactions on Information Systems (TOIS), 32(2):1--24 , 2014 . H. Bast and M. Celikik. Efficient index-based snippet generation. ACM Transactions on Information Systems (TOIS), 32(2):1--24, 2014.
3. InPars: Unsupervised Dataset Generation for Information Retrieval
4. A Full-Text Learning to Rank Dataset for Medical Information Retrieval
5. M. Bueno , C. Gemmel , J. Dalton , R. Lotufo , and R. Nogueira . Induced natural language rationales and interleaved markup tokens enable extrapolation in large language models. arXiv preprint arXiv:2208.11445 , 2022 . M. Bueno, C. Gemmel, J. Dalton, R. Lotufo, and R. Nogueira. Induced natural language rationales and interleaved markup tokens enable extrapolation in large language models. arXiv preprint arXiv:2208.11445, 2022.