1. W. L. Taylor, ''“Cloze procedure”: A new tool for measuring readability,'' Journalism Quarterly, vol. 30, no. 4, pp. 415--433, 1953.
2. K. Church, ''A stochastic parts program and noun phrase parser for unrestricted text,'' in Proceedings of the Second Conference on Applied Natural Language Processing. ACL, 1988.
3. P. Rajpurkar, J. Zhang, K. Lopyrev, and P. Liang, ''SQuAD: 100,000+ questions for machine comprehension of text,'' in Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing. Austin, Texas: Association for Computational Linguistics, Nov. 2016, pp. 2383--2392. [Online]. Available: https://aclanthology.org/D16-1264
4. A. Wang, A. Singh, J. Michael, F. Hill, O. Levy, and S. Bowman, ''GLUE: A multi-task benchmark and analysis platform for natural language understanding,'' in Proceedings of the 2018 EMNLP Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP. Brussels, Belgium: Association for Computational Linguistics, Nov. 2018, pp. 353--355. [Online]. Available: https://www.aclweb.org/anthology/W18-5446
5. A. Wang, Y. Pruksachatkun, N. Nangia, A. Singh, J. Michael, F. Hill, O. Levy, and S. Bowman, ''SuperGLUE: A stickier benchmark for general-purpose language understanding systems,'' Advances in Neural Information Processing Systems, vol. 32, 2019.