1. Hsu A, Khoo W, Goyal N, Wainstein M. Next-generation digital ecosystem for climate data mining and knowledge discovery: a review of digital data collection technologies. Fron Big Data. 2020;3:29. https://doi.org/10.3389/fdata.2020.00029.
2. Gharagozlou H, Mohammadzadeh J, Bastanfard A, Ghidary SS. Semantic relation extraction: a review of approaches, datasets, and evaluation methods with looking at the methods and datasets in the persian language. ACM Trans Asian Low-Resour Lang Inf Process. 2023. https://doi.org/10.1145/3592601.
3. Kinney R, Anastasiades C, Authur R, Beltagy I, Bragg J, Buraczynski A, Cachola I, Candra S, Chandrasekhar Y, Cohan A, Crawford M, Downey D, Dunkelberger J, Etzioni O, Evans R, Feldman S, Gorney J, Graham D, Hu F, Huff R, King D, Kohlmeier S, Kuehl B, Langan M, Lin D, Liu H, Lo K, Lochner J, MacMillan K, Murray T, Newell C, Rao S, Rohatgi S, Sayre P, Shen Z, Singh A, Soldaini L, Subramanian S, Tanaka A, Wade AD, Wagner L, Wang LL, Wilhelm C, Wu C, Yang J, Zamarron A, Zuylen MV, Weld DS. The Semantic Scholar Open Data Platform. 2023; https://arxiv.org/abs/2301.10140.
4. Lo K, Wang LL, Neumann M, Kinney R, Weld D. S2ORC: The semantic scholar open research corpus. In: Jurafsky D, Chai J, Schluter N, Tetreault J, editors. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 4969–4983. Association for Computational Linguistics, Online 2020. https://doi.org/10.18653/v1/2020.acl-main.447. https://aclanthology.org/2020.acl-main.447.
5. Saier T, Krause J, Färber M. unarxive 2022: All arxiv publications pre-processed for nlp, including structured full-text and citation network. In: 2023 ACM/IEEE joint conference on digital libraries (JCDL), 2023. pp. 66–70. https://doi.org/10.1109/JCDL57899.2023.00020.