1. Renouf, A.: Corpus development 25 years on: from super-corpus to cybercorpus. Lang. Comput. Stud. Pract. Linguist. 62(1), 27–49 (2007)
2. Kennedy, G.: An Introduction to Corpus Linguistics. Studies in Language and Linguistics. Longman, London (1998)
3. Vaswani, A., Shazeer, N., Parmar, N., et al.: Attention is all you need. In: Advances in Neural Information Processing Systems, pp. 6000–6010 (2017)
4. Cohen, K.B., Ogren, P.V., Fox, L., et al.: Corpus design for biomedical natural language processing. In: ACL-ISMB Workshop on Linking Biological Literature, Ontologies and Databases: Mining Biological Semantics, pp. 38–45. Association for Computational Linguistics (2005)
5. Heydon, A., Najork, M.: Mercator: a scalable, extensible web crawler. World Wide Web 2(4), 219–229 (1999)