1. The wikipedia xml corpus;Denoyer,2006
2. Bert: Pre-training of deep bidirectional transformers for language understanding;Devlin,2018
3. The {Ever-Changing} labyrinth: A {Large-Scale} analysis of wildcard {DNS} powered blackhat {SEO};Du,2016
4. Identifying products in online cybercrime marketplaces: A dataset for fine-grained domain adaptation;Durrett,2017
5. Analyzing and identifying data breaches in underground forums;Fang;IEEE Access,2019