1. Data Collection in Sociolinguistics
2. Timothy Baldwin , Paul Cook , Marco Lui , Andrew MacKinlay , and Li Wang . 2013 . How noisy social media text, how diffrnt social media sources? . In Proceedings of the Sixth International Joint Conference on Natural Language Processing. 356–364 . Timothy Baldwin, Paul Cook, Marco Lui, Andrew MacKinlay, and Li Wang. 2013. How noisy social media text, how diffrnt social media sources?. In Proceedings of the Sixth International Joint Conference on Natural Language Processing. 356–364.
3. Erkan Başar , Iris Hendrickx , Emiel Krahmer , Gert-Jan de Bruijn , and Tibor Bosse . 2022 . Hints of independence in a pre-scripted world: on controlled usage of open-domain language models for chatbots in highly sensitive domains . In Proceedings of the 14th International Conference on Agents and Artificial Intelligence. 401–407 . Erkan Başar, Iris Hendrickx, Emiel Krahmer, Gert-Jan de Bruijn, and Tibor Bosse. 2022. Hints of independence in a pre-scripted world: on controlled usage of open-domain language models for chatbots in highly sensitive domains. In Proceedings of the 14th International Conference on Agents and Artificial Intelligence. 401–407.
4. A data-centric review of deep transfer learning with applications to text data
5. A Survey on Data Augmentation for Text Classification