1. Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In NAACL-HLT. 4171--4186.
2. Milan Gritta, Mohammad Taher Pilehvar, and Nigel Collier. 2018. Which Melbourne? Augmenting Geocoding with Maps. In ACL.
3. Yingjie Hu and Jimin Wang. 2020. How do people describe locations during a natural disaster: an analysis of tweets from Hurricane Harvey. arXiv preprint arXiv:2009.12914 (2020).
4. Bowen Jin, Wentao Zhang, Yu Zhang, Yu Meng, Xinyang Zhang, Qi Zhu, and Jiawei Han. 2023. Patton: Language Model Pretraining on Text-Rich Networks. In ACL.
5. Bowen Jin, Yu Zhang, Yu Meng, and Jiawei Han. 2023. Edgeformers: Graph-Empowered Transformers for Representation Learning on Textual-Edge Networks. In The Eleventh International Conference on Learning Representations.