1. Agarwal, R. and Boggess, L. (1992). “A Simple but Useful Approach to Conjunct Identification.” In Proceedings of the 30th Annual Meeting of the Association for Computational Linguistics, pp. 15–21. Association for Computational Linguistics.
2. Andor, D., Alberti, C., Weiss, D., Severyn, A., Presta, A., Ganchev, K., Petrov, S., and Collins, M. (2016). “Globally Normalized Transition-Based Neural Networks.” In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 2442–2452. Association for Computational Linguistics.
3. de Marneffe, M.-C., MacCartney, B., and Manning, C. D. (2006). “Generating Typed Dependency Parses from Phrase Structure Parses.” In Proceedings of the 5th International Conference on Language Resources and Evaluation, pp. 449–454. European Language Resources Association (ELRA).
4. Devlin, J., Chang, M.-W., Lee, K., and Toutanova, K. (2019). “BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding.” In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pp. 4171–4186. Association for Computational Linguistics.
5. Dozat, T. and Manning, C. D. (2017). “Deep Biaffine Attention for Neural Dependency Parsing.” In Proceedings of the 5th International Conference on Learning Representations.