1. Ba, J.L., Kiros, J.R., Hinton, G.E., 2016. Layer normalization. URL: https://arxiv.org/abs/1607.06450, doi:10.48550/ARXIV.1607.06450.
2. Bahdanau, D., Cho, K., Bengio, Y., 2014. Neural machine translation by jointly learning to align and translate. CoRR abs/1409.0473.
3. Bai, J., Wang, Y., Chen, Y., Yang, Y., Bai, J., Yu, J., Tong, Y., 2021. Syntax-BERT: Improving pre-trained transformers with syntax trees, in: Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, Association for Computational Linguistics, Online. pp. 3011–3020. URL: https://www.aclweb.org/anthology/2021.eacl-main.262.
4. Balachandran, V., Pagnoni, A., Lee, J.Y., Rajagopal, D., Carbonell, J., Tsvetkov, Y., 2021. StructSum: Summarization via structured representations, in: Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, Association for Computational Linguistics, Online. pp. 2575–2585. URL: https://www.aclweb.org/anthology/2021.eacl-main.220.
5. Brants, S., Dipper, S., Hansen, S., Lezius, W., Smith, G., 2002. TIGER treebank, in: Proceedings of the 1st Workshop on Treebanks and Linguistic Theories (TLT), pp. 24–42.