1. Devlin J, Chang M-W, Lee K, Toutanova K. 2018. BERT: pre-training of deep bidirectional transformers for language understanding. arXiv:1810.04805 [cs].
2. Hugging Face. 2021. bert-large-uncased-whole-word-masking. Hugging Face [accessed 2021 Mar 22]. https://huggingface.co/bert-large-uncased-whole-word-masking.
3. Lococo KH, Staplin L, Martell CA, Sifrit KJ. 2012. Pedal application errors. Washington (DC): National Highway Traffic Safety Administration. Report No.: DOT HS 811 597.
5. Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser Ł, Polosukhin I. 2017. Attention is all you need. Paper presented at: 31st Conference on Neural Information Processing Systems (NIPS 2017); Long Beach, CA.