1. Bert: Pre-training of deep bidirectional transformers for language understanding;devlin,2018
2. Attention is all you need;vaswani;Advances in Neural IInformation Processing Systems,2017
3. Triple Notch Filter using Non-Uniform Transmission Lines for UWB Applications
4. Roberta: A robustly optimized bert pretraining approach;liu,2019
5. Sequence to sequence learning with neural networks;sutskever;Advances in neural information processing systems,2014