1. Attention is all you need;vaswani;Neural Information Processing Systems,2017
2. Answering complex open-domain questions with multi-hop dense retrieval;xiong;International Conference on Learning Representations,2021
3. Metric Learning: A Survey
4. Long Short-Term Memory
5. Decoupled weight decay regularization;loshchilov;International Conference on Learning Representations,2019