1. Adam: a method for stochastic optimization,2015
2. cw2vec: learning Chinese word embeddings with stroke n-gram information,2018
3. BERT: pre-training of deep bidirectional transformers for language understanding,2019
4. Learning deep structured semantic models for web search using clickthrough data,2018
5. Bayesian network based failure diagnosis method for on-board equipment of train control system;Journal of the China Railway Society,2017