1. Cristian Bucilu?, Rich Caruana , and Alexandru Niculescu-Mizil . 2006 . Model compression . In Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining. 535--541 . Cristian Bucilu?, Rich Caruana, and Alexandru Niculescu-Mizil. 2006. Model compression. In Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining. 535--541.
2. Wide & Deep Learning for Recommender Systems
3. Corinna Cortes and Mehryar Mohri . 2004. AUC optimization vs. error rate minimization. Advances in neural information processing systems , Vol. 16 , 16 ( 2004 ), 313--320. Corinna Cortes and Mehryar Mohri. 2004. AUC optimization vs. error rate minimization. Advances in neural information processing systems, Vol. 16, 16 (2004), 313--320.
4. Jacob Devlin , Ming-Wei Chang , Kenton Lee , and Kristina Toutanova . 2018 . Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018). Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2018. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018).
5. MOBIUS