1. Rich Caruana . 1997. Multitask learning. Machine learning 28, 1 ( 1997 ), 41–75. Rich Caruana. 1997. Multitask learning. Machine learning 28, 1 (1997), 41–75.
2. Zhao Chen , Vijay Badrinarayanan , Chen-Yu Lee , and Andrew Rabinovich . 2018 . Gradnorm: Gradient normalization for adaptive loss balancing in deep multitask networks . In International conference on machine learning. PMLR, 794–803 . Zhao Chen, Vijay Badrinarayanan, Chen-Yu Lee, and Andrew Rabinovich. 2018. Gradnorm: Gradient normalization for adaptive loss balancing in deep multitask networks. In International conference on machine learning. PMLR, 794–803.
3. Michael Crawshaw . 2020. Multi-task learning with deep neural networks: A survey. arXiv preprint arXiv:2009.09796 ( 2020 ). Michael Crawshaw. 2020. Multi-task learning with deep neural networks: A survey. arXiv preprint arXiv:2009.09796 (2020).
4. Adaptive Domain Interest Network for Multi-domain Recommendation
5. Diederik P Kingma and Jimmy Ba . 2014 . Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014). Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).