Publisher
Springer Nature Switzerland
Reference39 articles.
1. Ainsworth, S.K., Hayase, J., Srinivasa, S.: Git re-basin: merging models modulo permutation symmetries (2023). https://doi.org/10.48550/arXiv.2209.04836
2. Allen-Zhu, Z., Li, Y.: Towards understanding ensemble, knowledge distillation and self-distillation in deep learning. CoRR abs/2012.09816 (2020). https://arxiv.org/abs/2012.09816
3. Anwar, S., Hwang, K., Sung, W.: Structured pruning of deep convolutional neural networks. J. Emerg. Technol. Comput. Syst. 13(3) (2017). https://doi.org/10.1145/3005348
4. Bhagat Smith, J., Gashler, M.: Investigation of how neural networks learn from the experiences of peers through periodic weight averaging. In: 2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA) (2017). https://doi.org/10.1109/ICMLA.2017.00-72
5. Chowdhery, A., et al.: Palm: scaling language modeling with pathways (2022). https://doi.org/10.48550/arXiv.2204.02311