1. Git re-basin: Merging models modulo permutation symmetries;Ainsworth,2023
2. Layer normalization;Ba,2016
3. Revisiting model stitching to compare neural representations;Bansal;Advances in Neural Information Processing Systems,2021
4. Network optimization: continuous and discrete methods;Bertsekas,1998
5. Similarity and matching of neural network representations;Csiszárik;Advances in Neural Information Processing Systems,2021