Cosine Normalization: Using Cosine Similarity Instead of Dot Product in Neural Networks-Reference-Cited by-同舟云学术

Cosine Normalization: Using Cosine Similarity Instead of Dot Product in Neural Networks

Published:2018 Issue: Volume: Page:382-391
ISSN:0302-9743
Container-title:Artificial Neural Networks and Machine Learning – ICANN 2018
language:
Short-container-title:

Author:

Luo Chunjie,Zhan Jianfeng,Xue Xiaohe,Wang Lei,Ren Rui,Yang Qiang

Publisher

Springer International Publishing

Link

http://link.springer.com/content/pdf/10.1007/978-3-030-01418-6_38

Reference15 articles.

1. Hochreiter, S., Bengio, Y., Frasconi, P., Schmidhuber, J.: Gradient flow in recurrent nets: the difficulty of learning long-term dependencies (2001)

2. Ioffe, S., Szegedy, C.: Batch normalization: accelerating deep network training by reducing internal covariate shift. In: Proceedings of the 32nd International Conference on Machine Learning, pp. 448–456 (2015)

3. Krogh, A., Hertz, J.A.: A simple weight decay can improve generalization. In: NIPS, vol. 4, pp. 950–957 (1991)

4. Lecture Notes in Computer Science (Lecture Notes in Artificial Intelligence);N Srebro,2005

5. Srivastava, N., Hinton, G.E., Krizhevsky, A., Sutskever, I., Salakhutdinov, R.: Dropout: a simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 15(1), 1929–1958 (2014)

Cited by 99 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Acceleration of the complex reacting flow simulation with a generalizable neural network based on meta-learning;Fuel;2024-09

2. Reference-based super-resolution reconstruction of remote sensing images based on a coarse-to-fine feature matching transformer;Engineering Applications of Artificial Intelligence;2024-09

3. LSTM Network-Based Adaptation Approach for Dynamic Integration in Intelligent End-Edge-Cloud Systems;Tsinghua Science and Technology;2024-08

4. Leveraging LLM: Implementing an Advanced AI Chatbot for Healthcare;International Journal of Innovative Science and Research Technology (IJISRT);2024-06-17

5. B-Cos Alignment for Inherently Interpretable CNNs and Vision Transformers;IEEE Transactions on Pattern Analysis and Machine Intelligence;2024-06