1. Vaswani, A., Shazeer, N., Parmar, N., et al. (2017) Attention Is All You Need. Proceedings of the 31st International Conference on Neural Information Processing Systems, December 2017, 6000-6010. https://dl.acm.org/doi/10.5555/3295222.3295349
2. Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
3. Video Swin Transformer
4. Jadon, A., Omama, M., Varshney, A., et al. (2019) Firenet: A Specialized Lightweight Fire & Smoke Detection Model for Real-Time Iot Applications. https://arxiv.org/abs/1905.11922
5. FireNet-v2: Improved Lightweight Fire Detection Model for Real-Time IoT Applications