DCT-Former: Efficient Self-Attention with Discrete Cosine Transform-Reference-Cited by-同舟云学术

DCT-Former: Efficient Self-Attention with Discrete Cosine Transform

Published:2023-02-07 Issue:3 Volume:94 Page:
ISSN:0885-7474
Container-title:Journal of Scientific Computing
language:en
Short-container-title:J Sci Comput

Author:

Scribano Carmelo^ORCID,Franchini Giorgia,Prato Marco,Bertogna Marko

Publisher

Springer Science and Business Media LLC

Subject

Computational Theory and Mathematics,General Engineering,Theoretical Computer Science,Software,Applied Mathematics,Computational Mathematics,Numerical Analysis

Link

https://link.springer.com/content/pdf/10.1007/s10915-023-02125-5.pdf

Reference60 articles.

1. Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, Ł., Polosukhin, I.: Attention is all you need. In: Advances in Neural Information Processing Systems, pp. 5998–6008 (2017)

2. Devlin, J., Chang, M.-W., Lee, K., Toutanova, K.: Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)

3. Radford, A., Wu, J., Child, R., Luan, D., Amodei, D., Sutskever, I., et al.: Language models are unsupervised multitask learners. OpenAI blog 1(8), 9 (2019)

4. Brown, T., Mann, B., Ryder, N., Subbiah, M., Kaplan, J.D., Dhariwal, P., Neelakantan, A., Shyam, P., Sastry, G., Askell, A., et al.: Language models are few-shot learners. Adv. Neural. Inf. Process. Syst. 33, 1877–1901 (2020)

5. Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., Dehghani, M., Minderer, M., Heigold, G., Gelly, S., et al.: An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 (2020)

Cited by 10 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Early Warning System for Scientific Research Integrity based on Abnormal Data Recognition Algorithm;2024 5th International Conference on Image Processing and Capsule Networks (ICIPCN);2024-07-03

2. Frequency-Oriented Transformer for Remote Sensing Image Dehazing;Sensors;2024-06-19

3. GreenNAS: A Green Approach to the Hyperparameters Tuning in Deep Learning;Mathematics;2024-03-14

4. Video Compression Prototype for Autonomous Vehicles;Smart Cities;2024-03-08

5. DCT-SwinGAN: Leveraging DCT and Swin Transformer for Face Synthesis from Sketch and Thermal Domains;Communications in Computer and Information Science;2024