Author:
Zakershahrak Mehrdad, Ghodratnama Samira
Publisher:
Springer Nature Singapore
References: 12 articles.
1. Rahman, M.W.U., et al.: Quantized transformer language model implementations on edge devices, arXiv preprint arXiv:2310.03971 (2023)
2. Beheshti, A., et al.: ProcessGPT: transforming business process management with generative artificial intelligence, arXiv preprint arXiv:2306.01771 (2023)
3. Reidy, B., Mohammadi, M., Elbtity, M., Smith, H., Zand, R.: Work in progress: real-time transformer inference on edge AI accelerators. In: IEEE 29th Real-Time and Embedded Technology and Applications Symposium (RTAS), pp. 341–344. IEEE (2023)
4. Reidy, B.C., Mohammadi, M., Elbtity, M.E., Zand, R.: Efficient deployment of transformer models on edge TPU accelerators: a real system evaluation. In: Architecture and System Support for Transformer Models (ASSYST @ ISCA 2023) (2023)
5. Nag, S., Datta, G., Kundu, S., Chandrachoodan, N., Beerel, P.A.: ViTA: a vision transformer inference accelerator for edge applications, arXiv preprint arXiv:2302.09108 (2023)