1. Sadia Afroze , Md Rajib Hossain , and Mohammed Moshiul Hoque . 2022. DeepFocus: A visual focus of attention detection framework using deep learning in multi-object scenarios . Journal of King Saud University-Computer and Information Sciences ( 2022 ). Sadia Afroze, Md Rajib Hossain, and Mohammed Moshiul Hoque. 2022. DeepFocus: A visual focus of attention detection framework using deep learning in multi-object scenarios. Journal of King Saud University-Computer and Information Sciences (2022).
2. Karan Ahuja Andy Kong Mayank Goel and Chris Harrison. 2020. Direction-of-Voice (DoV) Estimation for Intuitive Speech Interaction with Smart Devices Ecosystems.. In UIST. 1121–1131. Karan Ahuja Andy Kong Mayank Goel and Chris Harrison. 2020. Direction-of-Voice (DoV) Estimation for Intuitive Speech Interaction with Smart Devices Ecosystems.. In UIST. 1121–1131.
3. Proxemic interaction
4. Jim Barnett . 2017. Introduction to the Multimodal Architecture Specification . Springer International Publishing , Cham , 3–17. https://doi.org/10.1007/978-3-319-42816-1_1 10.1007/978-3-319-42816-1_1 Jim Barnett. 2017. Introduction to the Multimodal Architecture Specification. Springer International Publishing, Cham, 3–17. https://doi.org/10.1007/978-3-319-42816-1_1
5. A study on automatic speech recognition;Benkerzaz Saliha;Journal of Information Technology Review,2019