Affiliation:
1. Department of Computer Science and Engineering, Indian Institute of Information Technology Kottayam, Kottayam, Kerala, India, Kottayam, India
2. Department of Computer Science and Engineering, Indian Institute of Information Technology Kottayam, Kottayam, India
3. Computer Science and Engineering, Pondicherry University, Pondicherry, India
Abstract
From the innovation, Artificial Intelligence (AI) materialized as one of the noticeable research areas in various technologies and has almost expanded into every aspect of modern human life. However, nowadays, the development of AI is unpredictable with the stated values of those developing them; hence, the risk of misbehaving AI increases continuously. Therefore, there are uncertainties about indorsing that the development and deploying AI are favorable and not unfavorable to humankind. In addition, AI holds a black-box pattern, which results in a lack of understanding of how systems can work based on the raised concerns. From the above discussion, trustworthy AI is vital for the extensive adoption of AI in many applications, with strong attention to humankind and the need to focus on AI systems developing into the system outline at the time of system design. In this survey, we discuss compound materials on trustworthy AI and present state-of-the-art of trustworthy AI technologies, revealing new perspectives, bridging knowledge gaps, and paving the way for potential advances of robustness, and explainability rules which play a proactive role in designing AI systems. Systems that are reliable and secure and mimic human behaviour significantly impact the technological AI ecosystem. We provided various contemporary technologies to build explainability and robustness for AI-based solutions, so AI works safer and more trustworthy. Finally, we conclude our survey paper with high-end opportunities, challenges, and future research directions for trustworthy AI to investigate in the future.
Publisher
Association for Computing Machinery (ACM)
Reference163 articles.
1. Rishabh Agarwal, Levi Melnick, Nicholas Frosst, Xuezhou Zhang, Ben Lengerich, Rich Caruana, and Geoffrey E. Hinton. 2021. Neural Additive Models: Interpretable Machine Learning with Neural Nets. Adv. Neural Inf. Process. Syst. 6, NeurIPS (2021), 4699–4711.
2. Explainable artificial intelligence: an analytical review
3. Dzmitry Bahdanau, Kyung Hyun Cho, and Yoshua Bengio. 2015. Neural machine translation by jointly learning to align and translate. 3rd Int. Conf. Learn. Represent. ICLR 2015 - Conf. Track Proc. 1–15.
4. Network Dissection: Quantifying Interpretability of Deep Visual Representations
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献