Affiliation:
1. School of Electrical Engneering and Robotics, Queensland University of Technology, Brisbane, Australia
2. School of Engineering, Brown University, Providence, United States
Abstract
The incorporation of physical information in machine learning frameworks is opening and transforming many application domains. Here the learning process is augmented through the induction of fundamental knowledge and governing physical laws. In this work, we explore their utility for computer vision tasks in interpreting and understanding visual data. We present a systematic literature review of more than 250 papers on formulation and approaches to computer vision tasks guided by physical laws. We begin by decomposing the popular computer vision pipeline into a taxonomy of stages and investigate approaches to incorporate governing physical equations in each stage. Existing approaches are analyzed in terms of modeling and formulation of governing physical processes, including modifying input data (observation bias), network architectures (inductive bias), and training losses (learning bias). The taxonomy offers a unified view of the application of the physics-informed capability, highlighting where physics-informed learning has been conducted and where the gaps and opportunities are. Finally, we highlight open problems and challenges to inform future research. While still in its early days, the study of physics-informed computer vision has the promise to develop better computer vision models that can improve physical plausibility, accuracy, data efficiency, and generalization in increasingly realistic applications.
Publisher
Association for Computing Machinery (ACM)
Reference263 articles.
1. Real-world super-resolution of face-images from surveillance cameras;Aakerberg Andreas;IET Image Processing,2022
2. Martin Alnæs, Jan Blechta, Johan Hake, August Johansson, Benjamin Kehlet, Anders Logg, Chris Richardson, Johannes Ring, Marie E Rognes, and Garth N Wells. 2015. The FEniCS project version 1.5. Archive of Numerical Software 3, 100 (2015).
3. Applications of Generative Adversarial Networks (GANs): An Updated Review
4. Physics-Informed Attention Temporal Convolutional Network for EEG-Based Motor Imagery Classification;Altaheri Hamdi;IEEE Transactions on Industrial Informatics,2022
5. A Deep Journey into Super-resolution