Affiliation:
1. Al Farabi Kazakh National University; Institute of Information and Computing Technologies
2. Satbayev University
3. Al Farabi Kazakh National University
Abstract
Currently, the theory and methods of machine learning (ML) are rapidly developing and are increasingly used in various fields of science and technology, in particular in manufacturing, education and medicine. Weakly supervised learning is a subset of machine learning research that aims to develop models and methods for analyzing various types of information. When formulating a weakly supervised learning problem, it is assumed that some objects in the model are not defined correctly. This inaccuracy can be understood in different ways. Weakly supervised learning is a type of machine learning method in which a model is trained using incomplete, inaccurate, or imprecise observation signals rather than using fully validated data. Weakly supervised learning often occurs in real-world problems for various reasons. This may be due to the high cost of the data labeling process, low sensor accuracy, lack of expert experience, or human error. For example, labeling of poor control is carried out in cases obtained by crowdsourcing methods: for each object there is a set of different assessments, the quality of which depends on the skill of the performers. Another example is the problem of object detection in an image. Boundary lines are a common way to indicate the location and size of objects detected in an image in object detection tasks. The article presents an algorithm for solving a multi-objective weakly supervised regression problem using the Wasserstein metric, various regularizations and a co-association matrix as a similarity matrix. The work also improved the algorithm for calculating the weighted average co-association matrix. We compare the proposed algorithm with existing supervised learning and unsupervised learning algorithms on synthetic and real data.
Publisher
Kazakh-British Technical University
Reference20 articles.
1. Qin Q., Zhou X., Jiang Y. (2021) Prognosis Prediction of Stroke based on Machine Learning and Explanation Model, International Journal of Computers, Communications and Control, vol. 6, pp. 1–13.
2. Merembayev T., Amirgaliyeva S. & Kozhaly K. (2021) Using item response theory in machine learning algorithms for student response data. In 2021 IEEE International Conference on Smart Information Systems and Technologies (SIST), pp. 1–5. IEEE
3. Shahana I., Tri N., Xiao F.][ Fu D.Y., Chen M.F., Zhang M., Fatahalian K., Ré C. The Details Matter: Preventing Class Collapse in Supervised Contrastive Learning. Computer Science Math. Forum 2022.
4. Xiao Y., Yin Z., Liu B. (2020) A similarity-based two-view multiple instance learning method for classification. Knowl.-Based Syst., pp. 201–202.
5. Qian X., Lin C., Chen Z., Wang W. (2024) SAM-Induced Pseudo Fully Supervised Learning for Weakly Supervised Object Detection in Remote Sensing Images. Remote Sens., vol. 16, p. 1532.