Affiliation:
1. School of Statistics and Data Science, Nanjing Audit University, Nanjing 211815, China
Abstract
Most existing data synthesis methods are designed to tackle problems with dataset imbalance, data anonymization, and an insufficient sample size. There is a lack of effective synthesis methods in cases where the actual datasets have a limited number of data points but a large number of features and unknown noise. Thus, in this paper we propose a data synthesis method named Adaptive Subspace Interpolation for Data Synthesis (ASIDS). The idea is to divide the original data feature space into several subspaces with an equal number of data points, and then perform interpolation on the data points in the adjacent subspaces. This method can adaptively adjust the sample size of the synthetic dataset that contains unknown noise, and the generated sample data typically contain minimal errors. Moreover, it adjusts the feature composition of the data points, which can significantly reduce the proportion of the data points with large fitting errors. Furthermore, the hyperparameters of this method have an intuitive interpretation and usually require little calibration. Analysis results obtained using simulated original data and benchmark original datasets demonstrate that ASIDS is a robust and stable method for data synthesis.
Funder
National Social Science Fund of China
Subject
General Mathematics,Engineering (miscellaneous),Computer Science (miscellaneous)
Reference30 articles.
1. Enhanced data security of communication system using combined encryption and steganography;ALRikabi;iJIM,2021
2. Kollias, D. (2022). Computer Vision—ECCV 2022 Workshops, Springer.
3. Machine learning algorithms—A review;Mahesh;Int. J. Sci. Res. (IJSR),2020
4. Lepot, M., Aubin, J.B., and Clemens, F.H.L.R. (2017). Interpolation in time series: An introductive overview of existing methods, their performance criteria and uncertainty assessment. Water, 9.
5. A review of medical image data augmentation techniques for deep learning applications;Chlap;J. Med. Imaging Radiat. Oncol.,2021