Affiliation:
1. University of Tirana, Albania
Abstract
This chapter offers a comprehensive examination of contemporary practices in synthetic data generation. Its primary objective is to analyze and synthesize the methodologies, techniques, applications, and challenges associated with synthetic data across diverse scientific disciplines. The motivation behind the use of synthetic data stems from data privacy concerns, limitations in data availability, and the necessity for diverse, representative datasets. This chapter delves into various synthetic data generation methods, such as statistical modeling, generative adversarial networks (GANs), simulation-based techniques, and data envelopment analysis (DEA). It also scrutinizes the evaluation metrics for assessing synthetic data quality and privacy preservation. The chapter highlights applications in healthcare, finance, social sciences, and computer vision, and discusses emerging trends, including deep learning integration and domain adaptation. Researchers, practitioners, and policymakers will gain valuable insights into the state-of-the-art in synthetic data generation.
Reference83 articles.
1. StyleGANs and Transfer Learning for Generating Synthetic Images in Industrial Applications
2. How faithful is your synthetic data? sample-level metrics for evaluating and auditing generative models.;A.Alaa;International Conference on Machine Learning,2022
3. Augmented Reality Meets Computer Vision: Efficient Data Generation for Urban Driving Scenes
4. Synthetic Sensor Data for Human Activity Recognition
5. Almahairi, A., Rajeshwar, S., Sordoni, A., Bachman, P., & Courville, A. (2018, July). Augmented cyclegan: Learning many-to-many mappings from unpaired data. In International conference on machine learning (pp. 195-204). PMLR.