UCLA Synthetic Data Workshop

UCLA Synthetic Data Workshop



April 13th - 14th, 2023
We are pleased to announce the UCLA synthetic data workshop. This two-day workshop is hosted by the UCLA Department of Statistics and co-sponsored by IDRE, Science Hub for Humanity and Artificial Intelligence, ASA Section on Statistical Computing, NISS and NSF. The workshop is held at the UCLA Faculty Club, Morrison Room from April 13-14, 2023.


Synthetic data generation is an interdisciplinary field that is rapidly gaining momentum in both academia and industry. Synthetic data has become a valuable resource for the development of algorithmic procedures, fraud detection, spam identification, and the construction of AI-driven models in industries such as manufacturing and supply chain management. The benefits of synthetic data include cost savings, increased speed and agility, advanced intelligence, and state-of-the-art privacy. The Gartner report forecasts that synthetic data will largely replace real data for training machine/deep learning models by 2030, as depicted in the figure below. Additionally, synthetic data was named one of the top 10 game-changing innovations for 2022 by the MIT Technology Review. Synthetic data comes in various forms, such as image, text, graph, and tabular data, and has different names, such as simulated data, missing value imputation, and GAN, depending on the scientific community. Synthetic data generation is expected to become an essential component of next-generation machine learning workflows.


Despite the numerous successful applications of synthetic data, its scientific foundation, such as the tradeoff among fidelity, utility, and privacy, is still underdeveloped. Moreover, industrial standards for generating and utilizing synthetic data are not yet fully developed. Additionally, the privacy law on synthetic data is still in its infancy. Therefore, the purpose of this workshop is to bring together a community of synthetic data researchers from statistics, machine learning, and mathematics, policymakers, and industrial partners to collaborate on developing the theory, methodology, and algorithms necessary for producing synthetic benchmark datasets and algorithms.


All events will be held at UCLA faculty club, Morrison Room.

Day 1: April 13, Thursday

Day 2: April 14, Friday


It is strongly encouraged to upload your poster in the registration link below (deadline is April 1st). You need to bring posters to the conference site to setup. The poster exhibit time is from 9:40-3:00 either Thursday or Friday.

Travel Support

Travel support is available for junior participants (who received PhD degree after 2018), invited speakers and panel discussants. Registration fee will be reimbursed for participants that receive travel support.

Registration (Deadline April 1st, 2023).

CLICK HERE to register for the April 13th - 14th, 2023 UCLA synthetic data workshop.
CLICK HERE to pay the registration fee.

PLEASE NOTE: Submitting the above Google form DOES NOT guarantee a place in the workshop until the registration fee is paid.


If you need accommodation, nearby options include the following local hotels.



For more information contact Prof. Guang Cheng.