Bureau of National statistics
Agency for Strategic planning and reforms of the Republic of Kazakhstan
Synthetic data
Illustration of synthetic data concept
What is synthetic data?

Synthetic data - data artificially created through algorithms based on actual data and takes into account their patterns and distribution, but does not reveal the privacy of the data. They are fictitious indicators, the task of which is to demonstrate the possible structure of real data.

Why do we form and place them?

Synthetic data - formed and applied for the purpose of experimental testing of hypotheses and concepts without involving real data, and can also be used to test systems, conduct dataton events in order to complete tasks for participants in various analytical cases. The placement of synthetically generated data is due to the desire to provide end users with a visual representation of the structure of real datasets.

How to get more information?

Samples of synthetic datasets can be downloaded below. For more information on synthetic data, it is recommended to send a formal request to the Bureau for National Statistics (BNS). To understand the technical component of synthetic data, we suggest that you read the UN guidance on compiling synthetic data at the following link https://unece.org/statistics/publications/synthetic-data-official-statistics-starter-guide

Samples of synthetic datasets