Saturday, September 13, 2025
No Result
View All Result
Eltaller Digital
  • Home
  • Latest
  • AI
  • Technology
  • Apple
  • Gadgets
  • Finance & Insurance
  • Deals
  • Automobile
  • Best AI Tools
  • Gaming
  • Home
  • Latest
  • AI
  • Technology
  • Apple
  • Gadgets
  • Finance & Insurance
  • Deals
  • Automobile
  • Best AI Tools
  • Gaming
No Result
View All Result
Eltaller Digital
No Result
View All Result
Home Artificial Intelligence

Practical Guide to Generating Synthetic Data

December 18, 2024
in Artificial Intelligence
Reading Time: 3 mins read
0 0
A A
0
Practical Guide to Generating Synthetic Data
Share on FacebookShare on Twitter



Here’s a rewritten version of the article in a more easily understandable style:

—

The quality of synthetic data largely depends on the model used to generate it and the representativeness and quality of the original data. While most data analysts are familiar with the importance of original data quality, the quality of the model used to generate synthetic data deserves more attention.

Figure 1. The process of generating synthetic data with quality evaluation.

When discussing models, we’re not only talking about the algorithm itself but the entire process that helps create high-quality synthetic data. This involves some additional verification steps, like comparing model outcomes with real-world data. Figure 1 shows this process, and Figure 2 illustrates how it’s practically implemented using SAS Studio on the SAS Viya platform.

Figure 2. An example of implementing a GAN generator on the SAS Viya platform using SAS nodes available on Github.

SMOTE Model

When it comes to generating synthetic data, two popular techniques are used to address specific issues in real data. The first is SMOTE (Synthetic Minority Oversampling Technique), introduced in 2002 by Nitesh V. Chawla and others. This oversampling technique helps solve the problem of imbalanced datasets by selecting a sample and its nearest neighbors from the same group and creating new synthetic observations through interpolation. Figure 3 illustrates the basic idea of SMOTE.

Figure 3. The concept of the SMOTE method.

GAN Networks

Another versatile method involves GANs (Generative Adversarial Networks), which utilize generative AI to create synthetic data. Introduced by Ian Goodfellow and others in 2014, GANs were initially successful in image processing but have since been adapted for creating synthetic data in table format.

The CPCTGAN model (Correlation-Preserving Conditional Tabular GAN) was developed to tackle challenges in processing tabular data, such as handling both discrete and continuous variables and maintaining correlation between variables. This approach focuses on analyzing distributions and correlations to ensure synthetic data closely resembles real data.

Learning Process

The core idea of GAN-based models is fascinating because it involves a game theory concept with two players: the Generator and the Discriminator. The Generator produces synthetic data based on random input, aiming to mimic real data, while the Discriminator evaluates whether the data is real or synthetic. Training the Generator relies on the Discriminator’s errors, leading to a balance between the two. Figure 4 shows the operational process of this interaction.

Figure 4. How GAN-based synthetic data generators work.

Synthetic Data Without Coding

Both SMOTE and CPCTGAN models are available on the SAS Viya platform, making it easier to use them without extensive coding. SAS provides ready-to-use nodes in SAS Studio, allowing users to build data flows in a low-code/no-code environment. These nodes and instructions are available on Github.

Figure 5. An example of a data set generated using GAN-based methods.

References:
1. Nitesh V. Chawla et al. (2002). “SMOTE: Synthetic Minority Over-sampling Technique.” Journal of Artificial Intelligence Research 16:321–357.
2. Ian Goodfellow et al. (2014). Generative adversarial nets. Advances in neural information processing systems, 27.

—



Source link

Related

Tags: datageneratingGuidePracticalsynthetic
Previous Post

3 Improvements in the Oppo Find X8 Pro’s Camera System

Next Post

Fortnite X Skibidi Toilet: Complete Guide – PlayerAuctions Blog

Related Posts

Artificial Intelligence

MLCommons: Benchmarking Machine Learning for a Better World

September 7, 2025
Artificial Intelligence

Generative Video AI: Creating Viral Videos with One Click

September 7, 2025
Artificial Intelligence

Realtime APIs: The Next Transformational Leap for AI Agents

September 7, 2025
Artificial Intelligence

AI in Cyber Threat Simulation: Outwitting Hackers with Bots

September 7, 2025
Artificial Intelligence

Responsible AI: How to Build Ethics into Intelligent Systems

September 7, 2025
Artificial Intelligence

Relevance AI & Autonomous Teams: Streamlining Work with AI

September 7, 2025
Next Post
Fortnite X Skibidi Toilet: Complete Guide – PlayerAuctions Blog

Fortnite X Skibidi Toilet: Complete Guide - PlayerAuctions Blog

Amazon’s top-selling router potential ban due to Chinese espionage fears

Amazon's top-selling router potential ban due to Chinese espionage fears

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

  • Trending
  • Comments
  • Latest
Get Your Steam Deck Payment Plan – Easy Monthly Options

Get Your Steam Deck Payment Plan – Easy Monthly Options

December 21, 2024
Will AI Take Over the World? How Close Is AI to World Domination?

Will AI Take Over the World? How Close Is AI to World Domination?

December 21, 2024
Installing the Nothing AI Gallery App on Any Nothing Device

Installing the Nothing AI Gallery App on Any Nothing Device

December 14, 2024
Applying Quartz Filters to Images in macOS Preview

Applying Quartz Filters to Images in macOS Preview

December 19, 2024
The Best 10 Luxury Perfumes for Women in 2025

The Best 10 Luxury Perfumes for Women in 2025

December 28, 2024
Bridging Knowledge Gaps with AI-Powered Contextual Search

Bridging Knowledge Gaps with AI-Powered Contextual Search

December 19, 2024

MLCommons: Benchmarking Machine Learning for a Better World

September 7, 2025

Generative Video AI: Creating Viral Videos with One Click

September 7, 2025

Realtime APIs: The Next Transformational Leap for AI Agents

September 7, 2025

AI in Cyber Threat Simulation: Outwitting Hackers with Bots

September 7, 2025

Responsible AI: How to Build Ethics into Intelligent Systems

September 7, 2025

Relevance AI & Autonomous Teams: Streamlining Work with AI

September 7, 2025
Eltaller Digital

Stay updated with Eltaller Digital – delivering the latest tech news, AI advancements, gadget reviews, and global updates. Explore the digital world with us today!

Categories

  • Apple
  • Artificial Intelligence
  • Automobile
  • Best AI Tools
  • Deals
  • Finance & Insurance
  • Gadgets
  • Gaming
  • Latest
  • Technology

Latest Updates

  • MLCommons: Benchmarking Machine Learning for a Better World
  • Generative Video AI: Creating Viral Videos with One Click
  • Realtime APIs: The Next Transformational Leap for AI Agents
  • About Us
  • Advertise With Us
  • Disclaimer
  • Privacy Policy
  • DMCA
  • Cookie Privacy Policy
  • Terms and Conditions
  • Contact Us

Copyright © 2024 Eltaller Digital.
Eltaller Digital is not responsible for the content of external sites.

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In
Manage Consent
To provide the best experiences, we use technologies like cookies to store and/or access device information. Consenting to these technologies will allow us to process data such as browsing behavior or unique IDs on this site. Not consenting or withdrawing consent, may adversely affect certain features and functions.
Functional Always active
The technical storage or access is strictly necessary for the legitimate purpose of enabling the use of a specific service explicitly requested by the subscriber or user, or for the sole purpose of carrying out the transmission of a communication over an electronic communications network.
Preferences
The technical storage or access is necessary for the legitimate purpose of storing preferences that are not requested by the subscriber or user.
Statistics
The technical storage or access that is used exclusively for statistical purposes. The technical storage or access that is used exclusively for anonymous statistical purposes. Without a subpoena, voluntary compliance on the part of your Internet Service Provider, or additional records from a third party, information stored or retrieved for this purpose alone cannot usually be used to identify you.
Marketing
The technical storage or access is required to create user profiles to send advertising, or to track the user on a website or across several websites for similar marketing purposes.
Manage options Manage services Manage {vendor_count} vendors Read more about these purposes
View preferences
{title} {title} {title}
No Result
View All Result
  • Home
  • Latest
  • AI
  • Technology
  • Apple
  • Gadgets
  • Finance & Insurance
  • Deals
  • Automobile
  • Best AI Tools
  • Gaming

Copyright © 2024 Eltaller Digital.
Eltaller Digital is not responsible for the content of external sites.