Eltaller Digital

Practical Guide to Generating Synthetic Data

December 18, 2024
in Artificial Intelligence

The quality of synthetic data depends mainly on two things: the model used to generate it, and how representative and clean the original data is. Most data analysts already know why the quality of the original data matters; the quality of the generating model deserves more attention.

Figure 1. The process of generating synthetic data with quality evaluation.

By "model" we mean not only the algorithm itself but the entire process that produces high-quality synthetic data. That process includes verification steps, such as comparing the generated data against the real data. Figure 1 shows this process, and Figure 2 shows how it is implemented in practice using SAS Studio on the SAS Viya platform.

Figure 2. An example of implementing a GAN generator on the SAS Viya platform using SAS nodes available on GitHub.

SMOTE Model

Two popular techniques for generating synthetic data each address a specific problem in real data. The first is SMOTE (Synthetic Minority Over-sampling Technique), introduced in 2002 by Nitesh V. Chawla and others [1]. It tackles imbalanced datasets by oversampling the minority class: it takes a minority-class sample, finds its nearest neighbors within the same class, and creates a new synthetic observation by interpolating between the sample and one of those neighbors. Figure 3 illustrates the basic idea of SMOTE.

Figure 3. The concept of the SMOTE method.
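
The interpolation step behind SMOTE can be sketched in a few lines of plain Python. This is a minimal illustration under stated assumptions, not the SAS node or the reference implementation: the function name `smote_sketch`, its parameters, and the toy data are made up for this example, and base samples are picked at random rather than processed one by one as in the original paper.

```python
import math
import random


def smote_sketch(minority, n_new, k=5, seed=0):
    """Create n_new synthetic points from a list of minority-class points.

    Hypothetical helper for illustration only. Each point is a list of floats.
    """
    if len(minority) < 2:
        raise ValueError("need at least two minority samples to interpolate")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)  # cannot use more neighbors than exist
    synthetic = []
    for _ in range(n_new):
        # 1. Pick a random minority-class sample.
        i = rng.randrange(len(minority))
        x = minority[i]
        # 2. Find its k nearest neighbors within the minority class.
        others = [p for j, p in enumerate(minority) if j != i]
        neighbors = sorted(others, key=lambda p: math.dist(x, p))[:k]
        # 3. Pick one neighbor and interpolate at a random position
        #    on the line segment between the sample and that neighbor.
        neighbor = rng.choice(neighbors)
        gap = rng.random()
        synthetic.append([xi + gap * (ni - xi) for xi, ni in zip(x, neighbor)])
    return synthetic


# Toy minority class with two numeric features (made-up values).
minority = [[1.0, 2.0], [1.5, 1.8], [2.0, 2.2], [1.2, 2.5]]
new_points = smote_sketch(minority, n_new=6, k=2)
for point in new_points:
    print([round(v, 3) for v in point])
```

Because every new point lies on a segment between two existing minority samples, the synthetic observations stay inside the region the minority class already occupies. This simple form of interpolation only makes sense for numeric features.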

GAN Networks

The second, more versatile method uses GANs (Generative Adversarial Networks), a family of generative neural network models. Introduced by Ian Goodfellow and others in 2014 [2], GANs first proved successful in image generation and have since been adapted to produce synthetic tabular data.

The CPCTGAN model (Correlation-Preserving Conditional Tabular GAN) was developed to tackle challenges in processing tabular data, such as handling both discrete and continuous variables and maintaining correlation between variables. This approach focuses on analyzing distributions and correlations to ensure synthetic data closely resembles real data.
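
One simple way to check whether synthetic data resembles real data is to compare per-column means and pairwise correlations. The sketch below is only an illustration of that kind of check, not how CPCTGAN works internally: the helper names `pearson` and `compare_tables`, the column names, and the values are all assumptions made for this example.

```python
import math
import statistics


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric lists."""
    mx, my = statistics.fmean(xs), statistics.fmean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)


def compare_tables(real, synthetic):
    """Report absolute differences in column means and pairwise correlations.

    Both arguments are dicts mapping a column name to a list of numbers.
    """
    report = {}
    cols = list(real)
    for c in cols:
        diff = abs(statistics.fmean(real[c]) - statistics.fmean(synthetic[c]))
        report[f"mean_diff[{c}]"] = diff
    for i, a in enumerate(cols):
        for b in cols[i + 1:]:
            diff = abs(pearson(real[a], real[b])
                       - pearson(synthetic[a], synthetic[b]))
            report[f"corr_diff[{a},{b}]"] = diff
    return report


# Made-up example tables with two numeric columns each.
real = {"age": [20, 30, 40, 50], "income": [20, 32, 41, 52]}
synthetic = {"age": [22, 29, 41, 48], "income": [21, 30, 43, 50]}

for name, value in compare_tables(real, synthetic).items():
    print(f"{name}: {value:.3f}")
```

Values close to zero suggest that the synthetic table reproduces the means and linear relationships of the real one. A fuller evaluation would also compare whole distributions and treat discrete columns separately.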

Learning Process

The core idea of GAN-based models comes from game theory: two players, the Generator and the Discriminator, compete against each other. The Generator turns random input into synthetic data that should mimic the real data, while the Discriminator tries to tell real records from synthetic ones. The Discriminator is trained to make fewer mistakes, and the Generator is trained to cause more of them. Ideally, training ends in a balance where the Discriminator can no longer distinguish synthetic data from real data. Figure 4 shows how the two interact.

Figure 4. How GAN-based synthetic data generators work.
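
The two competing objectives can be written down as loss functions. The sketch below is a minimal illustration, assuming the Discriminator outputs a probability that a record is real and assuming the commonly used non-saturating Generator loss; the function names and the probability values are made up for this example, and no actual network is trained.

```python
import math


def discriminator_loss(d_real, d_fake):
    """Binary cross-entropy for the Discriminator.

    d_real: Discriminator outputs for real records (should be near 1).
    d_fake: Discriminator outputs for synthetic records (should be near 0).
    """
    real_term = -sum(math.log(p) for p in d_real) / len(d_real)
    fake_term = -sum(math.log(1.0 - p) for p in d_fake) / len(d_fake)
    return real_term + fake_term


def generator_loss(d_fake):
    """Non-saturating Generator loss: low when synthetic records fool the Discriminator."""
    return -sum(math.log(p) for p in d_fake) / len(d_fake)


# Early in training: the Discriminator easily spots synthetic records.
early_real, early_fake = [0.9, 0.8], [0.1, 0.2]
# At the ideal balance: the Discriminator can only guess (0.5 everywhere).
balanced_real, balanced_fake = [0.5, 0.5], [0.5, 0.5]

print("early    D loss:", round(discriminator_loss(early_real, early_fake), 3))
print("early    G loss:", round(generator_loss(early_fake), 3))
# At the balance point the losses are 2*ln(2) and ln(2) respectively.
print("balanced D loss:", round(discriminator_loss(balanced_real, balanced_fake), 3))
print("balanced G loss:", round(generator_loss(balanced_fake), 3))
```

In a real GAN, each training step would update the Discriminator's weights to lower its loss and then update the Generator's weights to lower its own, which is what makes the Generator's progress depend on the Discriminator's errors.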

Synthetic Data Without Coding

Both SMOTE and CPCTGAN are available on the SAS Viya platform, so they can be used without extensive coding. SAS provides ready-to-use nodes in SAS Studio that let users build data flows in a low-code/no-code environment. The nodes and their instructions are available on GitHub.

Figure 5. An example of a data set generated using GAN-based methods.

References:
1. Nitesh V. Chawla et al. (2002). “SMOTE: Synthetic Minority Over-sampling Technique.” Journal of Artificial Intelligence Research, 16, 321–357.
2. Ian Goodfellow et al. (2014). “Generative Adversarial Nets.” Advances in Neural Information Processing Systems, 27.

Copyright © 2024 Eltaller Digital.
Eltaller Digital is not responsible for the content of external sites.