Synthetic data
Concept
Data generated by AI models, discussed as a valuable tool for pre-training and data cleaning, especially for improving the quality of web data.
Mentioned in 2 videos
Videos Mentioning Synthetic data

Training Llama 2, 3 & 4: The Path to Open Source AGI — with Thomas Scialom of Meta AI
Latent Space
Data generated by AI models, discussed as a valuable tool for pre-training and data cleaning, especially for improving the quality of web data.

Deep Learning State of the Art (2019)
Lex Fridman
Artificially generated data used to train deep neural networks, described as effective for learning from limited real-world samples and for building robust models.