How does Spellbrush's AI generate character art?

Spellbrush uses Generative Adversarial Networks (GANs), which involve two neural networks: a generator that creates images and a discriminator that tries to tell real art from generated art. Through millions of training cycles on a dataset of art, the generator learns to produce high-quality images that are hard to distinguish from human-made illustrations.

How fast can Spellbrush's AI create character art?

Spellbrush's AI tool can generate a character illustration in under two seconds. This is significantly faster than a professional illustrator, who can take anywhere from two to fifteen hours for a single piece.

What are the challenges with AI training datasets for character art?

The available internet data, particularly for anime aesthetics, is skewed. Darker skin tones (less than 3% of illustrations) and male characters are significantly underrepresented relative to real-world demographics.

How does Spellbrush address dataset bias in AI art generation?

Spellbrush actively works to correct dataset bias by improving the generation of darker skin tones and male characters. This ensures their AI can create a more diverse range of characters that better reflects the real world, even if the initial dataset lacks this diversity.

Why did Spellbrush build their own supercomputer?

Training deep learning models, especially GANs, is computationally intensive and expensive. Cloud services like AWS are costly (an estimated $3,000-$4,000 per model), so Spellbrush built a custom, in-house supercomputer whose running cost is about $0.60 per hour.

What is the technical architecture behind Spellbrush's AI?

They utilize a custom language called Netgen for describing GAN architectures, which compiles into TensorFlow ops. These are packaged in Singularity containers and scheduled on their cluster using Slurm, with monitoring handled by Prometheus, Grafana, and TensorBoard.

Is Spellbrush hiring?

Yes, Spellbrush is a small, growing team and is actively looking to hire for several positions, including artists (2D animator, motion designer, real-time VFX artist) and an AI research intern.

Designing Characters with Deep Learning: Spellbrush (W18) - YC Gaming Tech Talks 2020

Y Combinator

Science & Technology | 5 min read | 11 min video

Dec 7, 2020|7,733 views|172|10

TL;DR

AI can generate anime characters in under two seconds that are hard to distinguish from human art, but training these models is expensive: each training run costs thousands of dollars on cloud hardware.

Key Insights

Spellbrush's AI can generate a character portrait in under two seconds, a task that would take a human illustrator 2 to 15 hours.

The company utilizes Generative Adversarial Networks (GANs), comprising a generator and a discriminator, to create art.

Publicly available internet images used for training are heavily skewed, with female characters outnumbering males 6:1 and darker skin tones representing less than 3% of illustrations.

Spellbrush has invested significant effort to improve the generation of darker skin tones and male characters to enhance representation not reflected in raw internet data.

Training a single AI model on cloud hardware costs $3,000-$4,000 and takes 7-10 days, which led Spellbrush to build its own supercomputer for cost efficiency.

Spellbrush is building the world's first AI-illustrated game and is actively hiring for various art and AI research roles.

AI generates professional-level anime art in seconds

Spellbrush, a Y Combinator company, is developing deep learning tools to revolutionize art creation in the gaming industry. Art production is a significant cost in game development, often consuming 50-70% of the total budget. The company addresses this by using AI to scale up art creation capabilities without requiring massive studio expansion. Their AI can generate character portraits in the anime style in under two seconds, a stark contrast to the 2 to 15 hours a professional illustrator might take. Furthermore, the AI can produce hundreds of characters in the time it would take a human to create just one. This capability allows for rapid iteration and extensive character variety, which would be prohibitively time-consuming and expensive with traditional methods. The quality is so high that it is on par with professional human artists, making it difficult to distinguish AI-generated art from human-created art, as demonstrated by a quiz where the AI's output was indistinguishable from that of popular Twitter artists. This technology has the potential to drastically reduce art production costs and timelines for game developers.

How Generative Adversarial Networks (GANs) create art

The core technology behind Spellbrush's character generation is Generative Adversarial Networks (GANs). A GAN consists of two neural networks: a generator and a discriminator. The generator's role is to learn how to create art, aiming to produce outputs that mimic a given dataset. The discriminator's role is to distinguish between real art from the dataset and fake art produced by the generator. These two networks are trained in opposition: the generator tries to fool the discriminator, and the discriminator tries to accurately identify fakes. Through millions of training cycles, both networks improve. The generator learns to produce increasingly realistic images, while the discriminator becomes better at detecting subtle flaws. Crucially, the generator requires a 'latent space' or random noise input to produce varied outputs. By manipulating this noise, developers can control various aspects of the generated image, such as character expressions, colors, and even artistic style, tasks that would normally require significant manual effort from an artist.
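The adversarial loop described above can be illustrated with a deliberately tiny sketch. It is not Spellbrush's model: the "art" here is just numbers drawn from a normal distribution, the generator is a hypothetical linear map G(z) = a*z + b, and the discriminator is a logistic unit D(x) = sigmoid(w*x + c), with gradients written out by hand. All names and hyperparameters are assumptions chosen for illustration.

```python
import math
import random


def sigmoid(s: float) -> float:
    # Numerically stable logistic function.
    if s >= 0:
        return 1.0 / (1.0 + math.exp(-s))
    e = math.exp(s)
    return e / (1.0 + e)


def generate(gen_params, z: float) -> float:
    # Generator: maps latent noise z to a sample, G(z) = a*z + b.
    a, b = gen_params
    return a * z + b


def train_toy_gan(steps: int = 2000, lr: float = 0.01, seed: int = 0):
    rng = random.Random(seed)
    a, b = 1.0, 0.0  # generator parameters
    w, c = 0.0, 0.0  # discriminator parameters
    for _ in range(steps):
        x_real = rng.gauss(4.0, 0.5)  # stand-in for a real image
        z = rng.gauss(0.0, 1.0)       # latent noise input
        x_fake = generate((a, b), z)

        # Discriminator step: minimize -log D(x_real) - log(1 - D(x_fake)).
        d_real = sigmoid(w * x_real + c)
        d_fake = sigmoid(w * x_fake + c)
        grad_w = (d_real - 1.0) * x_real + d_fake * x_fake
        grad_c = (d_real - 1.0) + d_fake
        w -= lr * grad_w
        c -= lr * grad_c

        # Generator step: minimize -log D(G(z)), i.e. try to fool D.
        d_fake = sigmoid(w * x_fake + c)
        grad_a = (d_fake - 1.0) * w * z
        grad_b = (d_fake - 1.0) * w
        a -= lr * grad_a
        b -= lr * grad_b
    return (a, b), (w, c)


if __name__ == "__main__":
    gen_params, disc_params = train_toy_gan()
    # Different latent inputs give different outputs from the same generator.
    for z in (-1.0, 0.0, 1.0):
        print(z, generate(gen_params, z))
```

The final loop shows the role of the latent input in miniature: the trained generator is fixed, and only the noise value changes what it produces. Real image GANs follow the same alternating update pattern, with deep networks and high-dimensional latent vectors in place of these scalars.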

Addressing bias and improving representation in training data

Spellbrush trains its GANs on publicly available images scraped from the internet, initially focusing on the anime aesthetic because of the abundance of available data (around 10 million images). However, they discovered significant biases in this data. The dataset is heavily skewed towards female characters, who outnumber male characters by a ratio of approximately 6:1. Additionally, darker skin tones and people of color are underrepresented, making up less than 3% of the illustrations. Recognizing that these percentages do not reflect real-world demographics and that representation matters, Spellbrush has dedicated considerable effort to mitigating these biases. They have enhanced the AI's ability to generate darker skin tones at a higher frequency than is present in the raw data and improved the generation of male characters. This correction aims to produce more diverse and representative character options. Part of the imbalance appears to originate upstream: illustrators in some regions may shy away from drawing male characters because they draw lower engagement on social media than female characters.
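The summary does not say how Spellbrush performs this correction. One generic way to counter a skewed dataset is inverse-frequency resampling, sketched below as an assumption rather than as their method. The function name and the labels are hypothetical; the 6:1 split mirrors the ratio quoted above.

```python
import random
from collections import Counter


def balanced_sampling_weights(labels):
    """Return one sampling weight per example so each class is drawn equally often.

    Every class receives a total probability of 1 / n_classes, split evenly
    among the examples that carry that label.
    """
    counts = Counter(labels)
    n_classes = len(counts)
    return [1.0 / (n_classes * counts[label]) for label in labels]


if __name__ == "__main__":
    # Toy dataset with the 6:1 female-to-male skew described in the text.
    labels = ["female"] * 6 + ["male"]
    weights = balanced_sampling_weights(labels)

    rng = random.Random(0)
    batch = rng.choices(labels, weights=weights, k=1000)
    print(Counter(batch))  # roughly even counts for the two classes
```

With these weights the single male example is drawn about as often as all six female examples combined. Resampling alone repeats the same rare images, so in practice it would likely need to be paired with other measures.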

The high cost of training AI models

Training deep learning models, especially for complex visual tasks like character generation, is computationally intensive and expensive. Spellbrush found that relying solely on cloud services like AWS could be prohibitively costly. A comparable machine on AWS (p3.16xlarge) costs around $24 per hour on demand, or about $10 per hour using spot instances. Since training a model takes approximately 7 to 10 days, each training run costs an estimated $3,000 to $4,000. This expense led the startup to build its own mini-supercomputer in-house. This DIY supercomputer, housed in a 42U rack, features over 200 CPU cores, more than 20 high-end GPUs (Titan RTX), 100-gigabit Ethernet, and substantial storage. The total running cost for this cluster is estimated at around 60 cents per hour, a massive saving compared to cloud solutions.
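The arithmetic behind these figures can be checked with a short sketch. It multiplies the hourly rates quoted above by a 7 to 10 day run; the function name is hypothetical, and the in-house figure covers running cost only, since the summary does not state what the hardware cost to buy.

```python
HOURS_PER_DAY = 24


def run_cost(rate_per_hour: float, days: float) -> float:
    # Cost of one uninterrupted training run at a flat hourly rate.
    return rate_per_hour * days * HOURS_PER_DAY


if __name__ == "__main__":
    rates = [
        ("AWS on-demand", 24.0),
        ("AWS spot", 10.0),
        ("In-house cluster (running cost only)", 0.60),
    ]
    for name, rate in rates:
        low, high = run_cost(rate, 7), run_cost(rate, 10)
        print(f"{name}: ${low:,.0f} to ${high:,.0f} per run")
```

At these rates a run works out to about $4,032 to $5,760 on demand and $1,680 to $2,400 on spot instances. The quoted $3,000 to $4,000 sits between those two estimates; the summary does not say how it was computed. Note also that the $0.60 figure is for the whole cluster, while the AWS rates are for a single machine, so the comparison is approximate.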

Spellbrush's custom architecture and research directions

To manage their custom-built hardware and streamline the AI development process, Spellbrush has developed internal tools and workflows. They use a proprietary language called Netgen for quickly describing GAN architectures, which compiles down to low-level TensorFlow operations. These operations are then packaged into Singularity containers and scheduled onto their cluster using Slurm. Standard monitoring tools like Prometheus, Grafana, and TensorBoard track system performance and model training progress, including loss functions. Beyond character generation, Spellbrush is actively researching other areas to enhance the art pipeline. These include automated animation, tools to assist with 2D animation workflows (like Live2D and Spine), and super-resolution techniques for animation. This broad research agenda aims to provide a comprehensive suite of AI-powered tools for game art creation.
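To make the container-plus-scheduler step concrete, here is a minimal sketch that renders a Slurm batch script running a training command inside a Singularity image. Netgen is proprietary and not shown. The job name, image file, training command, and resource numbers are all hypothetical; only the `#SBATCH` directives and `singularity exec --nv` (which exposes NVIDIA GPUs to the container) are standard usage.

```python
def render_sbatch_script(
    job_name: str,
    image: str,
    command: str,
    gpus: int = 1,
    cpus: int = 8,
    time_limit: str = "10-00:00:00",  # Slurm format: days-hours:minutes:seconds
) -> str:
    """Build the text of a Slurm batch script for a containerized training job."""
    lines = [
        "#!/bin/bash",
        f"#SBATCH --job-name={job_name}",
        f"#SBATCH --gres=gpu:{gpus}",          # request GPUs on the node
        f"#SBATCH --cpus-per-task={cpus}",
        f"#SBATCH --time={time_limit}",
        f"#SBATCH --output={job_name}-%j.log",  # %j expands to the job ID
        "",
        # Run the training command inside the container with GPU access.
        f"singularity exec --nv {image} {command}",
    ]
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Hypothetical image and entry point, for illustration only.
    script = render_sbatch_script(
        job_name="gan-train",
        image="train.sif",
        command="python train.py --logdir runs/gan-train",
    )
    print(script)
```

The rendered text would be saved to a file and submitted with `sbatch`. Generating the script from code rather than writing it by hand makes it easier to launch many runs that differ only in a few parameters.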

Building the future: an AI-illustrated game and hiring needs

Leveraging their advanced AI technology, Spellbrush is currently developing what they claim will be the world's first AI-illustrated game. This ambitious project aims to showcase the full potential of their art generation and manipulation tools within a real-world application. The company is currently a small team of five people but is actively looking to expand. They are seeking to hire a sixth team member, specifically targeting individuals who resonate with their vision. Open roles include a 2D animator and motion designer, a real-time VFX artist, and an AI research intern for the upcoming winter. Interested candidates are encouraged to reach out via email at jobs@spellbrush.com, with the CEO available for further discussion in breakout rooms.

AI Character Design Workflow

Practical takeaways from this episode

Do This

Leverage AI for rapid character generation (under two seconds per character).

Utilize GANs with a generator and discriminator for training.

Control character output by manipulating latent space noise.

Address dataset bias to improve representation of darker skin tones and male characters.

Build in-house infrastructure for cost-effective model training.

Use custom languages like Netgen and tools like TensorFlow for GAN architectures.

Monitor training with Prometheus, Grafana, and TensorBoard.

Avoid This

Don't rely solely on human illustration for scaling content creation.

Don't ignore cost implications of cloud-based model training.

Don't overlook representation issues in training datasets.

AI Art Generation Speed Comparison

Data extracted from this episode

Method	Time per Character
AI Tool	Sub 2 seconds
Professional Illustrator	2-15 hours

Cloud vs. In-house GPU Training Costs

Data extracted from this episode

Platform	On-Demand Cost per Hour	Spot Instance Cost per Hour	Training Time per Model	Cost per Model
AWS p3.16xlarge	$24	$10 (approx.)	7-10 days	$3,000-$4,000
Spellbrush In-house (DIY Supercomputer)	$0.60 (total running cost)		7-10 days	Significantly less than cloud

Common Questions

What is Spellbrush?

Spellbrush is a startup developing deep learning tools specifically for artists. They use AI, particularly Generative Adversarial Networks (GANs), to create character illustrations rapidly, aiming to help scale art production in the game industry.

Topics

AI & Machine Learning, Technology & Innovation, Creativity & Media, AI in Gaming, Deep Learning, Generative Art, Character Design, Dataset Bias, Art Production Pipeline

Mentioned in this video

Companies

Twitter

Mentioned as a platform where popular artists share their work, contrasted with AI-generated art.

Y Combinator

The startup accelerator Spellbrush took part in (W18 batch) and the host of this tech talk.

Spellbrush

A company building deep learning tools for art and artists, specializing in character design with AI.

Software & Apps

Singularity Containers

A container technology used to package the compiled TensorFlow operations so they can be scheduled as workloads on the cluster.

Slurm

A workload manager used by Spellbrush to schedule jobs on their custom supercomputer cluster.

AWS

Mentioned as a cloud provider whose comparable machines (p3.16xlarge) are significantly more expensive for training models than Spellbrush's in-house solution.

Grafana

A data visualization tool used alongside Prometheus by Spellbrush for monitoring their systems.

waifulabs.com

A website where one of Spellbrush's older AI models for character customization is available online.

TensorFlow

A machine learning framework used as a backend for Spellbrush's internal language, Netgen, to compile GAN architectures.

Prometheus

A monitoring system used by Spellbrush for collecting and visualizing time-series data from their cluster.

TensorBoard

A visualization toolkit for TensorFlow used by Spellbrush to track model training, specifically loss functions.

Products

Titan RTX

High-end GPUs used in Spellbrush's custom supercomputer, capable of running demanding applications like Crysis.

Concepts

Generative Adversarial Networks

The core AI technology, GANs, used by Spellbrush, consisting of a generator and a discriminator network trained against each other.
