← Back to topics
4 model-release OpenAI single-source 3 articles Β· 3 sources

Where's the raccoon with the ham radio? (ChatGPT Images 2.0)

OpenAI's GPT-Image-2 tops image generation leaderboards with major gains in text rendering and resolution.

Where's the raccoon with the ham radio? (ChatGPT Images 2.0)
via Simon Willison

πŸ“° The headlines

πŸ” Let's dive in

OpenAI has released GPT-Image-2 (also called ChatGPT Images 2.0), an image generation model available through ChatGPT, the API, and Codex. Sam Altman is pitching the leap from the previous version as equivalent to the jump from GPT-3 to GPT-5 β€” a bold claim, but early benchmarks back up the ambition: the model holds a +242 Elo lead on Arena's text-to-image leaderboard.

The most concrete improvement is text rendering, historically a weak spot for image generators. The model also supports resolutions up to 3840Γ—2160 and an outputQuality setting, with API pricing at $30 per million output tokens.

Design tools including Figma, Canva, and Adobe Firefly are already integrating it, pointing to a clear push into professional productivity workflows. In stress tests with complex, Where's Waldo-style scenes, it outperformed Google's Gemini and earlier OpenAI versions β€” though it still can't reliably solve the visual puzzles it generates.

Synthesized across 3 sources Β· regenerate

βš–οΈ Pros & cons

Pros

  • Best-in-class text rendering in generated images
  • Supports up to 4K resolution output
  • Strong leaderboard performance over Gemini and prior versions
  • Broad integration with Figma, Canva, and Firefly

Cons

  • Still struggles to solve its own generated visual puzzles
  • $30 per million output tokens may limit high-volume use

πŸ•° The timeline Β· 3 sources

Latent Space (Swyx) analyst Β· 10h ago Β· 4/5

[AINews] OpenAI launches GPT-Image-2 β†—

OpenAI released GPT-Image-2, an image generation model available across ChatGPT, Codex, and API with enhanced text rendering, layout fidelity, editing, and thinking variants. The model ranks first on Arena's image generation leaderboards with a +242 Elo lead on text-to-image tasks and is being integrated by tools including Figma, Canva, and Firefly. The release signals renewed focus on image generation as a productivity surface for UI mockups, documentation, and design workflows.

this is not merely prettier art, but a more usable model for UI, mockups, documentation, productivity visuals, and reference-driven design loops
β€” Latent Space (Swyx)
open-weight models are now credible enough that infra, harness, and deployment quality determine a lot of real-world value
β€” Latent Space (Swyx)
Simon Willison analyst Β· 14h ago Β· 3/5

Where's the raccoon with the ham radio? (ChatGPT Images 2.0) β†—

OpenAI released ChatGPT Images 2.0 today, with Sam Altman claiming the improvement from the previous version is equivalent to the jump from GPT-3 to GPT-5. The new model supports higher resolutions up to 3840x2160 pixels and an outputQuality setting, with pricing at $30 per million output tokens. In testing with complex Where's Waldo-style image generation, the new model outperformed Google's Gemini and earlier versions, though it still struggles with reliably solving its own visual puzzles.

the leap from gpt-image-1 to gpt-image-2 was equivalent to jumping from GPT-3 to GPT-5
β€” Sam Altman
Looks like we definitely can't trust these models to usefully solve their own puzzles!
β€” Simon Willison
TechCrunch AI reporting Β· 16h ago Β· 3/5

ChatGPT’s new Images 2.0 model is surprisingly good at generating text β†—

OpenAI has released ChatGPT Images 2.0, its newest image-generation model. The model demonstrates significant improvements in generating images with readable text, a historically challenging task for image generators. The release reflects the rapid advancement of AI capabilities in visual content creation.

The diffusion models are reconstructing a given input. We can assume writings on an image are a very, very tiny part, so the image generator learns the patterns that cover more of these pixels.
β€” Asmelash Teka Hadgu
Images 2.0 brings an unprecedented level of specificity and fidelity to image creation, able to follow instructions, preserve requested details, and render the fine-grained elements that often break image models: small text, iconography, UI elements, dense compositions, and subtle stylistic constraints.
β€” OpenAI

🏷 Tags

creative-designsoftware-development ChatGPTGeminiClaudeCursorHugging Face

πŸ”§ Debug

Cluster ID
45a2c9145b
Importance (max)
4
Members
3
Sources
Simon Willison, TechCrunch AI, Latent Space (Swyx)
Earliest
2026-04-21T19:00:00.000Z
Latest
2026-04-22T00:23:52.000Z
Lead URL
https://simonwillison.net/2026/Apr/21/gpt-image-2