
On LM Arena, the next-best model, Google's Nano-banana-2 (Gemini 3.1 Flash Image), trailed by 242 Elo points at launch — roughly an 80% win rate in head-to-head blind testing. Other public boards show a tighter spread, but the directional verdict is the same: it's not a step. It's a gulf.


"The architecture was revamped from scratch." — OpenAI's gpt-image-2 research team


"The most interesting systemic implication is that image generation is starting to function as a frontend for coding agents." — Neurohive, on developer reactions to gpt-image-2
