OpenAI has released a major upgrade to ChatGPT's image generation capabilities, delivering photorealistic outputs that rival professional design tools. The new model, sometimes referred to as "ChatGPT Images 2.0," produces images with significantly improved detail, lighting, and composition.
Early users report that the system can now accurately render text, faces, and complex scenes with fewer artifacts than previous AI image generators. The update also includes better understanding of compositional prompts, allowing for more precise control over elements like camera angles and depth of field.
Key improvements:
- Enhanced realism: Shadows, reflections, and textures look natural.
- Better text rendering: Signs, labels, and written content appear legible.
- Improved prompt adherence: The model follows multi-part requests more closely.
- Faster generation: Images appear in seconds rather than minutes.
Tech commentator Matt Wolfe, in a video review, demonstrated dozens of use cases including product mockups, concept art, and photo replacements. He noted that the new model "handles hands and faces much better than before," a common weakness of earlier AI image tools.
The upgrade is rolling out to ChatGPT Plus and Enterprise users, with broader availability expected soon. This positions ChatGPT to compete more directly with specialized image generators like Midjourney and DALL-E 3.