ChatGPT Images 2.0: OpenAI's Major Upgrade for Non-Latin Text and Image Generation


OpenAI has unveiled ChatGPT Images 2.0, a significant upgrade to its image generation model, just over a year after introducing the ability to create images directly from its chatbot. The company describes the new system as a “step change” for image generation, particularly in its ability to follow detailed instructions, render dense text, and accurately place and relate objects within a scene.

For the first time, OpenAI has integrated reasoning capabilities into an image model, enabling the system to perform tasks such as searching the web and verifying its outputs. These enhancements are designed to improve reliability, especially when accuracy, consistency, and visual cohesion are critical.

An example of ChatGPT's new non-Latin rendering abilities. (Image: OpenAI)

OpenAI has also focused on improving the model’s understanding and rendering of non-Latin text, achieving “significant gains” in handling languages such as Japanese, Korean, Chinese, Hindi, and Bengali. Additionally, the model now better captures the unique characteristics of different visual languages, making it more effective for applications like game prototyping and storyboarding.

The new model offers greater flexibility in aspect ratios, generating images as wide as 3:1 or as tall as 1:3. It supports resolutions up to 2K and can produce up to eight outputs in a single request.

A tortoiseshell cat in the style of Pokémon's third generation of games. (Image: ChatGPT)
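For readers who want to sanity-check a request against the limits described above, here is a minimal sketch in Python. It is not OpenAI's API: the `ImageRequest` class and `validate_request` function are hypothetical names invented for illustration, and reading "2K" as a longest side of 2048 pixels is an assumption, since the article does not give exact pixel dimensions.

```python
from dataclasses import dataclass

# Limits as reported in the article.
MAX_ASPECT = 3          # as wide as 3:1 or as tall as 1:3
MAX_OUTPUTS = 8         # up to eight outputs per request
# ASSUMPTION: "2K" is read here as a longest side of 2048 pixels.
MAX_LONG_SIDE = 2048


@dataclass(frozen=True)
class ImageRequest:
    """Hypothetical request shape, for illustration only."""
    width: int
    height: int
    count: int = 1


def validate_request(req: ImageRequest) -> list[str]:
    """Return a list of problems; an empty list means the request fits."""
    if req.width <= 0 or req.height <= 0:
        return ["width and height must be positive"]

    problems = []
    long_side = max(req.width, req.height)
    short_side = min(req.width, req.height)

    # Integer comparison avoids floating-point rounding on the ratio.
    if long_side > MAX_ASPECT * short_side:
        problems.append("aspect ratio exceeds 3:1 (or 1:3)")
    if long_side > MAX_LONG_SIDE:
        problems.append("longest side exceeds the assumed 2K limit")
    if not 1 <= req.count <= MAX_OUTPUTS:
        problems.append("count must be between 1 and 8")
    return problems


if __name__ == "__main__":
    print(validate_request(ImageRequest(1536, 512, count=8)))
    print(validate_request(ImageRequest(2048, 512)))
```

The first request is exactly 3:1 and passes every check; the second is 4:1, so the aspect ratio check flags it.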

Prior to its public release, I previewed ChatGPT Images 2.0. For my first test, I prompted the model to generate an image of a tortoiseshell cat in the pixel art style of Pokémon’s third generation. This was a challenging task, as AI models often struggle with pixel art, and the Game Boy Advance Pokémon games are iconic for their distinctive style. The result was impressive, accurately capturing the essence of the requested style.

Next, I tasked the model with converting the generated image into a transparent PNG. While the process took longer than expected, the output met the requirement, though it differed slightly from the original image. Finally, I asked ChatGPT to create a four-page manga featuring my cat enjoying a sunny day by a city stream.

Notice how the cat isn't rendered exactly like the one above it. (Image: ChatGPT)

Of the three tests, the second task consumed the most time, and the output deviated slightly from my initial prompt. However, the model successfully generated a transparent image, a capability that other image models often struggle with. As more users begin testing Images 2.0, we will gain a clearer understanding of how it compares to competitors like Google’s Nano Banana 2.

Source: Engadget
