GPT-4o Revolutionizes Image Generation in ChatGPT

OpenAI has announced the expansion of multimodality with the release of GPT-4o, a native image gen model, ChatGPT’s biggest step yet. Until then, as of today, GPT-4o is fully live in both ChatGPT and Sora. Users who are subscribed to OpenAI’s Pro plan—$200 a month—have access to it currently. The introduction of GPT-4o represents the…

Lisa Wong Avatar

By

GPT-4o Revolutionizes Image Generation in ChatGPT

OpenAI has announced the expansion of multimodality with the release of GPT-4o, a native image gen model, ChatGPT’s biggest step yet. Until then, as of today, GPT-4o is fully live in both ChatGPT and Sora. Users who are subscribed to OpenAI’s Pro plan—$200 a month—have access to it currently. The introduction of GPT-4o represents the first major enhancement to ChatGPT's image-generation abilities in over a year, promising more accurate and detailed images compared to its predecessor, DALL-E 3.

This incorporates all of those important insights garnered from OpenAI’s pacts with firms reminiscent of Shutterstock. DALL-E 3’s extensive training enables the model to produce highly detailed images with remarkable accuracy. It obviously requires additional time to process and create images than DALL-E 3. Although the processing requires much longer, the final results will have much fewer errors, better quality, and fidelity.

In just a few weeks, Plus and eventually free users of ChatGPT will get access, making it much more widely available. Developers can access this powerful image-generation tool through OpenAI’s API service. This allows them to easily build on top of its robust features to add even more value within their own applications.

Brad Lightcap, OpenAI’s chief operating officer, noted that in creating the technology behind the new image-generation capabilities, the company is committed to clearing the rights of artists.

“We’re respecting of the artists’ rights in terms of how we do the output, and we have policies in place that prevent us from generating images that directly mimic any living artists’ work,” – Brad Lightcap, OpenAI’s chief operating officer

The rollout of GPT-4o behind a paywall is a signal that this is not only a technical upgrade but a strategic play for OpenAI. The technique further enables ChatGPT to generate and edit visuals and photographs with astonishing strength and sophistication. This new feature helps it stand out as a formidable innovator in the burgeoning AI-based creativity tool landscape.