6 Cool Things ChatGPT 4o Can Do That OpenAI Didn’t Highlight

COMBOFRE May 14, 2024

OpenAI lately launched its subsequent flagship mannequin GPT-4o and demonstrated some cool demos. The human-like voice chat has turn out to be the headline function, however there may be extra to it. OpenAI didn’t spotlight many cool issues that ChatGPT 4o is able to. These particulars can be found on OpenAI’s web page and I went via all of them. On that notice, let’s discover out the cool new capabilities of ChatGPT 4o.

1. Correct Textual content Technology in Pictures

We all know that Diffusion fashions battle with producing texts on photos. Dall -E 3 nonetheless fails to generate photos with the given textual content. Nonetheless, the ChatGPT 4o mannequin which is an end-to-end multimodal mannequin, can render texts precisely. OpenAI didn’t point out this within the presentation. Nonetheless, yow will discover the instance on OpenAI’s page the place the corporate explores its capabilities.

gpt-4o text rendering capability in image generation — Picture Courtesy: OpenAI

It may generate and add textual content to pictures effortlessly. The consistency in lots of samples is exceptional. You can even connect photos and ask it to generate photos from totally different angles of the identical character, and it maintains consistency throughout all eventualities. It may additionally generate a 3D view of objects which you’ll be able to mix to create a 3D render. To not point out, it could actually generate fonts too.

Picture Courtesy: OpenAI
Picture Courtesy: OpenAI
Picture Courtesy: OpenAI

Take into account that these capabilities aren’t accessible on ChatGPT but. It nonetheless makes use of Dall -E 3 to generate photos. OpenAI might unlock these options within the close to future.

2. GPT-4o Can Course of Movies Too

chatgpt 4o video processing — Picture Processing: OpenAI

OpenAI didn’t point out that GPT-4o can deal with movies too. Properly, on the mannequin web page, OpenAI has demonstrated which you could add a video and ask GPT-4o to summarize it. From transcription to bullet-point abstract, it does every little thing. So it appears Gemini 1.5 Pro is not the only model that can process videos.

Why Spend on AI Gadgets When These AI Apps Can Do It All

Anshuman Jain

May 9, 2024

3. GPT-4o Can Be Your Tutor

In a presentation with Khan Academy’s Sal Khan, OpenAI showcased a fascinating demo using the GPT-4o model. Basically, on an iPad, you can share your screen with ChatGPT 4o, and it can see everything on your screen.

Did you hear? @OpenAI‘s latest mannequin can motive throughout audio, imaginative and prescient, and textual content in actual time.

How does GPT-4o do with math tutoring?🤔@salkhanacademy and his son check it on a Khan Academy math drawback.

You will get AI-powered math tutoring proper now with Khanmigo:… pic.twitter.com/8NXoh0SwtU— Khan Academy (@khanacademy) May 13, 2024

Now you can ask it to elucidate and show you how to discover options to an issue. Be it arithmetic, sciences, charts, maps, or the rest, ChatGPT 4o might be your private trainer guiding you all through your research session. That’s such a fantastic software of AI, powered by GPT-4o’s multimodal imaginative and prescient functionality. By the way in which, it additionally works with the ChatGPT desktop app for macOS.