OpenAI has introduced a new feature called "Predicted Outputs" for its GPT-4o and GPT-4o-mini language models that makes AI-supported text processing much faster than before.

Initial testing of the new feature shows significant speed gains: Code editing responses come in two to four times faster compared to existing models, and large file modifications that used to take about 70 seconds can now be completed in about 20 seconds. OpenAI points to several key applications, including updating blog posts in documents, iterating on previous responses, and rewriting code in an existing file.

The system operates on a straightforward principle: developers can input an expected portion of the output ahead of time. This approach works especially well for repetitive tasks or small document changes, since the model needs to generate fewer new tokens. OpenAI states that as a general guideline, when 50 percent of output tokens can be saved, the processing time decreases by about 50 percent.
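As a rough sketch of how a developer supplies that expected output: OpenAI's Chat Completions API accepts a `prediction` field whose content the model can accept cheaply wherever the response matches it. The file contents and edit instruction below are illustrative, not from the article.

```python
# Sketch of a Predicted Outputs request body for OpenAI's Chat Completions API.
# The "prediction" field carries text the model is expected to largely reuse,
# e.g. the current file when asking for a small edit.

original_code = 'def greet(name):\n    print("hello, " + name)\n'

payload = {
    "model": "gpt-4o-mini",
    "messages": [
        {
            "role": "user",
            "content": (
                "Rename the function greet to welcome. "
                "Output only the full updated file.\n\n" + original_code
            ),
        }
    ],
    # Most of the file should come back unchanged, so we pass the current
    # contents as the prediction; matching spans are accepted instead of
    # being generated token by token, which is where the speedup comes from.
    "prediction": {"type": "content", "content": original_code},
}
```

With the official `openai` Python client, this payload would be sent as `client.chat.completions.create(**payload)`; an API key is required to actually run the request, and mismatched prediction tokens are still billed at completion rates.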

Predicted outputs are only useful for special use cases

The feature performs best when the prediction closely aligns with what the model would respond with, but it's less effective for generating entirely new content where meaningful predictions aren't possible. OpenAI has successfully tested the feature across multiple programming languages, including Python, JavaScript, Go, and C++.

However, the feature comes with certain restrictions. It's only available with the GPT-4o and GPT-4o-mini models and doesn't work with advanced API parameters like multiple outputs or function calls. OpenAI suggests that developers should first test the feature with controlled, predictable tasks to achieve maximum efficiency.

Additional details about the feature can be found in OpenAI's documentation.

Summary
  • OpenAI introduces Predicted Outputs, a feature for the GPT-4o and GPT-4o-mini models that accelerates AI-powered text processing. Responses arrive two to four times faster than with existing models, with large file edits now taking around 20 seconds instead of 70.
  • The system allows developers to pre-populate an expected portion of the output. Saving 50 percent of output tokens reduces latency by around 50 percent. Tests in Python, JavaScript, Go and C++ have been successful.
  • The feature is particularly suitable for repetitive tasks and minor document changes, but less so for completely new content. It does not support advanced API parameters such as multiple outputs or function calls, and is only available for GPT-4o and GPT-4o-mini.
Max is managing editor at THE DECODER. As a trained philosopher, he deals with consciousness, AI, and the question of whether machines can really think or just pretend to.