OpenAI's new GPT-4o mini offers higher performance at 60 percent lower price

Midjourney prompted by THE DECODER

Update

Benchmark added

OpenAI has expanded its lineup with GPT-4o mini, a new AI model that offers improved performance at a lower cost.

GPT-4o mini offers higher performance while costing 60 percent less than OpenAI's most affordable model to date, according to the company. OpenAI claims it's the most powerful and cost-effective small model available today. Developers pay 15 cents per 1M input tokens and 60 cents per 1M output tokens (roughly the equivalent of 2500 pages in a standard book).

The new model is available immediately to users of ChatGPT's Free, Plus, and Team plans. Enterprise customers will have access within the next week.

ChatGPT users can now use GPT-4o mini instead of GPT-3.5 Turbo. Developers can still access GPT-3.5 via the API if they prefer not to switch to GPT-4o mini. OpenAI plans to remove GPT-3.5 from the API in the future, but hasn't set a specific date.

GPT-4o mini supports text and images via the API and is expected to soon support all multimodal input and output, including video and audio. OpenAI attributes the efficiency gains to improvements in model architecture and optimized training data and methods.

Fintech startup Ramp uses GPT-4o mini to extract data from uploaded receipts, according to OpenAI. Email client Superhuman tested the model for an auto-suggest feature for email responses.

In the MMLU (Measuring Massive Multitask Language Understanding) benchmark for general knowledge and language comprehension, GPT-4o mini scored 82 percent. This makes it one of the highest performing small models in this specific benchmark.

For comparison, GPT-3.5 scored 70 percent and GPT-4o scored 88.7 percent. Anthropic's smallest and cheapest model, Claude 3 Haiku, scored around 75 percent. As always, benchmark results don't fully reflect real-world performance.

This announcement comes as OpenAI faces increasing pressure on its model business. Competitors like Anthropic sometimes offer better or cheaper models via API, while other vendors and open-source models are also competing in the market for efficient, small, and low-cost models, including Microsoft.

Join our community

Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.

Recommendation

AI in practice

OpenAI's new GPT-4o mini offers higher performance at 60 percent lower price

Anthropic releases Claude 4 with new safety measures targeting CBRN misuse

LeCun now reports to Alexandr Wang as Meta reshuffles its top AI leadership

OpenAI CEO Sam Altman shifts focus to GPT-6 after the rocky rollout of GPT-5

Google adds new AI features to the Pixel 10

Meta's human-like chatbot personas can mislead users and result in real-world harm

OpenAI launches GPT-5 as a unified system with adaptive reasoning for complex tasks

Google Deepmind's Genie 3 creates interactive 3D worlds that stay consistent for "multiple minutes"

OpenAI's new GPT-4o mini offers higher performance at 60 percent lower price

Share

Bank details