Speed and Cost Optimized
OpenAI is extending its GPT-5.4 line downward, introducing GPT-5.4 mini and nano for developers who value fast response times and significantly lower operational costs. The new models are engineered to deliver strong performance without the computational resources of their larger counterpart. This directly addresses applications where immediate feedback is crucial, such as coding assistants, automated background agents, and dynamic visual processing tools. For these use cases, a more compact model often yields better overall results by balancing capability with efficiency, making advanced AI practical for a wider range of development projects. The emphasis is on a better user experience through speed, rather than solely on maximizing intricate reasoning capabilities.
Performance Benchmarks & Savings
The performance disparity between the full GPT-5.4 and its scaled-down versions is remarkably small, especially considering the substantial gains in speed and reductions in cost. For instance, GPT-5.4 mini achieves an impressive 54.4 percent on the SWE-Bench Pro benchmark, closely trailing the full model's 57.7 percent. Similarly, on OSWorld-Verified, the mini model reaches 72.1 percent, with the larger version scoring 75 percent, indicating that core functionalities remain robust. The cost savings are far more pronounced, with GPT-5.4 mini priced at $0.75 per million input tokens and $4.50 per million output tokens. The nano variant offers even greater economy at $0.20 for input and $1.25 for output tokens. Crucially, these cost reductions do not compromise essential features; both mini and nano support text and image inputs, complex tool and function calling, and a substantial 400,000 token context window, ensuring developers retain vital capabilities.
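The quoted per-token rates make the savings easy to quantify. The sketch below applies those prices to a hypothetical workload; the model name strings, workload size, and token counts are illustrative assumptions, not OpenAI identifiers or real usage data.

```python
# Cost comparison at the per-token prices quoted above
# (GPT-5.4 mini: $0.75/M input, $4.50/M output;
#  GPT-5.4 nano: $0.20/M input, $1.25/M output).
# Model name keys are illustrative assumptions.

PRICES_PER_MILLION = {
    "gpt-5.4-mini": {"input": 0.75, "output": 4.50},
    "gpt-5.4-nano": {"input": 0.20, "output": 1.25},
}

def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the dollar cost of one request at the quoted rates."""
    rates = PRICES_PER_MILLION[model]
    return (input_tokens * rates["input"]
            + output_tokens * rates["output"]) / 1_000_000

# Hypothetical workload: 10,000 requests, each with
# 2,000 input tokens and 500 output tokens.
REQUESTS = 10_000
mini_total = REQUESTS * request_cost("gpt-5.4-mini", 2_000, 500)
nano_total = REQUESTS * request_cost("gpt-5.4-nano", 2_000, 500)
print(f"mini: ${mini_total:.2f}, nano: ${nano_total:.2f}")
# → mini: $37.50, nano: $10.25
```

At these rates the same workload costs roughly 3.7x less on nano than on mini, which is the trade developers are weighing against the benchmark gap described above.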
Multi-Model Workflow Advantage
OpenAI is advocating a multi-model approach, encouraging developers to distribute tasks across model tiers instead of relying on a single monolithic system. The strategy pairs larger, more powerful models for complex planning and decision-making with smaller, more efficient models for routine or repetitive work. This mirrors the workflow of many real-world applications, where one model analyzes a codebase or devises strategic changes while another processes the associated data or carries out predictable steps. By offloading simpler operations to the more economical mini or nano models, developers can reserve the full GPT-5.4 for tasks that demand its strongest analysis and judgment. The tiered approach optimizes resource use, minimizes cost, and improves application responsiveness; early adopters report competitive or superior results at reduced expense.
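The tiered dispatch pattern described above can be sketched in a few lines. This is a minimal illustration under stated assumptions: the complexity heuristic, the `Task` type, and the tier table are hypothetical, not an OpenAI API, and a real router would classify tasks with a model or heuristics rather than a hand-set label.

```python
# Minimal sketch of tiered model routing: send each task to the cheapest
# tier judged sufficient for it. Tier names and the complexity field are
# illustrative assumptions, not part of any OpenAI SDK.

from dataclasses import dataclass

@dataclass
class Task:
    description: str
    complexity: str  # "low", "medium", or "high" (assumed pre-classified)

# Cheapest-sufficient tier for each complexity level.
TIER_FOR_COMPLEXITY = {
    "low": "gpt-5.4-nano",     # bulk, repetitive steps
    "medium": "gpt-5.4-mini",  # routine execution
    "high": "gpt-5.4",         # planning and judgment
}

def route(task: Task) -> str:
    """Pick the model tier for a task, defaulting to the full model."""
    return TIER_FOR_COMPLEXITY.get(task.complexity, "gpt-5.4")

plan = Task("Devise a refactoring strategy for the codebase", "high")
edit = Task("Apply the agreed rename across 40 files", "low")
print(route(plan), route(edit))
# → gpt-5.4 gpt-5.4-nano
```

Defaulting to the full model on an unrecognized label is a deliberate safety choice: mis-routing a hard task to a small model costs correctness, while mis-routing an easy task to the large model only costs money.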
Accessibility and Future Direction
GPT-5.4 mini is available across OpenAI's API, Codex, and ChatGPT platforms, reaching a broad developer base. Free and Go tier users can access it via the Thinking option, while other users may encounter it as an automatic fallback after exceeding their GPT-5.4 Thinking limits. The nano model, currently API-only, targets teams running high-volume operations where strict cost control is paramount. The release signals a clear shift toward making advanced AI more adaptable and economical. For developers building real-time AI features, the message is unequivocal: smaller, optimized models are now capable enough to handle a significant share of everyday AI workloads. Consequently, the ability to intelligently balance speed, cost, and functional capability is becoming an increasingly practical consideration in modern AI development.
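The quota-based fallback described above reduces to a simple selection rule. The sketch below is only an illustration of that behavior; the function, model identifiers, and quota numbers are assumptions, not OpenAI's actual implementation.

```python
# Illustrative sketch of the described fallback: once a user exhausts
# their GPT-5.4 Thinking quota, requests are served by GPT-5.4 mini.
# Identifiers and quota values are hypothetical.

def select_thinking_model(requests_used: int, quota: int) -> str:
    """Return the model that serves the next Thinking request."""
    if requests_used < quota:
        return "gpt-5.4-thinking"
    return "gpt-5.4-mini"  # automatic fallback past the quota

print(select_thinking_model(5, 10))   # within quota
print(select_thinking_model(10, 10))  # quota exhausted → fallback
```

The practical consequence for developers is that responses may silently come from the smaller model, so applications sensitive to the difference should check which model actually handled a request.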