OpenAI: GPT Audio
Analysis Summary
GPT Audio is OpenAI's full-tier audio model, supporting text and audio input and output with tool use and function calling across a 128K context window. Without benchmark data, its reasoning and coding capability cannot be assessed against the broader field.
The model is designed for voice-enabled applications, real-time conversational interfaces, and audio-integrated pipelines. Its practical value depends on audio quality, latency, and transcription accuracy, none of which are captured in available benchmark data.
At $2.50 input and $10.00 output per million tokens, it is expensive relative to text-only alternatives. Teams building audio-first products within the OpenAI ecosystem may find it useful, but the lack of benchmark evidence means capability claims should be validated against specific use cases before production deployment.
Assessed June 6, 2026
Editorial notes
GPT Audio from OpenAI supports bidirectional audio and text with tool use and function calling, but has no benchmark data to assess reasoning or coding capability.
Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?
Performance Profile
How OpenAI: GPT Audio compares
Its 128K-token context window is larger than 43% of the models we list. At $2.50 per million input tokens it is cheaper than 13% of comparable models.
About OpenAI: GPT Audio
The gpt-audio model is OpenAI's first generally available audio model. The new snapshot features an upgraded decoder for more natural sounding voices and maintains better voice consistency. Audio is priced..
Capabilities
How does OpenAI: GPT Audio stack up?
Compare side-by-side with other legacy models.
Model Information
| OpenRouter ID |
openai/gpt-audio
|
| Provider | openai |
| Release Date | January 19, 2026 |
| Context Length | 128,000 tokens |
| Max Completion | 16,384 tokens |
| Status | Active |
Pricing
| Token Type | Cost per 1M tokens | Cost per 1K tokens |
|---|---|---|
| Input | $2.50 | $0.002500 |
| Output | $10.00 | $0.010000 |
Live Performance
Live endpoint metrics, refreshed every 30 minutes.
External Resources
Explore Related Models
Frequently asked questions about OpenAI: GPT Audio
How much does OpenAI: GPT Audio cost?
OpenAI: GPT Audio costs $2.50 per million input tokens and $10.00 per million output tokens.
What is the context window of OpenAI: GPT Audio?
OpenAI: GPT Audio has a context window of 128,000 tokens (128K).
What can OpenAI: GPT Audio do?
OpenAI: GPT Audio supports tool use and function calling.
Who created OpenAI: GPT Audio?
OpenAI: GPT Audio is developed by OpenAI and was released on January 19, 2026.
Data sourced from OpenRouter API, Artificial Analysis and Hugging Face Open LLM Leaderboard. Scores are editorially curated by our team.
Last updated: June 9, 2026 9:57 pm