OpenAI: GPT Audio

openai · Released Jan 19, 2026 · Legacy
Intelligence: 24.5 (Our Score), rank #464 / 576
Speed: Not reported
Input: $2.50 per 1M tokens, rank #503 / 576
Output: $10.00 per 1M tokens, rank #492 / 576
Context: 128,000 tokens, rank #329 / 576

Analysis Summary

GPT Audio is OpenAI's full-tier audio model, supporting text and audio input and output with tool use and function calling across a 128K context window. Without benchmark data, its reasoning and coding capability cannot be assessed against the broader field.

The model is designed for voice-enabled applications, real-time conversational interfaces, and audio-integrated pipelines. Its practical value depends on audio quality, latency, and transcription accuracy, none of which are captured in available benchmark data.

At $2.50 input and $10.00 output per million tokens, it is expensive relative to text-only alternatives. Teams building audio-first products within the OpenAI ecosystem may find it useful, but the lack of benchmark evidence means capability claims should be validated against specific use cases before production deployment.

Assessed June 6, 2026

Editorial notes

GPT Audio from OpenAI supports bidirectional audio and text with tool use and function calling, but has no benchmark data to assess reasoning or coding capability.

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch.

Performance Profile

Intelligence 0/10
Technical 0/10
Content 4/10
Value 6.3/10

How OpenAI: GPT Audio compares

Its 128K-token context window is larger than 43% of the models we list. At $2.50 per million input tokens it is cheaper than 13% of comparable models.
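
These percentages appear to follow from the ranks listed at the top of the page (#329 of 576 for context, #503 of 576 for input price). The sketch below assumes the share is the number of models ranked lower divided by the total; that formula is an assumption that happens to reproduce both figures, not something the page states, and `share_beaten` is a hypothetical helper name.

```python
def share_beaten(rank: int, total: int) -> float:
    """Percentage of listed models ranked below the given position.

    Assumed formula: (total - rank) / total. The page does not document
    how its percentages are derived.
    """
    if not 1 <= rank <= total:
        raise ValueError("rank must be between 1 and total")
    return 100.0 * (total - rank) / total


# Ranks taken from the listing: context #329 / 576, input price #503 / 576.
context_share = share_beaten(329, 576)
input_price_share = share_beaten(503, 576)
print(f"context: {context_share:.0f}%, input price: {input_price_share:.0f}%")
```

Rounded to whole percentages, both values match the figures quoted above.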

About OpenAI: GPT Audio

The gpt-audio model is OpenAI's first generally available audio model. The new snapshot features an upgraded decoder for more natural-sounding voices and maintains better voice consistency. Audio is priced..

Capabilities

Tool Use, Function Calling

Model Information

OpenRouter ID: openai/gpt-audio
Provider: openai
Release Date: January 19, 2026
Context Length: 128,000 tokens
Max Completion: 16,384 tokens
Status: Active
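
The context length and max completion figures together bound how many tokens a single response can contain. A minimal sketch follows, assuming prompt and completion tokens share the 128,000-token window (the page does not say so explicitly); `completion_budget` is a hypothetical helper, not part of any SDK.

```python
CONTEXT_LENGTH = 128_000   # tokens, from the listing
MAX_COMPLETION = 16_384    # tokens, from the listing


def completion_budget(prompt_tokens: int, requested: int = MAX_COMPLETION) -> int:
    """Largest completion size that fits the listed limits.

    Assumes prompt and completion share one context window.
    """
    if prompt_tokens < 0 or requested < 0:
        raise ValueError("token counts must be non-negative")
    remaining = CONTEXT_LENGTH - prompt_tokens
    if remaining <= 0:
        return 0  # the prompt alone already fills the window
    return min(requested, MAX_COMPLETION, remaining)


print(completion_budget(100_000))  # capped by the max completion limit
print(completion_budget(120_000))  # capped by the remaining context
```

Under that assumption, the completion cap is the binding limit until the prompt exceeds 111,616 tokens; beyond that, the remaining context is.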

Pricing

Token Type    Cost per 1M tokens    Cost per 1K tokens
Input         $2.50                 $0.002500
Output        $10.00                $0.010000
Live Performance

Live endpoint metrics, refreshed every 30 minutes.

Avg Uptime: 100%
Best Latency (TTFT): 1,387 ms
Best Throughput: 42 tok/s
Active Endpoints: 1/1
Available via: OpenAI
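
The latency and throughput figures can be combined into a rough lower bound on response time. The sketch assumes a simple additive model (time to first token plus output tokens divided by throughput). Both inputs are the "best" values reported above, so typical responses will likely be slower. `min_response_seconds` is a hypothetical helper name.

```python
BEST_TTFT_SECONDS = 1.387          # best time to first token, from the listing
BEST_THROUGHPUT_TOK_PER_S = 42.0   # best throughput, from the listing


def min_response_seconds(output_tokens: int) -> float:
    """Optimistic time to receive a full response of the given length.

    Assumed model: TTFT + output_tokens / throughput.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return BEST_TTFT_SECONDS + output_tokens / BEST_THROUGHPUT_TOK_PER_S


# Example: a 420-token reply.
print(f"{min_response_seconds(420):.3f} s")
```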

Frequently asked questions about OpenAI: GPT Audio

How much does OpenAI: GPT Audio cost?

OpenAI: GPT Audio costs $2.50 per million input tokens and $10.00 per million output tokens.

What is the context window of OpenAI: GPT Audio?

OpenAI: GPT Audio has a context window of 128,000 tokens (128K).

What can OpenAI: GPT Audio do?

OpenAI: GPT Audio supports tool use and function calling.

Who created OpenAI: GPT Audio?

OpenAI: GPT Audio is developed by OpenAI and was released on January 19, 2026.