Tencent: Hunyuan A13B Instruct

Tencent: Hunyuan A13B Instruct

tencent · Released Jul 8, 2025
30
Our Score

Hunyuan-A13B is a 13B active parameter Mixture-of-Experts (MoE) language model developed by Tencent, with a total parameter count of 80B and support for reasoning via Chain-of-Thought. It offers competitive benchmark performance across mathematics, science, coding, and multi-turn reasoning tasks, while maintaining high inference efficiency via Grouped Query Attention (GQA) and quantization support (FP8, GPTQ, etc.).

$0.14 / 1M Input Price
$0.57 / 1M Output Price
131,072 tokens Context Window
131,072 tokens Max Output
13B Parameters

Architecture

ModalityText → Text
TokenizerOther
Parameters13B

Model Information

OpenRouter ID tencent/hunyuan-a13b-instruct
Providertencent
Release Date July 8, 2025
Context Length131,072 tokens
Max Completion131,072 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.14 $0.000140
Output $0.57 $0.000570

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

1,115ms
Best Latency (TTFT)
15.5 tok/s
Best Throughput
0/1
Active Endpoints
Available via: SiliconFlow