Hunyuan A13B Instruct reasoning
Hunyuan-A13B is a 13B active parameter Mixture-of-Experts (MoE) language model developed by Tencent, with a total parameter count of 80B and support for reasoning via Chain-of-Thought. It offers competitive benchmark performance across mathematics, science, coding, and multi-turn reasoning tasks, while maintaining high inference efficiency via Grouped Query Attention (GQA) and quantization support (FP8, GPTQ, etc.).
Capabilities
Context Window 131k tokens
Max Output 131k tokens
Inputs
Outputs
Pricing (per 1M tokens)
Input $0.14
Output $0.57
Cache Read -
Cache Write -
Supported Parameters
frequency_penaltyinclude_reasoningreasoningresponse_formatstructured_outputstemperaturetop_ktop_p