A sophisticated text-based Mixture-of-Experts (MoE) model featuring 21B total parameters with 3B activated per token, delivering exceptional multimodal understanding and generation through heterogeneous MoE structures and modality-isolated routing. Supporting an...
Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price120k0.07- -0.28
ERNIE-4.5-21B-A3B-Thinking is Baidu's upgraded lightweight MoE model, refined to boost reasoning depth and quality for top-tier performance in logical puzzles, math, science, coding, text generation, and expert-level academic benchmarks.
Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price131k0.07- -0.28
ERNIE-4.5-300B-A47B is a 300B parameter Mixture-of-Experts (MoE) language model developed by Baidu as part of the ERNIE 4.5 series. It activates 47B parameters per token and supports text generation in...
Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price123k0.28- -1.10
A powerful multimodal Mixture-of-Experts chat model featuring 28B total parameters with 3B activated per token, delivering exceptional text and vision understanding through its innovative heterogeneous MoE structure with modality-isolated routing....
Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price30k0.14- -0.56
ERNIE-4.5-VL-424B-A47B is a multimodal Mixture-of-Experts (MoE) model from Baidu’s ERNIE 4.5 series, featuring 424B total parameters with 47B active per token. It is trained jointly on text and image data...
Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price123k0.42- -1.25
Qianfan-OCR-Fast is a domain-specific multimodal large model purpose-built for OCR. By leveraging specialized OCR training data while preserving versatile multimodal intelligence, it provides a powerful performance upgrade over Qianfan-OCR.
Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price65k-- --
CoBuddy is a code generation model from Baidu, optimized for coding tasks and AI Agent workflows. It features high inference throughput and low end-to-end latency, with native support for tool...
Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price131k-- --