Baidu Launches ERNIE 5.0 Multimodal AI Model, Claims Superior Performance Over GPT-5 and Gemini 2.5 Pro
Summary
Baidu has unveiled ERNIE 5.0, a multimodal AI model that the company claims outperforms GPT-5 and Gemini 2.5 Pro on visual tasks. The model is priced at $0.85 per million input tokens and $3.40 per million output tokens. The launch comes as the Chinese tech giant expands globally with AI tools, including digital human technology that powered 83% of livestreamers during China's Double 11 shopping event.
Key Points
- Baidu has launched ERNIE 5.0, a proprietary multimodal AI model that, according to company benchmarks, outperforms GPT-5 and Gemini 2.5 Pro on document understanding, chart interpretation, and other visual tasks
- The new model costs $0.85 per million input tokens and $3.40 per million output tokens, which places it in the mid-range relative to Western competitors, and it offers native omni-modal processing across text, images, audio, and video
- Baidu is expanding globally with AI products including the GenFlow 3.0 agent platform, international no-code tools, and digital human technology that powered 83% of livestreamers during China's Double 11 shopping event
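
To put the quoted pricing in concrete terms, the short Python sketch below estimates what a workload might cost at $0.85 per million input tokens and $3.40 per million output tokens. The function name `estimate_cost` and the sample token counts are hypothetical and chosen only for illustration; the sketch does not call any Baidu API and ignores any discounts, tiers, or per-modality rates that may apply in practice.

```python
# Prices quoted in the article, in US dollars per one million tokens.
INPUT_PRICE_PER_MILLION = 0.85
OUTPUT_PRICE_PER_MILLION = 3.40


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for the given token counts.

    Hypothetical helper for illustration; not part of any Baidu SDK.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (input_tokens * INPUT_PRICE_PER_MILLION
             + output_tokens * OUTPUT_PRICE_PER_MILLION)
    return total / 1_000_000


if __name__ == "__main__":
    # Assumed sample workload: 2M input tokens and 500k output tokens.
    print(f"Estimated cost: ${estimate_cost(2_000_000, 500_000):.2f}")
```

For the assumed sample workload, input and output each contribute about $1.70, for a total of roughly $3.40. Because output tokens cost four times as much as input tokens at these rates, output-heavy workloads dominate the bill.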