Baidu Launches ERNIE 5.0 Multimodal AI Model, Claims Superior Performance Over GPT-5 and Gemini 2.5 Pro

Nov 14, 2025
Venturebeat
Article image for Baidu Launches ERNIE 5.0 Multimodal AI Model, Claims Superior Performance Over GPT-5 and Gemini 2.5 Pro

Summary

Baidu unveils ERNIE 5.0 multimodal AI model claiming to outperform GPT-5 and Gemini 2.5 Pro in visual tasks while pricing competitively at $0.85-$3.40 per million tokens as the Chinese tech giant expands globally with AI tools that already power 83% of livestreamers during major shopping events.

Key Points

  • Baidu launches ERNIE 5.0, a proprietary multimodal AI model that outperforms GPT-5 and Gemini 2.5 Pro on document understanding, chart interpretation, and visual tasks according to company benchmarks
  • The new model costs $0.85 per million input tokens and $3.40 per million output tokens, positioning it as mid-range pricing compared to Western competitors while offering native omni-modal processing across text, images, audio, and video
  • Baidu expands globally with AI products including GenFlow 3.0 agent platform, international no-code tools, and digital human technology that powered 83% of livestreamers during China's Double 11 shopping event

Tags

Read Original Article