
Photo by John Schnobrich on Unsplash
Alibaba Releases New Qwen AI Model And Claims It Outperforms DeepSeek-V3
The Chinese giant Alibaba released the latest version of its flagship AI model, Qwen, this Wednesday. The company claims it can perform better than the popular DeepSeek-V3.
In a Rush? Here are the Quick Facts!
- Alibaba released its latest reasoning model Qwen 2.5-Max this Wednesday.
- The Chinese giant claims it outperforms popular models like DeepSeek-V3, GPT-4o, and Llama-3.1-405B.
- The company also launched Qwen2.5-VL this week, an AI model capable of processing images and acting as an AI agent that uses computers and mobile devices to perform tasks.
According to Reuters, Alibaba launched Qwen 2.5-Max, as it has named the new reasoning model, during the Lunar New Year holidays in China, in response to the massive AI developments of the past few days and adding to domestic competition.
On Monday, Nvidia shares dropped 17% in just one day.
Now, Alibaba has announced the latest versions of its Qwen model—it released 100 open-source AI models for the Qwen suite in September last year—promising better results than popular frontier models.
“Qwen 2.5-Max outperforms (…) almost across the board GPT-4o, DeepSeek-V3 and Llama-3.1-405B,” wrote the company on its official WeChat account.
The API for the new reasoning model Qwen 2.5-Max is available through Alibaba’s cloud, and users can also test the model on its chat page.
“We are developing Qwen2.5-Max, a large-scale MoE model that has been pretrained on over 20 trillion tokens and further post-trained with curated Supervised Fine-Tuning (SFT) and Reinforcement Learning from Human Feedback (RLHF) methodologies,” wrote the Qwen team on GitHub.
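The Chinese giant also released Qwen2.5-VL, which can act as an AI agent on computers and mobile devices, a capability comparable to OpenAI’s Operator, which allows ChatGPT to perform tasks autonomously by taking control of the user’s computer.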
According to Alibaba’s team, all Qwen models outperform similar versions from OpenAI, Microsoft, Google, Meta, and DeepSeek.