
Alibaba Unveils QwQ-32B, a Compact Reasoning Model Rivaling DeepSeek-R1


On March 6, Alibaba released and open-sourced its new reasoning model, QwQ-32B, which has 32 billion parameters. Despite being far smaller than DeepSeek-R1, which has 671 billion parameters (with 37 billion active), QwQ-32B matches its performance across a range of benchmarks. It excelled in math and coding tests, outperforming OpenAI’s o1-mini and the distilled versions of DeepSeek-R1, and scored higher than DeepSeek-R1 on some evaluations such as LiveBench and IFEval. The model is trained with reinforcement learning and integrates agent capabilities for critical thinking and adaptive reasoning. Notably, QwQ-32B requires far less computational power, making it deployable on consumer-grade hardware. The release aligns with Alibaba’s broader AI strategy, which includes major investments in cloud and AI infrastructure. Following the announcement, Alibaba’s US-listed stock rose 8.61% to $141.03, and its Hong Kong shares gained more than 7%. [Jiemian, in Chinese]
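
For readers who want to try the open-weight release on their own hardware, the sketch below shows one way to load and query it with the Hugging Face transformers library. The repository ID "Qwen/QwQ-32B", the loading options, and the sample prompt are illustrative assumptions, not details taken from the Jiemian report.

```python
# Minimal sketch: loading the open-weight QwQ-32B checkpoint with Hugging Face
# transformers. Repo ID and settings are assumptions for illustration.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Qwen/QwQ-32B"  # assumed Hugging Face repository ID

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype="auto",   # keep the checkpoint's native precision
    device_map="auto",    # shard layers across available GPUs/CPU (needs accelerate)
)

# Build a chat-formatted prompt and generate a response.
messages = [{"role": "user", "content": "How many prime numbers are there below 100?"}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=512)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```

The device_map="auto" option lets transformers spread the weights across whatever GPU and CPU memory is available, which is what makes a 32-billion-parameter checkpoint practical on high-end consumer hardware, especially when combined with quantization.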
