Deepseek unveils v3.1 model with hybrid reasoning and lower prices

来源 Cryptopolitan

The Chinese startup DeepSeek introduced a new update, claiming it outperforms the widely recognized R1 across core benchmarks. In a Thursday WeChat post, the AI company confirmed that the new model version, V3.1, provides quicker responses to queries and signals their entry into AI agent development.

DeepSeek added that the model supports a hybrid reasoning architecture, having both thinking and non-thinking modes, improved agent capabilities, and stronger performance in tool use and task execution.

DeepSeek provides a “Deep Thinking” button to switch between modes

So far, DeepSeek’s official app and website have already been updated to V3.1, allowing users to toggle between thinking and non-thinking modes via the “Deep Thinking” button, similar to how Anthropic’s hybrid models like Opus and Sonnet work.

Reportedly, the V3.1 model also performs better on benchmarks like SWE and Terminal-Bench and thinking efficiency than R1. Moreover, according to Artificial Analysis, the model reached 60 points on its intelligence index in reasoning mode, just above the 59 scored by R1. Still, the underlying architecture remains the same, with 671 billion total parameters and 37 billion active.

Despite having a higher efficiency, it also uses slightly fewer tokens than R1 in reasoning mode. The new model, however, is slightly behind Alibaba’s latest model and OpenAI’s open-source reasoning model, GPT-OSS, in performance. It also lacks function calling in reasoning mode, which is considered a major constraint in agentic workflows.

The startup had first announced the new model on Tuesday, though it was only available on Hugging Face at the time. A separate statement added that the version had been tailored to run on next-generation Chinese-made AI chips. 

Now, the company unveiled a new pricing plan for its upgraded V3. The plan raises some charges, eliminates evening discounts, and reduces costs in certain applications, effective Sept. 6.

DeepSeek set pricing for its Input API at $0.07 per million tokens for cache hits and $0.56 for cache misses, with output tokens at $1.68 per million. The rates sharply undercut competitors: Gemini 2.5 Pro costs $10 per million output tokens ($15 for longer prompts), OpenAI’s GPT-5 is also $10, and Anthropic’s Claude Opus 4.1 goes as high as $75.

Analysts expected DeepSeek to release R1’s successor earlier this year

DeepSeek first rattled Silicon Valley with its low-cost and powerful R1 AI model launch in January. The model has since stayed at the forefront of China’s accelerating AI push, challenging US firms such as OpenAI.

Market observers, however, are still waiting for the follow-up to R1, a possible R2 model, which many had expected to launch earlier this year. Local reports have hinted that the delay in the launch stems from founder Liang Wenfeng’s insistence on perfecting the model. At the same time, he also manages his profitable High-Flyer Asset Management business. 

As previously reported by Cryptopolitan, DeepSeek has delayed the launch of its R2 AI model after encountering persistent technical issues with Huawei’s Ascend processors. Following the success of its R1 model in January, DeepSeek was encouraged by Chinese authorities to adopt Huawei chips instead of US-made Nvidia products. However, the company ran into significant problems during the training phase of its R2 model.

Sources familiar with the matter said DeepSeek had to rely on Nvidia chips for training while using Huawei’s Ascend processors only for inference. Industry insiders note that Chinese chips, including Huawei’s, often lag behind Nvidia in inter-chip connectivity, software support, and overall stability.

Huawei sent engineers to DeepSeek’s offices to help adapt the model. Still, the start-up could not complete a successful training run on Ascend hardware even with on-site assistance. Originally slated for a May release, the R2 model’s launch has been postponed due to these hardware challenges.

While some Chinese media outlets speculate that the new model could launch in the coming weeks, DeepSeek founder Liang Wenfeng has voiced internal frustration over its progress, urging the team to take the necessary time to develop a model that preserves the company’s competitive edge.

Meanwhile, industry heavyweights including Alibaba and Tencent continue to release updates briskly, with Alibaba’s Qwen models attracting a particularly strong following.

Sign up to Bybit and start trading with $30,050 in welcome gifts

免责声明:仅供参考。 过去的表现并不预示未来的结果。
placeholder
原油闪崩!黄金下挫!特朗普称以色列和伊朗同意全面停火WTI原油跌破65美元/桶,布伦特原油跌破69美元/桶,几乎收复本轮中东冲突以来的所有涨幅。
作者  Alison Ho
2025 年 6 月 24 日
WTI原油跌破65美元/桶,布伦特原油跌破69美元/桶,几乎收复本轮中东冲突以来的所有涨幅。
placeholder
原油价格4连跌!俄乌冲突有望结束,油价跌至2026年?特朗普政府正敦促乌克兰在27日之前同意一项结束俄乌冲突的计划。俄乌和平将加剧全球原油供应过剩,油价疲软不可避免。
作者  Alison Ho
2025 年 11 月 24 日
特朗普政府正敦促乌克兰在27日之前同意一项结束俄乌冲突的计划。俄乌和平将加剧全球原油供应过剩,油价疲软不可避免。
placeholder
2026年日元展望:多空因素交织,走势或迎“过山车”在日本财政政策和货币政策不确定下,机构对日元汇率前景出现较大分歧。美元/日元在2026年或重现2025年“过山车”走势,投资者可逢高做空或逢低做多。
作者  Alison Ho
2025 年 12 月 24 日
在日本财政政策和货币政策不确定下,机构对日元汇率前景出现较大分歧。美元/日元在2026年或重现2025年“过山车”走势,投资者可逢高做空或逢低做多。
placeholder
2026热门资产前瞻:黄金、比特币、美元将会出现关键拐点?权威机构观点汇总动荡的2025年已过去,展望2026年,商品、外汇和加密货币市场将何去何从?
作者  Insights
2025 年 12 月 25 日
动荡的2025年已过去,展望2026年,商品、外汇和加密货币市场将何去何从?
placeholder
美联储3月会议前瞻:2026年仅降息一次?美元、黄金迎巨震!点阵图变动和鲍威尔口风成焦点,2026年美联储或更长时间维持利率不变。
作者  Alison Ho
3 月 16 日 周一
点阵图变动和鲍威尔口风成焦点,2026年美联储或更长时间维持利率不变。
goTop
quote