Deepseek unveils v3.1 model with hybrid reasoning and lower prices

來源 Cryptopolitan

The Chinese startup DeepSeek introduced a new update, claiming it outperforms the widely recognized R1 across core benchmarks. In a Thursday WeChat post, the AI company confirmed that the new model version, V3.1, provides quicker responses to queries and signals their entry into AI agent development.

DeepSeek added that the model supports a hybrid reasoning architecture, having both thinking and non-thinking modes, improved agent capabilities, and stronger performance in tool use and task execution.

DeepSeek provides a “Deep Thinking” button to switch between modes

So far, DeepSeek’s official app and website have already been updated to V3.1, allowing users to toggle between thinking and non-thinking modes via the “Deep Thinking” button, similar to how Anthropic’s hybrid models like Opus and Sonnet work.

Reportedly, the V3.1 model also performs better on benchmarks like SWE and Terminal-Bench and thinking efficiency than R1. Moreover, according to Artificial Analysis, the model reached 60 points on its intelligence index in reasoning mode, just above the 59 scored by R1. Still, the underlying architecture remains the same, with 671 billion total parameters and 37 billion active.

Despite having a higher efficiency, it also uses slightly fewer tokens than R1 in reasoning mode. The new model, however, is slightly behind Alibaba’s latest model and OpenAI’s open-source reasoning model, GPT-OSS, in performance. It also lacks function calling in reasoning mode, which is considered a major constraint in agentic workflows.

The startup had first announced the new model on Tuesday, though it was only available on Hugging Face at the time. A separate statement added that the version had been tailored to run on next-generation Chinese-made AI chips. 

Now, the company unveiled a new pricing plan for its upgraded V3. The plan raises some charges, eliminates evening discounts, and reduces costs in certain applications, effective Sept. 6.

DeepSeek set pricing for its Input API at $0.07 per million tokens for cache hits and $0.56 for cache misses, with output tokens at $1.68 per million. The rates sharply undercut competitors: Gemini 2.5 Pro costs $10 per million output tokens ($15 for longer prompts), OpenAI’s GPT-5 is also $10, and Anthropic’s Claude Opus 4.1 goes as high as $75.

Analysts expected DeepSeek to release R1’s successor earlier this year

DeepSeek first rattled Silicon Valley with its low-cost and powerful R1 AI model launch in January. The model has since stayed at the forefront of China’s accelerating AI push, challenging US firms such as OpenAI.

Market observers, however, are still waiting for the follow-up to R1, a possible R2 model, which many had expected to launch earlier this year. Local reports have hinted that the delay in the launch stems from founder Liang Wenfeng’s insistence on perfecting the model. At the same time, he also manages his profitable High-Flyer Asset Management business. 

As previously reported by Cryptopolitan, DeepSeek has delayed the launch of its R2 AI model after encountering persistent technical issues with Huawei’s Ascend processors. Following the success of its R1 model in January, DeepSeek was encouraged by Chinese authorities to adopt Huawei chips instead of US-made Nvidia products. However, the company ran into significant problems during the training phase of its R2 model.

Sources familiar with the matter said DeepSeek had to rely on Nvidia chips for training while using Huawei’s Ascend processors only for inference. Industry insiders note that Chinese chips, including Huawei’s, often lag behind Nvidia in inter-chip connectivity, software support, and overall stability.

Huawei sent engineers to DeepSeek’s offices to help adapt the model. Still, the start-up could not complete a successful training run on Ascend hardware even with on-site assistance. Originally slated for a May release, the R2 model’s launch has been postponed due to these hardware challenges.

While some Chinese media outlets speculate that the new model could launch in the coming weeks, DeepSeek founder Liang Wenfeng has voiced internal frustration over its progress, urging the team to take the necessary time to develop a model that preserves the company’s competitive edge.

Meanwhile, industry heavyweights including Alibaba and Tencent continue to release updates briskly, with Alibaba’s Qwen models attracting a particularly strong following.

Sign up to Bybit and start trading with $30,050 in welcome gifts

免責聲明:僅供參考。 過去的表現並不預示未來的結果。
placeholder
【今日市場前瞻】美國PMI數據來襲!Jackson Hole年會召開美國PMI數據將出爐,黃金價格或迎波動;原油價格反彈,美國需求强勁;傑克遜霍爾年會召開,市場觀望情緒濃厚>>
作者  Alison Ho
9 小時前
美國PMI數據將出爐,黃金價格或迎波動;原油價格反彈,美國需求强勁;傑克遜霍爾年會召開,市場觀望情緒濃厚>>
placeholder
台積電股價暴跌後微彈,外資分析報告力撐股價,半導體產業前景是關鍵台積電(2330)於8月20日股價下探至1,140元,單日重挫45元,台股指數因此一度下滑至23,734.17點,月線失守,不過今天(21日)開盤後,台積電略微回升,最高上漲10元,現價1,145。
作者  財富進化論
9 小時前
台積電(2330)於8月20日股價下探至1,140元,單日重挫45元,台股指數因此一度下滑至23,734.17點,月線失守,不過今天(21日)開盤後,台積電略微回升,最高上漲10元,現價1,145。
placeholder
已獲金管會核准,貝萊德009813 ETF即將募集!追蹤美股50大龍頭,一鍵布局蘋果、輝達全球資產管理巨頭貝萊德(BlackRock)正式進軍台灣ETF市場!旗下“iShares安碩標普500卓越50 ETF”(009813)已獲金管會核准,將於9月30日至10月3日公開募集,預計10月30日掛牌上市。
作者  財富進化論
9 小時前
全球資產管理巨頭貝萊德(BlackRock)正式進軍台灣ETF市場!旗下“iShares安碩標普500卓越50 ETF”(009813)已獲金管會核准,將於9月30日至10月3日公開募集,預計10月30日掛牌上市。
placeholder
無懼川普撐腰,台積電90%市佔率無可撼動,摩根大通最新報告:英特爾的存在對台積電反而有利近期傳聞美國川普政府考慮以「晶片補貼換股權」入股英特爾10%,並可能對台積電採取類似模式,引發熱議。但摩根大通堅信,無論英特爾代工業務獲得多少支持,台積電在先進製程市場的領導地位無可動搖,市占率將穩居九成以上。
作者  財富進化論
9 小時前
近期傳聞美國川普政府考慮以「晶片補貼換股權」入股英特爾10%,並可能對台積電採取類似模式,引發熱議。但摩根大通堅信,無論英特爾代工業務獲得多少支持,台積電在先進製程市場的領導地位無可動搖,市占率將穩居九成以上。
placeholder
警報拉滿!納指創5月來最慘週,AI神話遭暴擊,輝達財報成最後「救命稻草」?這波下跌並非由單一事件引發,核心推手正是撐起美股科技行情的 「七大巨頭」—— 連續兩日的集體下挫,讓整個板塊承壓。
作者  投資-槓把子
9 小時前
這波下跌並非由單一事件引發,核心推手正是撐起美股科技行情的 「七大巨頭」—— 連續兩日的集體下挫,讓整個板塊承壓。
goTop
quote