DeepSeek Releases V4 Trillion-Parameter Open Model

March 3, 2026

Hangzhou, China - DeepSeek, the Chinese AI lab backed by hedge fund High-Flyer, released DeepSeek V4 today, a trillion-parameter open model built on a highly efficient sparse architecture. The company, founded in July 2023 by Liang Wenfeng, continues the rapid cadence of releases that drew global attention with the DeepSeek-R1 model and its eponymous chatbot in January 2025.

DeepSeek V4 arrives as the latest salvo in an increasingly intense global race to build larger, more capable open-weight language models. The company’s choice to keep the model open, releasing its weights and architecture to the public, signals a bet that community-driven innovation and transparency can match or exceed the closed, proprietary strategies of rivals like OpenAI. DeepSeek-R1 had already demonstrated that a Chinese startup could produce responses comparable to those of OpenAI’s GPT-4 and o1, at a reported training cost significantly lower than that of its Western counterparts.

The sparse design at the heart of V4 is the technical headline. Rather than activating all trillion parameters for every query, the model routes each token of input through only a small subset of specialized sub-networks, called experts. This approach, known as mixture-of-experts, has been pursued by labs including Google and Mistral, and DeepSeek has now pushed it to the trillion-parameter scale. The result is a model that can, in principle, approach the knowledge capacity of a dense trillion-parameter model while requiring far less compute per token at inference time. For developers and researchers, that could mean running state-of-the-art AI with less compute per query, though the full set of weights must still be held in memory, or serving millions of users without bankrupting a data center.
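To make the routing idea concrete, here is a minimal sketch of top-k expert selection for a single token, written in plain Python. It is an illustration of the general mixture-of-experts technique only, not DeepSeek's implementation. The expert count, the value of k, the toy "experts" (simple scaling functions), and every function name are assumptions chosen for the example.

```python
import math
import random


def softmax(scores):
    """Convert raw gate scores into probabilities."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def make_expert(scale):
    # Hypothetical "expert": a stand-in for a feed-forward sub-network.
    return lambda x: [scale * v for v in x]


def route_top_k(gate_scores, k):
    """Return (expert_index, weight) pairs for the k highest-scoring experts."""
    probs = softmax(gate_scores)
    top = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    norm = sum(probs[i] for i in top)  # renormalize over the chosen experts
    return [(i, probs[i] / norm) for i in top]


def moe_layer(token, gate_weights, experts, k=2):
    """Run one token through only its top-k experts and mix their outputs."""
    # Gate: one linear score per expert (dot product of token with a gate row).
    scores = [sum(w * v for w, v in zip(row, token)) for row in gate_weights]
    chosen = route_top_k(scores, k)
    out = [0.0] * len(token)
    for idx, weight in chosen:
        y = experts[idx](token)  # only the selected experts are evaluated
        out = [o + weight * yi for o, yi in zip(out, y)]
    return out, [idx for idx, _ in chosen]


if __name__ == "__main__":
    rng = random.Random(0)
    num_experts, dim = 8, 4  # illustrative sizes, not V4's configuration
    experts = [make_expert(i + 1) for i in range(num_experts)]
    gate_weights = [[rng.uniform(-1, 1) for _ in range(dim)]
                    for _ in range(num_experts)]
    token = [0.5, -1.0, 0.25, 2.0]
    out, used = moe_layer(token, gate_weights, experts, k=2)
    print("experts used:", used)
    print("fraction of experts active:", len(used) / num_experts)
```

With 8 experts and k set to 2, only a quarter of the expert parameters do any work for a given token, which is where the compute saving comes from. All 8 experts still have to be stored, since a different token may be routed to any of them.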

DeepSeek’s rise has been one of the more surprising stories in AI. The company is owned and funded by High-Flyer, a quantitative hedge fund, giving it a financial backstop that most AI startups lack. Liang Wenfeng serves as CEO of both firms, and his background in quantitative trading, a field obsessed with speed and efficiency, may have informed the sparse architecture choices now making headlines. The lab operates out of Hangzhou, a tech hub already home to Alibaba and a dense ecosystem of AI talent.

What makes DeepSeek V4 particularly interesting is the timing. The model lands in a market where the cost of training frontier models has become a central anxiety for the industry. DeepSeek-R1’s relatively low training bill was widely discussed as a proof point that efficient engineering could undercut the massive capital expenditures of companies like OpenAI and Google. V4 extends that logic: if a trillion-parameter model can be built and run efficiently, the barriers to entry for the next generation of AI might be lower than many assumed.

The open-weight release also puts pressure on competitors who have been moving toward more closed, API-only models. Meta’s Llama series has been the most prominent open alternative, but DeepSeek V4 now offers a Chinese counterpart with comparable scale. For researchers in academia, startups, and countries without access to the latest Western models, V4 provides a powerful tool that can be inspected, fine-tuned, and deployed without licensing fees or API dependence.

Looking ahead, the arrival of DeepSeek V4 suggests that the frontier of open AI is not slowing down. If the sparse architecture delivers on its efficiency promises, it could accelerate a shift in how the entire field thinks about model scale — away from brute-force parameter counts and toward smarter, more selective computation. For Liang Wenfeng and his team, today’s release is a statement that the most ambitious AI work is no longer confined to Silicon Valley. The next phase of this story will be written in Hangzhou, and the model is now in the hands of the world.
