TechCrunch AI • 95일 전

딥시크, 최신 AI 모델 V4 공개…

IMP

9/10

핵심 요약

중국의 AI 연구소 딥시크(DeepSeek)가 최신 대규모 언어 모델인 DeepSeek V4(Flash 및 Pro)의 프리뷰 버전을 공개했습니다. 이번 모델은 최대 1.6조 개의 파라미터를 갖춘 오픈웨이트 모델로서, 미스트럭스(Mixture-of-experts) 방식을 채택해 추론 비용을 절감하면서도 추론 및 코딩 벤치마크에서 최고 수준의 폐쇄형 모델들과 거의 차이를 좁혔습니다. 특히 기존 최고 성능 모델들과 비교해 압도적으로 저렴한 API 사용 비용을 제공하며 시장의 경쟁력을 확보하고 있습니다.

번역된 본문

중국의 AI 연구소 딥시크(DeepSeek)는 작년에 출시되어 AI 업계를 뒤흔든 V3.2 모델 및 R1 추론 모델의 오랜 기다림 끝의 업데이트인 최신 대규모 언어 모델, DeepSeek V4의 두 가지 프리뷰 버전을 공개했습니다.

딥시크에 따르면 DeepSeek V4 Flash와 V4 Pro는 모두 100만 토큰의 컨텍스트 윈도우를 갖춘 미스트럭스(Mixture-of-experts) 모델이며, 이는 대규모 코드베이스나 문서를 프롬프트에 포함시키기에 충분한 용량입니다. 미스트럭스 방식은 작업당 특정 수의 파라미터만 활성화하여 추론 비용을 낮추는 기술입니다.

Pro 모델은 총 1.6조 개의 파라미터(그중 490억 개 활성)를 보유하고 있어, 현재 사용 가능한 가장 큰 오픈웨이트 모델이 되었습니다. 이는 문샷 AI(Moonshot AI)의 Kimi K 2.6(1.1조), 미니맥스(MiniMax)의 M1(4560억)을 뛰어넘고, 기존 DeepSeek V3.2(6710억)의 두 배 이상 규모입니다. 보다 작은 버전인 V4 Flash는 2840억 개의 파라미터(130억 개 활성)를 보유하고 있습니다.

딥시크는 두 모델 모두 아키텍처 개선으로 인해 DeepSeek V3.2보다 효율성과 성능이 뛰어나며, 추론 벤치마크에서 오픈소스 및 폐쇄형 기반의 현재 최고 수준 모델들과 격차를 거의 "close the gap" 좁혔다고 밝혔습니다. 딥시크는 새로운 V4-Pro-Max 모델이 추론 벤치마크에서 오픈소스 동급 모델을 능가하며, 일부 작업에서는 OpenAI의 GPT-5.2와 Gemini 3.0 Pro를 앞선다고 주장합니다. 코딩 대회 벤치마크에서는 두 V4 모델의 성능이 "GPT-5.4와 비슷한 수준"이라고 덧붙였습니다.

그러나 이 모델들은 지식 테스트, 특히 OpenAI의 GPT-5.4와 구글의 최신 Gemini 3.1 Pro와 비교할 때 최신 모델들에 비해 약간 뒤처지는 것으로 보입니다. 이러한 지연은 "최첨단 모델보다 약 3~6개월 뒤진 개발 궤적"을 보여준다고 연구소는 전했습니다. 오디오, 비디오 및 이미지의 이해와 생성을 지원하는 많은 폐쇄형 모델들과 달리, V4 Flash와 V4 Pro 모두 텍스트만 지원합니다.

주목할 점은, DeepSeek V4가 오늘날 사용 가능한 어떤 최신 모델보다도 가격이 훨씬 저렴하다는 것입니다. 작은 버전인 V4 Flash 모델은 백만 입력 토큰당 0.14달러, 백만 출력 토큰당 0.28달러의 비용이 들어 GPT-5.4 Nano, Gemini 3.1 Flash, GPT-5.4 Mini, Claude Haiku 4.5보다 저렴합니다. 반면 더 큰 V4 Pro 모델은 백만 입력 토큰당 0.145달러, 백만 출력 토큰당 3.48달러로 Gemini 3.1 Pro, GPT-5.5, Claude Opus 4.7, GPT-5.4보다 가격이 낮습니다.

이번 출시는 미국이 중국이 수천 개의 프록시 계정을 사용하여 미국 AI 연구소의 지식재산권을 산업 규모로 도난했다고 비난한 지 하루 만에 이루어졌습니다. 딥시크 자체도 안스로픽(Anthropic)과 오픈AI(OpenAI)로부터 그들의 AI 모델을 본질적으로 복사하는 일종의 '증류(distilling)'를 했다는 비난을 받아왔습니다.

원문 보기

원문 보기 (영어)

Chinese AI lab DeepSeek has launched two preview versions of its newest large language model, DeepSeek V4 , a much-awaited update to last year's V3.2 model and the accompanying R1 reasoning model that took the AI world by storm . The company says both DeepSeek V4 Flash and V4 Pro are mixture-of-experts models with context windows of 1 million tokens each — enough to allow large codebases or documents to be used in prompts. The mixture-of-experts approach involves activating only a certain number of parameters per task to lower inference costs. The Pro model has a total of 1.6 trillion parameters (49 billion active), which makes it the biggest open-weight model available, outstripping Moonshot AI's Kimi K 2.6 (1.1 trillion), MiniMax's M1 (456 billion), and more than double DeepSeek V3.2 (671 billion). The smaller, V4 Flash has 284 billion parameters (13 billion active). DeepSeek says both models are more efficient and performant than DeepSeek V3.2 due to architectural improvements, and have almost "closed the gap" with current leading models, both open and closed, on reasoning benchmarks. The company claims its new V4-Pro-Max model outperforms its open-source peers across reasoning benchmarks, and outstrips OpenAI's GPT-5.2 and Gemini 3.0 Pro on some tasks. In coding competition benchmarks, DeepSeek said both V4 models' performance is "comparable to GPT-5.4." However, the models seem to fall slightly behind frontier models in knowledge tests, specifically OpenAI's GPT-5.4 and Google's latest Gemini 3.1 Pro. This lag suggests a "developmental trajectory that trails state-of-the-art frontier models by approximately 3 to 6 months," the lab wrote. Both V4 Flash and V4 Pro support text only, unlike many of its closed-source peers, which offer support for understanding and generating audio, video, and images. Techcrunch event Meet your next investor or portfolio startup at Disrupt Your next round. Your next hire. Your next breakout opportunity. Find it at TechCrunch Disrupt 2026, where 10,000+ founders, investors, and tech leaders gather for three days of 250+ tactical sessions, powerful introductions, and market-defining innovation. Register now to save up to $410. Meet your next investor or portfolio startup at Disrupt Your next round. Your next hire. Your next breakout opportunity. Find it at TechCrunch Disrupt 2026, where 10,000+ founders, investors, and tech leaders gather for three days of 250+ tactical sessions, powerful introductions, and market-defining innovation. Register now to save up to $410. San Francisco, CA | October 13-15, 2026 REGISTER NOW Notably, DeepSeek V4 is much more affordable than any frontier model available today. The smaller V4 Flash model costs $0.14 per million input tokens and $0.28 per million output tokens, undercutting GPT-5.4 Nano, Gemini 3.1 Flash, GPT-5.4 Mini, and Claude Haiku 4.5. The larger V4 Pro model, meanwhile, costs $0.145 per million input tokens and $3.48 per million output tokens, also undercutting Gemini 3.1 Pro, GPT-5.5, Claude Opus 4.7, and GPT-5.4. The launch comes a day after the U.S. accused China of stealing American AI labs' IP on an industrial scale using thousands of proxy accounts. DeepSeek itself has been accused by Anthropic and OpenAI of " distilling ," essentially copying, their AI models. Topics AI , China , deepseek , Deepseek V4 , open source ai When you purchase through links in our articles, we may earn a small commission . This doesn’t affect our editorial independence. Ram Iyer Editor Ram is a financial and tech reporter and editor. He covered North American and European M&A, equity, regulatory news and debt markets at Reuters and Acuris Global, and has also written about travel, tourism, entertainment and books. You can contact or verify outreach from Ram by emailing ram.iyer@techcrunch.com . View Bio April 30 San Francisco, CA StrictlyVC kicks off the year in SF. Register now for unfiltered fireside chats and VC insights with leaders from Uber, Replit, Eclipse, and more. Plus, high-value connections that actually move the needle. Tickets are limited. REGISTER NOW Most Popular Microsoft offers buyout for up to 7% of US employees Amanda Silberling Duolingo is now giving users access to advanced learning content Lauren Forristal Unauthorized group has gained access to Anthropic's exclusive cyber tool Mythos, report claims Lucas Ropek SpaceX is working with Cursor and has an option to buy the startup for $60B Tim Fernholz Tim Cook stepping down as Apple CEO, John Ternus taking over Amanda Silberling Connie Loizos Blue Origin's New Glenn put a customer satellite in the wrong orbit during its third launch Sean O'Kane Palantir posts mini-manifesto denouncing inclusivity and ‘regressive’ cultures Anthony Ha

DeepSeek 오픈소스 AI 대규모 언어 모델 가격 경쟁력 미중 AI 경쟁