TechCrunch AI • 61일 전

AI 진정한 병목은 연산이 아닌 '메모리'

IMP

8/10

핵심 요약

스타트업 XCENA(엑세나)는 AI 처리 과정의 비효율적인 데이터 이동 문제를 해결하기 위해 메모리(DRAM) 바로 옆에서 연산을 수행하는 칩(MX1)을 개발해 1억 3,500만 달러(시리즈 B)를 유치했습니다. 이 칩은 기존에 수십 대의 서버가 필요했던 작업을 단 한 대로 처리할 수 있게 해주어 막대한 AI 인프라 비용을 절감할 수 있다는 점에서 하이퍼스케일러들에게 매우 중요한 혁신으로 평가받습니다.

번역된 본문

ChatGPT에 질문할 때마다 질문은 데이터 릴레이 레이스를 촉발합니다. 정보는 메모리를 떠나 전처리를 위해 CPU(Central Processing Unit)를 거치고, 본격적인 연산을 위해 GPU(Graphics Processing Unit)로 이동한 뒤 다시 되돌아옵니다. 그리고 AI가 단어를 하나 생성할 때마다 이 전체 여정이 반복됩니다. 이러한 병목 현상은 구조적인 문제입니다. 즉, 모든 요청마다 업계에서 가장 비싸고 전력을 많이 소비하는 칩들을 거쳐야 한다는 뜻입니다. 한국과 미국에 거점을 둔 스타트업 XCENA가 해결하려고 하는 것은 바로 이러한 비효율성입니다.

창립 4년차인 이 스타트업은 컴퓨팅 기능을 DRAM(Dynamic Random Access Memory, 프로세서가 활발하게 사용하는 데이터를 저장하는 빠르고 단기적인 메모리 칩)과 훨씬 더 가깝게 배치한 칩을 설계했습니다. 이를 통해 CPU, GPU, 메모리 사이의 비용이 많이 드는 왕복 없이도 메모리 근처에서 일상적인 데이터 작업을 처리할 수 있습니다. 이 기술이 대규모로 작동한다면 AI 인프라 비용에 미치는 영향은 상당할 것이며, 이것이 바로 투자자들이 열광하는 주된 이유입니다. 실제로 XCENA는 최근 시리즈 B 라운드에서 5억 7,000만 달러(약 7,600억 원)의 기업가치로 1억 3,500만 달러(약 1,800억 원)를 유치하며 총 투자 유치액을 1억 8,500만 달러로 늘렸습니다.

XCENA의 김진 대표는 2022년 삼성과 SK하이닉스, 엔비디아 GPU에 칩을 공급하는 메모리 거인 출신의 최고기술책임자(CTO) 김도훈, 최고제품책임자(CPO) 김주현과 함께 이 스타트업을 공동 창립했습니다.

김진 대표는 TechCrunch와의 인터뷰에서 "수십 년 동안 CPU와 GPU는 모두 더 똑똑해졌지만, 메모리는 그렇지 않았다. XCENA는 이를 바꾸고자 한다"며 "최근 메모리 가격과 관련 주식의 상승은 AI 인프라가 메모리 중심 아키텍처로 광범위하게 전환하고 있음을 시사한다"고 덧붙였습니다. (이번 달, 글로벌 메모리 칩 시장을 장악하고 있는 삼성전자, SK하이닉스, 마이크론 등 3개사는 모두 창사 이래 처음으로 시가총액 1,000억 달러(약 13조 원)를 돌파했습니다.)

김 대표는 XCENA의 비즈니스 모델이 '추론(Inference)은 단순히 연산 문제가 아니라 점점 더 메모리 확장 문제가 되고 있다'는 논점에 베팅하고 있다고 밝혔습니다.

XCENA의 칩인 MX1은 CXL(Compute Express Link, 프로세서와 메모리 간의 전용 고속도로 역할)을 통해 CPU에 연결되며, 데이터가 메모리 모듈을 떠나야 할 필요가 있기 전에 미리 데이터를 처리합니다. 이는 데이터를 연산 쪽으로 가져오는 것이 아니라 연산을 데이터 쪽으로 가져가는 방식입니다. 이 회사는 과거에 10대의 서버가 필요했던 작업을 단 한 대의 서버로도 실행할 수 있을 것이라고 주장합니다.

김 대표는 "GPU는 AI 모델 학습의 핵심인 행렬 곱셈과 같은 복잡한 연산에 탁월하지만, KV 캐시 관리(모델이 이전 대화 문맥을 다시 처리하지 않아도 되도록 저장하는 시스템), 데이터 캐싱 및 전처리 등 주변 데이터 오케스트레이션의 상당 부분은 여전히 CPU에서 실행된다. 우리의 칩은 이러한 작업을 메모리 모듈 자체 내에서 직접 처리한다"고 설명했습니다.

작년 하반기 이후 메모리 솔루션에 대한 수요가 급증했으며, 회사는 시기가 자신들에게 유리하게 작용하고 있다고 보고 있습니다. 여러 글로벌 메모리 공급업체와의 대화는 초기 단계에 있으나 김 대표는 해당 기업들의 이름을 밝히기를 거절했습니다. 이 회사의 이상적인 고객은 매년 수백억 달러를 AI 인프라에 쏟아붓는 하이퍼스케일러들이며, 메모리 효율성이 조금만 개선되어도 수억 달러의 비용을 절약할 수 있기 때문입니다.

MX1은 아직 프로토타입(시제품) 단계입니다. 양산용 칩은 2026년 말까지 삼성전자의 파운드리 라인을 통해 생산될 예정이며, 2027년부터 매출을 올릴 것으로 기대하고 있습니다.

신경망 처리 장치(NPU) 제조업체들이 학습 워크로드를 놓고 엔비디아에 도전하기 위해 경쟁하는 동안, XCENA는 그 기반에 있는 메모리 집약적인 계층을 타겟으로 삼고 있습니다. XCENA의 가장 가까운 경쟁사로는 차세대 메모리 연결성을 연구하는 나스닥 상장 기업인 아스테라 랩스(Astera Labs)와 마벨(Marvell)이 있습니다.

김 대표는 마벨이 이미 같은 분야에서 활동하는 대형 기존 기업이라고 말하며, 차별점은 결국 지적재산권(IP)에 달려있다고 덧붙였습니다. 김 대표는 "우리는 수천 개의 코어를 보유하고 있다"고 밝혔습니다. 공개된 사양을 기반으로 비교해 보면, 마벨의 접근 방식은 소수의 범용 코어에 의존합니다. 참고로 XCENA의 코어들은 오픈소스 칩 설계 청사진인 RISC-V를 기반으로 구축되었습니다.

원문 보기

원문 보기 (영어)

Every time you ask ChatGPT a question, your request triggers a data relay race. Information leaves memory, passes through a CPU for preprocessing, travels to a GPU for heavy computation, and then makes its way back — and that entire journey repeats for every single word the AI generates. The bottleneck is structural — it means routing through some of the most expensive and power-intensive chips in the industry on every single request. That inefficiency is exactly what XCENA , a startup with offices in South Korea and the U.S., is trying to solve. The four-year-old startup has designed a chip that places compute capabilities much closer to DRAM — the fast, short-term memory chips that store data a processor is actively using — allowing routine data operations to be handled near memory, without the costly round trips between CPUs, GPUs, and memory. If it works at scale, the implications for AI infrastructure costs could be significant, which largely explains investor enthusiasm around the country. Indeed, XCENA just raised $135 million in a Series B at a valuation of $570 million, bringing its total raised to $185 million. XCENA CEO Jin Kim co-founded the startup in 2022 alongside CTO Dohun Kim and CPO Harry Juhyun Kim, all veterans of Samsung and SK Hynix, the memory giants that supply chips powering Nvidia's GPUs. "CPUs and GPUs have both gotten smarter over the decades. Memory never did. XCENA wants to change that," Kim said in an interview with TechCrunch. "The recent rise in memory prices and related stocks points to a broader shift in AI infrastructure toward memory-centric architectures," he added. (This month, the three companies that dominate the global memory chip market — Samsung, SK Hynix, and Micron — each crossed a trillion-dollar valuation for the first time.) XCENA is betting its business on the thesis that "inference isn't just a compute problem; it's increasingly a memory scaling problem," said Kim. XCENA's chip, the MX1, connects to the CPU through CXL (Compute Express Link) — essentially a dedicated express lane between the processor and memory — processing data before it ever needs to leave the memory module. It brings compute to the data, not the other way around. The company claims that what used to require 10 servers could potentially run on just one. "While GPUs excel at matrix multiplication — the heavy math behind AI model training — much of the surrounding data orchestration, including preprocessing, KV cache management [the system that stores prior conversation context so a model doesn't have to reprocess it], and data caching, still runs on CPUs. Our chip handles those tasks directly within the memory module itself," Kim said. Demand for memory solutions has surged since the second half of last year, and the company believes the timing is working in its favor. Conversations with several global memory vendors are in early stages, though Kim declined to name them. The company's ideal customers are hyperscalers spending tens of billions a year on AI infrastructure, where even a small gain in memory efficiency can mean hundreds of millions in savings. The MX1 is still a prototype. Mass production chips are scheduled to roll off Samsung's foundry lines by the end of 2026, with the company expecting to generate revenue starting in 2027. While neural processing unit (NPU) makers are competing to challenge Nvidia for training workloads, XCENA is targeting the memory-intensive layer that sits underneath all of it. XCENA's closest rivals include Astera Labs and Marvell, both Nasdaq-listed companies working on next-generation memory connectivity. Marvell is a large, established player already working in the same space, Kim said, adding that the differentiator comes down to intellectual property. "We have thousands of cores," Kim said. Based on public specs, Marvell's approach relies on a handful of general-purpose cores by comparison. Those cores are built on RISC-V — an open-source chip design blueprint — and optimized specifically for data processing , with each core deliberately kept small and efficient. Beyond the cores themselves, XCENA designs its own internal memory hierarchy, interconnect bus, and DRAM controller — a level of vertical integration that most chip companies, including larger rivals, typically outsource. Seoul-based VC firms Altinum and IMM Investment co-led the Series B round, along with Corstone Asia and existing investors SBI Investment and Mirae Asset Capital. The company, which has more than 90 staff across offices in Pangyo, a tech hub outside Seoul, and in Sunnyvale, is also in conversations with international investors about additional funding. Topics AI , Asia , dram , memory chip shortage , memory chips , North America , Samsung Electronics , SK Hynix , south korea , South Korea , Startups , XCENA When you purchase through links in our articles, we may earn a small commission . This doesn’t affect our editorial independence. Kate Park Reporter, Asia Kate Park is a reporter at TechCrunch, with a focus on technology, startups and venture capital in Asia. She previously was a financial journalist at Mergermarket covering M&A, private equity and venture capital. View Bio October 13-15 San Francisco Early Bird ticket savings of up to $410 end May 29, 11:59 p.m. PT. Connect with 10,000+ tech leaders in unparalleled networking, 200+ hands-on sessions led by 250+ industry leaders, and discover 300+ showcasing startups. REGISTER NOW Most Popular Blue Origin's New Glenn rocket explodes during testing in Florida Sean O'Kane Meta launches Instagram, Facebook, and WhatsApp subscriptions, with more to come, including AI plans Sarah Perez Tech CEOs are apparently suffering from AI psychosis Julie Bort DuckDuckGo installs are up 30% as users reject being ‘force-fed’ Google’s AI Search Rebecca Bellan Starship's path to reusability looks murky after SpaceX's S-1 Tim Fernholz 6 kitchen gadgets that make adulting feel easier Lauren Forristal I tried Amazon's Bee wearable and am both intrigued and slightly creeped out Lucas Ropek

AI 반도체 메모리 반도체 스타트업 투자 인프라 비용 절감 CXL 기술