TechCrunch AI • 97일 전

구글, 엔비디아 겨냥해 AI 반도체 2종 새롭게 출시

IMP

8/10

핵심 요약

구글 클라우드가 8세대 커스텀 AI 칩(TPU)을 AI 모델 학습용(TPU 8t)과 추론용(TPU 8i) 두 가지로 세분화하여 출시했습니다. 신형 칩은 이전 세대 대비 최대 3배 빠른 학습 속도와 80% 향상된 가성비를 제공하며, 에너지 및 비용 효율성을 크게 높였습니다. 자체 칩을 강화하면서도 구글은 여전히 엔비디아 최신 칩(Vera Rubin)을 지원하고 네트워크 기술을 협업하는 등 보완적인 관계를 유지하고 있습니다.

번역된 본문

구글 클라우드는 수요일, 8세대 자체 제작 AI 칩인 텐서 처리 장치(TPU)를 두 가지 버전으로 나누어 출시한다고 발표했습니다. 모델 학습에 특화된 'TPU 8t'와 추론(Inference) 작업에 초점을 맞춘 'TPU 8i'가 바로 그것입니다. 추론이란 모델이 지속적으로 사용되는 과정, 즉 사용자가 프롬프트를 입력한 후 일어나는 일련의 작업을 의미합니다.

예상하실 수 있듯, 구글은 이전 세대와 비교했을 때 이 새로운 TPU의 놀라운 성능 스펙을 자랑했습니다. AI 모델 학습 속도는 최대 3배 빠르며, 달러당 성능은 80% 향상되었고, 단일 클러스터에서 100만 개 이상의 TPU를 함께 작동시킬 수 있습니다. 그 결과, 이전 버전보다 훨씬 적은 전력으로 더 많은 컴퓨팅 파워를 제공할 수 있게 되었으며, 이는 곧 고객의 비용 절감으로 이어집니다. 이 칩들을 GPU가 아닌 TPU라고 부르는 이유는, 저전력 맞춤형 칩이 처음에 'Tensor(텐서)'라는 이름으로 명명되었기 때문입니다.

하지만 구글의 칩이 엔비디아의 미래에 대한 전면적인 공격은 아닙니다. 적어도 아직은 아닙니다. 마이크로소프트와 아마존을 포함한 다른 거대 클라우드 제공업체들처럼, 구글 역시 이 칩들을 자사 인프라에서 제공하는 엔비디아 기반 시스템을 보완하기 위해 사용하고 있습니다. 구글이 엔비디아를 완전히 대체하려는 것은 아닙니다. 실제로 구글은 올해 하반기에 자사 클라우드에서 엔비디아의 최신 칩인 '베라 루빈(Vera Rubin)'을 사용할 수 있도록 할 것이라고 약속했습니다.

자체 AI 칩을 개발하는 하이퍼스케일러(아마존, 마이크로소프트, 구글 포함)들은 기업들이 AI 필요성을 클라우드로 옮기고 애플리케이션을 이 칩들로 이식하면서 언젠가는 엔비디아에 대한 의존도가 줄어들 수 있습니다. 그럼에도 불구하고 현 시점에서 엔비디아의 성장에 반하는 것은 현명한 투자가 아닙니다. 저명한 반도체 시장 분석가인 패트릭 무어head는 X(옛 트위터)에 농담조로, 검색 거인이 첫 TPU를 출시했던 2016년에 구글의 TPU가 엔비디아(그리고 인텔)에 나쁜 소식이 될 것이라고 예측했었다고 게시했습니다. 엔비디아는 현재 시가총액 약 5조 달러에 육박하는 기업이 되었으므로, 그 예측은 시간의 시험을 견뎌내지 못한 셈입니다. 모든 것이 엔비디아의 계획대로 진행된다면, 많은 워크로드가 구글 칩에서 실행되더라도 구글의 AI 클라우드 제공업체로서의 성장은 오히려 칩 메이커(엔비디아)를 위한 더 많은 비즈니스를 창출하게 될 것입니다.

실제로 구글은 또한 엔비디아와 협력하여 엔비디아 기반 시스템이 구글 클라우드에서 훨씬 더 효율적으로 작동할 수 있도록 컴퓨터 네트워킹을 엔지니어링하기로 합의했다고 밝혔습니다. 특히, 두 기술 거인은 소프트웨어 기반 네트워킹 기술인 'Falcon'을 강화하기 위해 협력하고 있습니다. Falcon은 구글이 2023년에 모든 오픈소스 데이터센터 하드웨어 조직의 대부격인 '오픈 컴퓨트 프로젝트(Open Compute Project)'를 통해 만들고 오픈소스화한 기술입니다.

원문 보기

원문 보기 (영어)

Google Cloud on Wednesday announced that its eighth generation of custom-built AI chips, or tensor processing units (TPUs), will be split in two. One chip, named the TPU 8t, will be geared for model training and another, the TPU 8i, is aimed at inference. Inference is the ongoing usage of models, aka what happens after users submit prompts. As you might expect, the company touts some impressive performance specs for these new TPUs compared to the previous generations: up to 3x faster AI model training, 80% better performance per dollar, and the ability to get 1 million+ TPUs to work together in a single cluster. The upshot should be a lot more compute for a lot less energy — and cost to customers — than previous versions. It calls these chips TPUs, not GPUs, because its custom low-power chips were originally named Tensor. But Google's chips are not a full frontal assault on Nvidia's future, at least not yet. Like the other giant cloud providers, including Microsoft and Amazon , Google is using these chips to supplement the Nvidia-based systems it offers in its infrastructure. It is not flat-out replacing Nvidia. In fact, Google promises its cloud will have Nvidia's latest chip, Vera Rubin, available later this year. One day the hyperscalers building their own AI chips (which includes Amazon, Microsoft and Google) may grow to need Nvidia less, as enterprises move their AI needs to their clouds and port their apps to these chips. Still, as things stand today, it's not profitable to bet against Nvidia. As notable chip market analyst Patrick Moore jokingly posted on X , he had predicted that Google's TPU could be bad news for Nvidia (and Intel) back in 2016 when the search giant launched its first one. Nvidia is now a nearly $5 trillion market cap company, meaning that prediction didn't exactly hold up to the test of time. If all goes according to Nvidia's plan, Google's growth as an AI cloud provider would result in more business for the chip maker not less, even if many a workload runs on Google's chips. Techcrunch event Meet your next investor or portfolio startup at Disrupt Your next round. Your next hire. Your next breakout opportunity. Find it at TechCrunch Disrupt 2026, where 10,000+ founders, investors, and tech leaders gather for three days of 250+ tactical sessions, powerful introductions, and market-defining innovation. Register now to save up to $410. Meet your next investor or portfolio startup at Disrupt Your next round. Your next hire. Your next breakout opportunity. Find it at TechCrunch Disrupt 2026, where 10,000+ founders, investors, and tech leaders gather for three days of 250+ tactical sessions, powerful introductions, and market-defining innovation. Register now to save up to $410. San Francisco, CA | October 13-15, 2026 REGISTER NOW In fact, Google also says it has agreed to work with Nvidia to engineer computer networking that allows Nvidia-based systems to perform even more efficiently in its cloud. In particular, the two tech giants are working to beef up the software-based networking tech called Falcon, which Google created and open sourced in 2023 under the godfather of all open source data center hardware organizations, the Open Compute Project . Topics AI , AI chips , Enterprise , Google , google cloud , Google Cloud Next , nvidia When you purchase through links in our articles, we may earn a small commission . This doesn’t affect our editorial independence. Julie Bort Venture Editor Julie Bort is the Startups/Venture Desk editor for TechCrunch. You can contact or verify outreach from Julie by emailing julie.bort@techcrunch.com or via @Julie188 on X. View Bio April 30 San Francisco, CA StrictlyVC kicks off the year in SF. Get in the room for unfiltered fireside chats with industry leaders, insider VC insights, and high-value connections that actually move the needle. Tickets are limited. REGISTER NOW Most Popular Unauthorized group has gained access to Anthropic's exclusive cyber tool Mythos, report claims Lucas Ropek Tim Cook stepping down as Apple CEO, John Ternus taking over Amanda Silberling Connie Loizos Blue Origin's New Glenn put a customer satellite in the wrong orbit during its third launch Sean O'Kane Palantir posts mini-manifesto denouncing inclusivity and ‘regressive’ cultures Anthony Ha ‘Tokenmaxxing' is making developers less productive than they think Tim Fernholz Anthropic launches Claude Design, a new product for creating quick visuals Aisha Malik Anthropic CPO leaves Figma's board after reports he will offer a competing product Tim Fernholz

구글 AI 반도체 TPU 엔비디아 클라우드