메뉴
BL
r/singularity 46일 전

엔비디아 "AI 도입으로 GPU 설계 10개월 업무, 하루 만에 처리"

IMP
8/10
핵심 요약

엔비디아는 칩 설계 과정 전반에 AI를 도입하여 설계 시간을 획기적으로 단축했다고 밝혔습니다. 특히 8명의 엔지니어가 10개월 걸리던 표준 셀 라이브러리 포팅 작업을 단일 GPU로 하룻밤 만에 끝낼 수 있게 되었습니다. 그러나 윌리엄 달리 수석 과학자는 완전한 무인 칩 설계에는 아직 멀었으며, 현재는 AI를 보조 및 최적화 도구로 활용해 인간 설계자 이상의 성능을 이끌어내는 단계라고 설명했습니다.

번역된 본문

엔비디아(Nvidia)는 칩 설계 과정의 모든 단계에 인공지능을 도입하여 개발 시간을 극적으로 단축하려고 노력하고 있다고 밝혔다. 특히 이 회사는 이전에는 8명의 엔지니어가 10개월 동안 완료해야 했던 표준 셀 라이브러리(standard cell library) 포팅 작업을 이제는 단일 GPU로 하룻밤 만에 수행할 수 있다고 밝혔다. 그러나 엔비디아의 윌리엄 달리(William Dally) 수석 과학자는 인공지능이 사람의 개입 없이 완전히 독자적으로 프로세서를 설계하는 수준에는 아직 많이 멀었다고 말했다.

달리는 구글의 제프 딘(Jeff Dean)과의 대담에서 "설계 과정에서 가능한 모든 곳에 AI를 사용하려고 노력하고 있다"며 "단순히 '새로운 GPU를 설계해 줘'라고 명령하여 처음부터 끝까지 자동으로 완성하는 단말단(end-to-end) 설계를 갖고 싶지만, 그것까지는 아직 갈 길이 멀다고 생각한다"고 전했다.

달리에 따르면, 엔비디아는 이미 회로 수준의 최적화부터 시스템 수준의 탐색에 이르기까지 칩 설계의 여러 단계에 걸쳐 AI를 사용하고 있으며, 이를 통해 수십 배에 달하는 생산성 향상을 달성했고 일부 경우에는 사람보다 더 나은 결과를 얻고 있다.

가장 낮은 수준에서 AI는 이미 표준 셀 개발을 변화시켰는데, 이는 새로운 제조 공정으로 전환할 때 가장 시간이 많이 걸리는 단계 중 하나다. 달리에 따르면, 약 2,500~3,000개의 셀로 구성된 표준 셀 라이브러리를 포팅하는 데는 이전에 8명의 엔지니어 팀이 약 10개월 동안 작업해야 했다. 엔비디아는 이 작업을 'NB-Cell'이라는 강화 학습(Reinforcement learning) 시스템으로 대체했으며, 이제 단일 GPU에서 하룻밤 만에 동일한 작업을 완료할 수 있다.

더 높은 수준에서 엔비디아는 지금까지 개발한 모든 GPU를 다루는 사내 독점 아키텍처 문서로 학습된 내부 대형 언어 모델(LLM)인 '칩 네모(Chip Nemo)'와 '버그 네모(Bug Nemo)'를 개발했다. 이러한 LLM은 주니어 설계자에게 복잡한 하드웨어 블록이 어떻게 작동하는지 설명해 줄 수 있는 엔지니어링 어시스턴트 역할을 한다. 그 결과 엔비디아는 더 이상 LLM이 처리할 수 있는 문제로 수석 엔지니어를 괴롭힐 필요가 없어졌다.

달리는 "우리는 칩 네모와 버그 네모라는 일련의 LLM을 보유하고 있었다. 범용 LLM을 가져와서 엔비디아 고유의 모든 설계 문서를 제공하여 미세 조정(fine-tuning)했다"고 밝혔다. 그는 "이는 회사 밖에서는 얻을 수 없는 것으로, 모든 RTL, 하드웨어 설계 문서, 엔비디아에서 설계된 모든 GPU의 RTL, 그리고 이들에 대한 모든 아키텍처 사양을 포함한다. 이제 GPU 설계에 대해 매우 똑똑한 이 LLM을 갖게 된 것이다. [...] 주니어 설계자가 있을 때 그들에게 칩 네모에게 물어보라고 하면, 칩 네모가 (GPU가 어떻게 작동하는지) 설명해 줄 것이다. 그런 방식으로 생산성을 향상시킨다. 매우 인내심 있는 멘토 역할을 하는 셈이다."라고 설명했다.

셀 라이브러리와 엔지니어링 지원을 넘어, 엔비디아는 고전적인 회로 설계 문제에 강화 학습을 적용하고 있다. 예를 들어, 강화 학습 기반 시스템은 시행착오를 통해 설계 옵션을 탐색하며, 이러한 접근 방식은 인간보다 빠르게 면적, 전력, 성능 측면에서 사람의 결과를 뛰어넘는 칩 설계를 만드는 데 도움을 준다.

달리는 "이 시스템은 인간은 생각지도 못했을 완전히 기발하고 엉뚱한(bizarre) 설계를 고안해 낸다."라고 덧붙이며, AI가 인간의 고정관념을 벗어난 혁신적인 칩 설계의 가능성을 보여주고 있음을 시사했다.

원문 보기
원문 보기 (영어)
Copy link Facebook X Whatsapp Reddit Pinterest Flipboard Email Share this article 7 Join the conversation Follow us Add us as a preferred source on Google Newsletter Stay On the Cutting Edge: Get the Tom's Hardware Newsletter Get Tom's Hardware's best news and in-depth reviews, straight to your inbox. Contact me with news and offers from other Future brands Receive email from us on behalf of our trusted partners or sponsors By submitting your information you agree to the Terms & Conditions and Privacy Policy and are aged 16 or over. You are now subscribed Your newsletter sign-up was successful An account already exists for this email address, please log in. Subscribe to our newsletter Nvidia says it is trying to introduce artificial intelligence to every stage of the chip design process, drastically reducing development time. Notably, the company has revealed that porting a standard cell library, a task that previously took eight engineers 10 months to complete, can now be done overnight by a single GPU. However, the company's chief scientist, William Dally, says that artificial intelligence is still not quite close to designing a processor completely by itself. Go deeper with TH Premium: GPUs Desktop Roadmap Enterprise Roadmap Rubin in-depth The Stout Owl: The ultimate Noctua G2 PC "We are trying to use AI wherever we can in our design process," Dally told Google's Jeff Dean . "I would love to have the end-to-end stage where I could simply say, 'design me the new GPU,' but I think we are a long way from that." Nvidia is already using AI across multiple stages of chip design, from circuit-level optimizations to system-level exploration, and achieves orders-of-magnitude productivity gains and, in some cases, better-than-human results, according to Dally. Article continues below At the lowest level, AI has already transformed standard cell development, one of the most time-consuming steps when transitioning to a new fabrication process. Porting a standard cell library of roughly 2,500–3,000 cells previously required a team of eight engineers working for about 10 months, according to Dally. Nvidia has replaced this work with a reinforcement learning system called NB-Cell, which can now complete the same task overnight on a single GPU. At a higher level, Nvidia has developed internal large language models — Chip Nemo and Bug Nemo — trained on proprietary architecture documentation covering all GPUs that Nvidia has ever developed. These LLMs can act like engineering assistants who can explain to junior designers how complex hardware blocks work. As a result, Nvidia no longer has to bother senior engineers about things that can be done by LLMs. "We had a series of LLMs that we called Chip Nemo and Bug Nemo. We took a generic LLM, and then we fine-tuned it by feeding it all of the design documents proprietary to Nvidia," Dally said. "So this is stuff you cannot get outside the company, it is all of the RTL, hardware design docs, all of the RTL for every GPU ever designed at Nvidia, all of the architecture specs for those. Now you have this LLM that is actually very smart about GPU design. […] When you have a junior designer, they can ask Chip Nemo, and Chip Nemo will explain [how GPUs work]. It improves productivity that way; it is a very patient mentor." Beyond cell libraries and engineering assistance, Nvidia is applying reinforcement learning to classical circuit design problems. For example, an RL-based system explores design options in a trial-and-error manner, and this approach helps to create chip designs that exceed human results in terms of area, power, and performance faster than humans can. Stay On the Cutting Edge: Get the Tom's Hardware Newsletter Get Tom's Hardware's best news and in-depth reviews, straight to your inbox. Contact me with news and offers from other Future brands Receive email from us on behalf of our trusted partners or sponsors "It comes up with totally bizarre designs that no human would ever come up with, but they are actually 20% or 30% better than the human designs," said Dally. In addition to using AI for place and route, Nvidia is also using AI to explore architectural designs. In particular, Nvidia's agent-based systems run large numbers of experiments, evaluate different design directions, and narrow down viable configurations. This greatly accelerates decision-making in the early stages of the chip development cycle when engineers must choose between various architectural trade-offs. Last but not least, Nvidia uses AI for design verification, one of the longest stages in the chip development cycle. Nonetheless, AI still cannot be responsible for the whole verification process, so Nvidia must emulate its designs and conduct actual experiments to ensure that everything works fine. "We would like to collapse that space, what the really long pole is design verification," said Dally. "We are particularly looking at how we can use AI to prove that designs work more quickly." In the long term, Nvidia's chief scientist envisions chip development to shift to a multi-agent model in which specialized AI systems will handle different parts of the design, like human teams do today. For now, AI acts to cut development time by assisting engineers and improving design quality to levels beyond what humans can do, which in turn enables engineers to explore more design options than before. Follow Tom's Hardware on Google News , or add us as a preferred source , to get our latest news, analysis, & reviews in your feeds. TOPICS See all comments (7) Anton Shilov Contributing Writer Anton Shilov is a contributing writer at Tom’s Hardware. Over the past couple of decades, he has covered everything from CPUs and GPUs to supercomputers and from modern process technologies and latest fab tools to high-tech industry trends. 7 Comments Comment from the forums What they won't tell you is how many man/hours were used to create those automation steps. Granted, it will create benefits long term, but it is not for sure a irrelevant part of the process. They are highlighting only the good part of the thing, without indicating any of the drawbacks (I don't count the indicated limitations as drawbacks). Reply Ostensibly, this allows for an annual GPU refresh cycle, instead of Super refreshes on existing chips. It could open the door for more flexibility with GPU features and VRAM size to meet current needs. On the other hand, RTX GPU driver quality has really gone down the drain in recent years, and I don't see how it could improve by having a new RTX series each year with differing feature sets. Reply Finally... cheaper gpu's 🤣 Reply What it doesn’t sag is how much work has to go into fixing the design after it’s produced Reply So if the Ai is good enough to replace engineers, then it can definitely replace the "work" done by the CEO and Board. Reply Nvidia is actively trying to create Skynet. Reply AI designing chips, what could go wrong? Reply View All 7 Comments Show more comments