MIT Tech Review • 106일 전

AI에 대한 평가가 극명하게 엇갈리는 이유

IMP

8/10

핵심 요약

스탠퍼드 대학의 연례 AI 보고서는 미국의 압도적인 데이터센터 인프라와 TSMC에 의존하는 공급망의 취약성 등 핵심 통계를 제시합니다. 특히 최근 AI 생태계의 가장 큰 특징은 전문가와 일반 대중 간의 인식 격차가 매우 크다는 점입니다. 코딩 등 기술적 작업에 매일 최신 AI를 사용하는 파워 유저는 기술의 발전에 압도되지만, 일반적인 용도로 무료 버전을 사용하는 대중은 여전히 어리석은 실수를 하는 AI에 실망하는 등 사용 경험의 차이가 극명한 갈림길에 서 있습니다.

번역된 본문

이 기사는 원래 당사의 주간 AI 뉴스레터인 The Algorithm에 실린 내용입니다. 이와 같은 기사를 가장 먼저 받아보고 싶다면 여기에서 구독하세요.

결코 멈추지 않는 이 업계에서 스탠퍼드의 'AI 인덱스(연례 핵심 결과 및 트렌드 요약)'는 숨을 돌릴 수 있는 기회를 제공합니다. (결국 이는 단거리 스프린트가 아닌 마라톤이니까요.) 오늘 공개된 올해의 보고서에는 놀라운 통계가 가득합니다. 이 보고서의 상당한 가치는 미국이 다른 어느 국가보다도 AI 분야에 더 맹렬하게 매달리고 있다는 직관적인 느낌을 뒷받침해 주는 구체적인 수치에서 나옵니다. 미국은 (계속 증가하는 추세인) 5,427개의 데이터센터를 보유하고 있습니다. 이는 다른 어떤 국가보다 10배 이상 많은 수치입니다.

또한 AI 산업이 의존하는 하드웨어 공급망에 상당한 병목 현상이 존재한다는 사실도 상기시켜 줍니다. 어쩌면 가장 놀라운 사실은 다음과 같을 것입니다. "단일 기업인 TSMC가 거의 모든 최첨단 AI 칩을 제조하기 때문에, 전 세계 AI 하드웨어 공급망은 대만에 위치한 단 하나의 파운드리에 의존하고 있습니다." 단 하나의 파운드리라니! 정말 터무니없는 일입니다.

하지만 2026 AI 인덱스(역자 주: 원문 기준 연도)를 통해 얻은 가장 중요한 교훈은 현재 AI의 상태가 모순으로 가득 차 있다는 것입니다. 동료 기자 미셸 김(Michelle Kim)이 오늘 이 보고서에 관한 기사에서 말했듯이, "AI 뉴스를 따라간다면 아마 혼란스러울 것입니다. AI는 골드러시입니다. AI는 거품입니다. AI가 당신의 일자리를 빼앗고 있습니다. 심지어 AI는 시계도 읽지 못합니다." (스탠퍼드 보고서에 따르면, 구글 딥마인드의 최고 수준 추론 모델인 '제미니 딥 씽크(Gemini Deep Think)'는 국제 수학 올림피아드에서 금메달을 받았지만, 아날로그 시계의 반을 읽지 못합니다.)

미셸은 보고서의 핵심을 훌륭하게 요약했습니다. 하지만 저는 머릿속에서 떠나지 않는 하나의 질문에 집중하고 싶었습니다. 왜 지금 AI에서 정확히 무슨 일이 일어나고 있는지 파악하기가 이렇게 힘들까요?

가장 큰 격차는 전문가와 비전문가 사이에 존재하는 것 같습니다. AI 인덱스의 저자들은 이렇게 말합니다. "AI 전문가와 일반 대중은 이 기술의 궤적을 매우 다르게 봅니다. AI가 일자리에 미치는 영향을 평가할 때 미국 전문가의 73%가 긍정적인 반면, 일반 대중은 23%에 불과해 50% 포인트의 격차가 존재합니다. 경제 및 의료 분야에서도 비슷한 격차가 나타납니다." 정말 엄청난 격차입니다. 대체 무슨 일이 벌어지고 있는 걸까요? 대중은 모르는 무엇을 전문가가 알고 있는 걸까요? (여기서 '전문가'는 2023년과 2024년 AI 학회에 참가한 미국 기반 연구원을 의미합니다.)

저는 그 이유 중 하나가 전문가와 비전문가가 각자 매우 다른 경험을 바탕으로 견해를 형성하고 있기 때문이라고 생각합니다. 얼마 전 한 소프트웨어 개발자가 X(옛 트위터)에 "AI에 감탄하는 정도는 코딩을 위해 AI를 얼마나 사용하는지와 완벽하게 비례한다"고 게시했습니다. 농담 같은 말일 수도 있지만, 분명히 일리 있는 말입니다. 최고 수준의 연구소에서 나온 최신 모델은 이제 코드를 생성하는 데 있어 그 어느 때보다 뛰어납니다. 코딩과 같은 기술적 작업은 결과가 명확하게 맞거나 틀리기 때문에, 열린 결말을 가진 작업들보다 모델을 학습시키기가 더 쉽습니다. 게다가 코딩이 가능한 모델은 수익성이 입증되고 있기 때문에, 모델 개발사들은 이를 개선하는 데 막대한 자원을 쏟아붓고 있습니다.

이는 코딩이나 기타 기술적 작업을 위해 이러한 도구를 사용하는 사람들이야말로 이 기술이 가장 좋은 상태일 때 경험하고 있음을 의미합니다. 하지만 그러한 사용 사례를 벗어나면 결과는 엇갈리게 됩니다. 대형 언어 모델(LLM)은 여전히 얄괴한 실수를 저지릅니다. 이 현상은 소위 '톱니바퀴 모서리(Jagged frontier)'로 알려져 있습니다. 모델은 어떤 것은 매우 잘하지만, 다른 것은 그렇지 못하다는 뜻입니다.

영향력 있는 AI 연구자인 안드레이 카파시(Andrej Karpathy) 역시 몇 가지 생각을 전했습니다. "제 타임라인을 보건대, AI 역량에 대한 이해도의 격차가 점점 벌어지고 있습니다."라고 그는 해당 X 게시물에 대한 답글로 적었습니다. 그는 파워 유저(즉, 코딩, 수학 또는 연구를 위해 LLM을 사용하는 사람들)가 최신 모델을 사용할 뿐만 아니라 최고 버전을 사용하기 위해 종종 한 달에 200달러를 지불한다고 지적했습니다. "올해 이 분야에서 이루어진 최근의 발전은 그야말로 엄청난 수준입니다."라고 그는 덧붙였습니다. LLM은 여전히 빠르게 발전하고 있기 때문에, 비용을 지불하고 '클로드 코드(Claude Code)'를 사용하는 사람은 6개월 전 무료 버전의 클로드를 사용해 결혼식을 계획하려 했던 사람과는 사실상 완전히 다른 기술을 경험하게 될 것입니다.

원문 보기

원문 보기 (영어)

This story originally appeared in The Algorithm, our weekly newsletter on AI. To get stories like this in your inbox first, sign up here . In an industry that doesn’t stand still, Stanford’s AI Index, an annual roundup of key results and trends , is a chance to take a breath. (It’s a marathon, not a sprint , after all.) This year’s report , which dropped today, is full of striking stats. A lot of the value comes from having numbers to back up gut feelings you might already have, such as the sense that the US is gunning harder for AI than everyone else: It hosts 5,427 data centers (and counting). That’s more than 10 times as many as any other country. There’s also a reminder that the hardware supply chain the AI industry relies on has some major choke points. Here’s perhaps the most remarkable fact: “A single company, TSMC, fabricates almost every leading AI chip, making the global AI hardware supply chain dependent on one foundry in Taiwan.” One foundry! That’s just wild. But the main takeaway I have from the 2026 AI Index is that the state of AI right now is shot through with inconsistencies. As my colleague Michelle Kim put it today in her piece about the report : “If you’re following AI news, you’re probably getting whiplash. AI is a gold rush. AI is a bubble. AI is taking your job. AI can’t even read a clock.” (The Stanford report notes that Google DeepMind’s top reasoning model, Gemini Deep Think, scored a gold medal in the International Math Olympiad but is unable to read analog clocks half the time.) Michelle does a great job covering the report’s highlights. But I wanted to dwell on a question that I can’t shake. Why is it so hard to know exactly what’s going on in AI right now? The widest gap seems to be between experts and non-experts. “AI experts and the general public view the technology’s trajectory very differently,” the authors of the AI Index write. “Assessing AI’s impact on jobs, 73% of U.S. experts are positive, compared with only 23% of the public, a 50 percentage point gap. Similar divides emerge with respect to the economy and medical care.” That’s a huge gap. What’s going on? What do experts know that the public doesn’t? (“Experts” here means US-based researchers who took part in AI conferences in 2023 and 2024.) I suspect part of what’s going on is that experts and non-experts base their views on very different experiences. “The degree to which you are awed by AI is perfectly correlated with how much you use AI to code,” a software developer posted on X the other day . Maybe that’s tongue-in-cheek, but there’s definitely something to it. The latest models from the top labs are now better than ever at producing code. Because technical tasks like coding have right or wrong results, it is easier to train models to do them, compared with tasks that are more open-ended. What’s more, models that can code are proving to be profitable, so model makers are throwing resources at improving them. This means that people who use those tools for coding or other technical work are experiencing this technology at its best. Outside of those use cases, you get more of a mixed bag. LLMs still make dumb mistakes. This phenomenon has become known as the “jagged frontier”: Models are very good at doing some things and less good at others. The influential AI researcher Andrej Karpathy also had some thoughts. “Judging by my [timeline] there is a growing gap in understanding of AI capability,” he wrote in reply to that X post. He noted that power users (read: people who use LLMs for coding, math, or research) not only keep up to date with the latest models but will often pay $200 a month for the best versions. “The recent improvements in these domains as of this year have been nothing short of staggering,” he continued. Because LLMs are still improving fast, someone who pays to use Claude Code will in effect be using a different technology from someone who tried using the free version of Claude to plan a wedding six months ago. Those two groups are speaking past each other. Where does that leave us? I think there are two realities. Yes, AI is far better than a lot of people realize. And yes, it is still pretty bad at a lot of stuff that a lot of people care about (and it may stay that way). Anyone making bets about the future on either side should bear that in mind. Deep Dive Artificial intelligence OpenAI is throwing everything into building a fully automated researcher An exclusive conversation with OpenAI’s chief scientist, Jakub Pachocki, about his firm's new grand challenge and the future of AI. By Will Douglas Heaven archive page How Pokémon Go is giving delivery robots an inch-perfect view of the world Exclusive: Niantic's AI spinout is training a new world model using 30 billion images of urban landmarks crowdsourced from players. By Will Douglas Heaven archive page This startup wants to change how mathematicians do math Axiom Math is giving away a powerful new AI tool. But it remains to be seen if it speeds up research as much as the company hopes. By Will Douglas Heaven archive page AI benchmarks are broken. Here’s what we need instead. One-off tests don’t measure AI’s true impact. We’re better off shifting to more human-centered, context-specific methods. By Angela Aristidou archive page Stay connected Illustration by Rose Wong Get the latest updates from MIT Technology Review Discover special offers, top stories, upcoming events, and more. Enter your email Privacy Policy Thank you for submitting your email! Explore more newsletters It looks like something went wrong. We’re having trouble saving your preferences. Try refreshing this page and updating them one more time. If you continue to get this message, reach out to us at customer-service@technologyreview.com with a list of newsletters you’d like to receive.

AI 인식 격차 스탠퍼드 AI 보고서 코딩 AI LLM 한계 AI 트렌드