The Decoder • 106 days ago

Stanford AI Report: Rapid Growth and Safety Concerns

Importance: 9/10

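Key Summary

According to the AI Index Report 2026 published by Stanford HAI, AI model performance has advanced rapidly to PhD level, yet models still make errors on basic tasks such as reading analog clocks. The AI performance gap between the US and China has effectively disappeared, and despite high adoption rates, public trust continues to decline amid concerns about job losses.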

Translated Text

Stanford's AI Index 2026 shows rapid progress, growing safety concerns, and declining public trust.

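[Key Points] According to the Stanford AI Index 2026, AI models outperform human baselines on PhD-level science questions but still fail at simple tasks such as reading analog clocks. The performance gap between the US and China has practically closed. The US leads in investment ($285.9 billion) but has lost around 89 percent of its incoming AI researchers since 2017. Generative AI reached 53 percent of the population faster than the PC or the internet, yet only 23 percent of the US public views its impact on the labor market as positive.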

The AI Index Report 2026 from Stanford HAI documents major performance leaps in AI models, a narrowing gap between the US and China, and mounting safety problems, all while public trust continues to erode.

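The AI Index Report 2026 is the annual assessment of artificial intelligence from Stanford's Institute for Human-Centered AI (HAI), tracking progress across research, industry, and societal impact. This year's edition shows how far the technology has come: AI models now outperform human baselines on PhD-level science questions and competition-level math.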

According to the report, performance on the SWE-bench Verified coding benchmark jumped from 60 percent to nearly 100 percent in a single year.

Google's Gemini Deep Think won a gold medal at the International Mathematical Olympiad. Despite all this progress, the "jagged frontier" phenomenon persists: the same top-tier model reads analog clocks correctly only 50.1 percent of the time.

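According to the report, the performance gap between the US and China has essentially closed. Since early 2025, models from the two countries have traded the top spot back and forth. As of March 2026, Anthropic's leading model holds an edge of just 2.7 percent.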

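China dominates in publication volume, citations, and industrial robotics, while the US leads in the number of top models and in investment. In 2025, $285.9 billion flowed into private AI investment in the US, 23 times more than in China. However, the number of AI researchers moving to the US has dropped 89 percent since 2017.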

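Productivity gains come with shrinking entry-level jobs

The report documents productivity gains of 14 to 26 percent in customer support and software development, and up to 72 percent in marketing teams. For tasks that require more judgment, however, the effects are weaker or even negative. AI agent adoption across businesses remains in the single digits in nearly every department.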

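There is a flip side to this story. In software development, where measured productivity gains are strongest, employment among US developers aged 22 to 25 has dropped nearly 20 percent since 2024. Meanwhile, the number of older developers continues to grow.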

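Over 50 percent adoption, but education can't keep up

According to the report, generative AI reached 53 percent of the population within three years, spreading faster than either the PC or the internet.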

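Among younger people, adoption is even higher: four out of five US students (80 percent) use AI for schoolwork. Yet only half of middle and high schools have AI policies in place, and just 6 percent of teachers say those policies are clearly defined.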

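Experts and the public live in different AI worlds

The report's most revealing finding may be the perception gap. Of US experts, 73 percent view AI's impact on the labor market positively, but only 23 percent of the general public shares that assessment. Similar divides appear around the economy and healthcare.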

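Trust in government AI regulation varies widely around the world. According to the Stanford report, among the countries surveyed the US ranks last in public trust in its own government to regulate AI, at just 31 percent. Globally, the EU enjoys more trust than either the US or China when it comes to effective AI regulation.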

Original Text (English)

Stanford's AI Index 2026 shows rapid progress, growing safety concerns, and declining public trust

Maximilian Schreiner
Apr 14, 2026

Key Points

According to the Stanford AI Index 2026, AI models outperform human baselines on PhD science questions but continue to fail at simple tasks such as reading analog clocks. The performance gap between the US and China has practically closed. The US leads in investment ($285.9 billion), but has lost around 89 percent of its incoming AI researchers since 2017. Generative AI reached 53 percent of the population faster than PCs or the internet, yet only 23 percent of the US public views the impact on the labor market as positive.

The AI Index Report 2026 from Stanford HAI documents major performance leaps in AI models, a narrowing gap between the US and China, and mounting safety problems, all while public trust continues to erode.

The AI Index Report 2026 is Stanford's Institute for Human-Centered AI annual assessment of artificial intelligence, tracking progress across research, industry, and societal impact. This year's edition shows just how far the technology has come: AI models now outperform human baselines on PhD-level science questions and competition-level math.

On the SWE-bench Verified coding benchmark, performance jumped from 60 to nearly 100 percent in a single year, according to the report.

Google's Gemini Deep Think won a gold medal at the International Mathematical Olympiad. But despite all this progress, the "jagged frontier" phenomenon persists. The same top-tier model can only read analog clocks correctly 50.1 percent of the time.

The performance gap between the US and China has essentially closed, according to the report. Since early 2025, models from both countries have been trading the top spot back and forth. As of March 2026, Anthropic's leading model holds just a 2.7 percent edge.

China dominates in publication volume, citations, and industrial robotics, while the US leads in the number of top models and investment: $285.9 billion flowed into private AI investment in 2025, 23 times more than in China. However, the number of AI researchers moving to the US has dropped 89 percent since 2017.

Productivity gains come with shrinking entry-level jobs

The report documents productivity gains of 14 to 26 percent in customer support and software development, and up to 72 percent in marketing teams. For tasks that require more judgment, though, the effects are weaker or even negative. AI agent adoption across businesses remains in the single digits in nearly every department.

There's a flip side to this story: in software development, where measured productivity gains are strongest, employment among US developers aged 22 to 25 dropped nearly 20 percent since 2024. Meanwhile, the number of older developers continues to grow.

Over 50 percent adoption, but education can't keep up

Generative AI reached 53 percent of the population within three years, spreading faster than either the PC or the internet, according to the report.

Among younger people, adoption is even higher: four out of five US students use AI for schoolwork. Yet only half of middle and high schools have AI policies in place, and just 6 percent of teachers say those policies are clearly defined.

Experts and the public live in different AI worlds

The report's most revealing finding may be the perception gap: 73 percent of US experts view AI's impact on the job market positively, but only 23 percent of the general public shares that assessment. Similar divides show up around the economy and healthcare.

Trust in government AI regulation varies widely around the world. Among the countries surveyed, the US ranks dead last in public trust in its own government to regulate AI, at just 31 percent, according to the Stanford report. Globally, the EU enjoys more trust than either the US or China when it comes to effective AI regulation.

Source: AI Index

Stanford AI Report • AI Safety • AI Job Displacement • US-China AI Competition • Public Trust