TechCrunch AI • 60일 전

AI 없이 코딩 못 하는 개발자들, 부메랑이 될 수도

IMP

8/10

핵심 요약

최근 연구에 따르면 개발자들이 AI 코딩 도구에 과도하게 의존하고 있지만, 오히려 코드 생성 속도는 빨라도 오류 수정과 유지보수 비용이 급증하는 역효과가 발생하고 있습니다. 우버 등 주요 기업들도 막대한 AI 예산을 소진했음에도 실질적인 생산성 향상을 입증하지 못하고 있어, AI 코딩 도구의 실질적 효용성과 장기적 리스크에 대한 경고가 커지고 있습니다.

번역된 본문

연구진에 따르면 2026년 현재, 개발자들의 손에서 AI 코딩 도구를 빼앗는 것은 불가능에 가깝다. AI가 코드 작성 속도를 높여주는 것은 분명하지만, 더 나은 코드를 만들어내고 있는지는 미지수라는 다른 연구진들의 경고가 나오고 있다. 이는 결국 개발자들에게 부메랑이 되어 돌아올 수 있다.

구체적으로, 2026년 2월 저명한 AI 연구 기관인 METR은 놀라운 사실을 발표했다. 대부분의 개발자가 제한된 수의 작업이라도 이제는 AI 없이는 작업하려 하지 않는다는 것이다. METR은 2025년 말 발표되었던 AI 코딩 생산성에 대한 획기적인 연구를 업데이트하고자 했다. 해당 연구에서 연구진은 오픈소스 개발자들이 수동으로 작업할 때와 AI를 사용할 때의 소요 시간을 측정했다. 연구에 참여한 개발자들은 AI가 자신의 생산성을 높인다고 보고했지만, 실제로는 작업 속도가 느려졌다는 사실에 충격을 받았다. AI가 코드를 빠르게 생성하기는 했지만, 정작 오류를 찾고 수정하고, AI를 올바른 방향으로 유도하며 작업이 완료될 때까지 기다리는 데 추가적인 시간을 소모해야 했던 것이다.

METR이 AI 기술의 발전과 개발자의 숙련도 향상을 측정하기 위해 이 실험을 반복하려 했을 때, 그들은 실험을 진행할 수 없었다. 연구진은 "단순히 연구를 위해서라도 개발자들이 AI 없이 일하는 것을 원하지 않기 때문에" 실험 참여를 거부했다고 밝혔다. 대신 METR은 5월에 기술 직군 직원들이 AI를 통한 생산성 향상을 스스로 평가하는 설문조사를 발표했다. 놀랍지 않게도, 그들은 AI 덕분에 조직 내에서 자신의 가치가 두 배로 높아졌다고 인식하고 있었다.

하지만 이른바 '토큰맥싱(Tokenmaxxing)'이라 불리는 터무니없는 비용 지출에 대한 최근의 보도와 여러 연구 결과들은 이러한 자기 인식이 얼마나 의심스러운지를 보여준다. 토큰맥싱, 즉 개인이 사용하는 토큰(Token)의 수를 AI 생산성의 대리 지표로 삼는 것은 2026년 들어 가장 뜨거운 트렌드였다. 하지만 이 트렌드는 이미 막을 내릴 수도 있다. 파이낸셜 타임즈는 이번 주 아마존이 직원들이 AI 에이전트를 과도하게 사용하여 비용을 급증시키고 시스템을 악용하자, 'Kirorank'라는 내부 토큰 추적 리더보드를 폐쇄했다고 보도했다. 이 직원들은 AI 사용이 곧 생산성 향상으로 자동 연결되지는 않는다는 것을 증명해버린 셈이다. 더 인포메이션에 따르면 우버는 올해 들어 4개월 만에 2026년 AI 예산을 모두 소진했다. 우버의 앤드루 맥도널드 COO는 최근 한 팟캐스트에서 이러한 지출이 프로젝트나 생산성의 측정 가능한 증가로 이어지지 않았다고 말했다.

프로그래머이자 작가인 제임스 쇼어(James Shore)는 해커 뉴스에서 화제가 된 블로그 게시물에서 AI가 생성한 코드가 반드시 지속적인 코드 유지보수의 필요성을 줄여주는 것은 아니며, 오히려 이를 증가시킬 수 있다고 우아하게 주장했다. 그는 "이제 코드 작성 속도가 두 배나 빨라졌나요? 유지보수 비용도 절반으로 줄어들길 바라야 할 겁니다."라며 "그렇지 않으면 곤란해질 겁니다. 여러분은 일시적인 속도 향상을 영구적인 부채와 교환하는 셈이니까요."라고 적었다.

AI가 코드 유지보수의 골칫거리를 증가시킨다는 다른 증거들도 있다. 안정성 엔지니어링 에이전트 스타트업 'Entelligence AI'의 설립자이자 CEO인 아이스와리아 산카르의 바이럴 트윗에 따르면, 기업들은 자체 AI가 생성한 버그를 수정하는 데 전체 토큰의 44%를 소비하고 있다. 한편 코드 리뷰 도구 기업인 코드래빗(Code Rabbit)은 오픈소스 풀 리퀘스트(Pull Request)를 분석한 결과, AI가 사람이 작성한 코드보다 1.7배 더 많은 문제를 일으킨다고 밝혔다. 솔직히 말해, 이는 AI 코드 리뷰 도구를 팔려는 기업들의 자사 홍보용 통계일 수 있다. 그럼에도 불구하고 독립적인 연구자들도 이러한 문제를 발견했다. 저명한 싱가포르 경영대학(SMU)의 연구진은 4월에 보고서를 발표하며, "AI가 생성한 코드는 실제 소프트웨어 프로젝트에 장기적인 유지보수 비용을 초래할 수 있다"고 경고했다.

프로그래머들이 자신의 AI 어시스턴트를 사랑한다는 점을 고려할 때, 해결책은 무엇일까? AI 코딩 에이전트를 팔고자 하는 사람들은 개발자들이 단순히 AI 코딩 에이전트를 사용하여 AI가 뱉어내는 코드를 수정하는 피곤한 작업을 빠르게 처리하면 된다고 말한다. 이것이 바로 AI 코딩 에이전트 '데빈(Devin)'의 개발사인 코그니션(Cognition)의 설립자이자 CEO인 스콧 우가 제안하는 바이다. 하지만 그조차도 데빈이 독립적으로 작업할 수 있긴 하지만, 현재 데빈의 기술 수준은 작업에 따라 주니어에서 미드 레벨 프로그래머 수준에 그친다고 인정했다. 이는 단순히 작업을 떠넘기고 잊어버리는 해결책이 아니다. SMU 연구진은...

원문 보기

원문 보기 (영어)

In 2026, you cannot snatch AI coding tools out of developers' vice-grip hands, researchers have discovered. But while AI is undoubtedly helping coders produce code faster, it may not be producing better code, other researchers warn. And that could cause problems down the road for them. Specifically, in February 2026, respected AI research lab METR published a surprising revelation: most developers won't work, even on a limited number of tasks, without AI anymore. METR had hoped to provide an update to some groundbreaking research published a few months earlier, in 2025, on AI coding productivity. In it, researchers measured how much time open source developers took to do tasks by hand versus with AI. While developers in that study reported that AI was making them more productive, they were shocked to learn it actually slowed them down. Sure, it generated code faster, but then they spent extra time finding and fixing errors, steering the AI and waiting on it to complete tasks. When METR set out to repeat the experiment to measure advances in AI and coder proficiency, they couldn’t. Devs weren't willing to participate “because they do not wish to work without AI" even just for the study, the researchers confessed. Instead, METR published a survey in May that allowed technical employees to self-report their AI productivity gains. Not surprisingly, they perceived that AI made them twice as valuable to their organizations. But recent headlines about the wild expense of so-called tokenmaxxing , coupled with a smattering of recent research, make such self-perceptions dubious. Tokenmaxxing, or using the number of tokens a person uses as a proxy for productivity with AI, has been the trend of 2026 so far. And it may already be over. Amazon shut down its internal token-tracking leaderboard called Kirorank after employees were gaming it by using AI agents excessively, and running up costs, the Financial Times reported this week. The employees proved that AI use does not automatically translate to increased productivity. Uber blew through its 2026 AI budget within the first four months of the year, The Information reported. COO Andrew Macdonald recently said on a podcast that such spending hadn’t led to a measurable increase in projects or productivity. AI-generated code also doesn’t necessarily reduce ongoing code maintenance needs, and may even increase it, programmer and author James Shore elegantly argued in a blog post that went viral on Hacker News. “You write code twice as quick now? Better hope you’ve halved your maintenance costs,” he wrote. “Otherwise, you’re screwed. You’re trading a temporary speed boost for permanent indenture.” There’s other evidence that AI can increase code maintenance woes. A viral tweet from Aiswarya Sankar, founder and CEO of reliability engineering agent startup Entelligence AI, proclaims that companies are spending 44% of their tokens on bug fixes that their AI generated. Meanwhile, code-reviewing tool company Code Rabbit says it analyzed open source pull requests and found that AI produced 1.7x more problems than human code. Those are, admittedly, self-serving stats from those trying to sell AI code reviewing tools. Yet independent researchers have also found such issues. Researchers from the respected Singapore Management University published a report in April warning that “AI-generated code can introduce long-term maintenance costs into real software projects.” Given that programmers love their AI assistants, what’s the solution? Well, those who want to sell you AI coding agents say devs can just use AI coding agents to do the bone-wearying tasks of fixing code as fast as AI spits it out. That’s what Cognition founder and CEO Scott Wu —the maker of AI coding agent Devin — suggests . But even he admits that, while Devin can work independently, he’d currently rate its skill between a junior and mid-level programmer, depending on the task. This is not a hand-it-off and forget it solution. The SMU researchers suggest a more human approach. Programmers should know what tasks AI does and doesn’t do well as deeply as they know their favorite coding languages. They need strong quality assurance systems designed for AI and they are stuck with carefully reviewing the AI’s work as if it were a junior dev. Meanwhile, the researchers say (and Wu agrees), humans should still be doing the big-picture work like software architecture and security design. Topics AI , TC When you purchase through links in our articles, we may earn a small commission . This doesn’t affect our editorial independence. Julie Bort Venture Editor Julie Bort is the Startups/Venture Desk editor for TechCrunch. You can contact or verify outreach from Julie by emailing julie.bort@techcrunch.com or via @Julie188 on X. View Bio October 13-15 San Francisco Early Bird ticket savings of up to $410 end May 29, 11:59 p.m. PT. Connect with 10,000+ tech leaders in unparalleled networking, 200+ hands-on sessions led by 250+ industry leaders, and discover 300+ showcasing startups. REGISTER NOW Most Popular Blue Origin's New Glenn rocket explodes during testing in Florida Sean O'Kane Anthropic releases Opus 4.8 with new ‘dynamic workflow' tool Russell Brandom Meta launches Instagram, Facebook, and WhatsApp subscriptions, with more to come, including AI plans Sarah Perez Tech CEOs are apparently suffering from AI psychosis Julie Bort DuckDuckGo installs are up 30% as users reject being ‘force-fed’ Google’s AI Search Rebecca Bellan 6 kitchen gadgets that make adulting feel easier Lauren Forristal I tried Amazon's Bee wearable and am both intrigued and slightly creeped out Lucas Ropek

AI 코딩 생산성 역설 코드 유지보수 토큰맥싱 개발자 트렌드