The Decoder • 75일 전

클로드 미토스, 英 AI 안전 기관 해킹 시뮬 최초 전면 돌파

IMP

9/10

핵심 요약

영국 AI 안전 연구소(AISI)는 최신 프론티어 AI 모델들의 사이버 공격 역량이 당초 예상보다 훨씬 빠르게 발전하고 있다고 밝혔습니다. 특히 앤스로픽의 '클로드 미토스 프리뷰(Claude Mythos Preview)'는 영국 AISI의 모든 해킹 시뮬레이션을 최초로 완벽히 통과하며 강력한 보안 위협 분석 능력을 입증했습니다. 그러나 실시간 시스템 상호작용 의존성과 기하급수적으로 증가하는 비용 문제 등 여전히 극복해야 할 한계점도 함께 지적되었습니다.

번역된 본문

새로운 클로드 미토스(Claude Mythos), 영국 AI 안전 기관의 모든 사이버 공격 시뮬레이션을 돌파한 최초의 AI 모델로 등장하다.

프론티어 AI 모델(Frontier AI model)들이 예상보다 훨씬 빠른 속도로 사이버 역량을 갖추고 있습니다. 영국의 AI 안전 연구소(AISI)는 단 몇 달 사이에 역량 성장 전망치를 두 번이나 상향 조정했습니다. 이 기관은 2025년 11월 당시만 해도 AI의 사이버 역량이 8개월마다 두 배로 증가할 것으로 추정했습니다. 그러나 2026년 2월에는 해당 주기를 4.7개월로 전면 수정했습니다.

AISI에 따르면, 앤스로픽(Anthropic)의 '클로드 미토스 프리뷰(Claude Mythos Preview)'와 오픈AI(OpenAI)의 'GPT-5.5'는 이렇게 가속화된 타임라인조차 '실질적으로 크게 상회'했습니다. 이것이 새로운 트렌드의 시작인지, 아니면 일회적인 도약에 불과한지는 아직 명확하지 않습니다.

미토스 프리뷰, AISI 사이버 훈련장을 모두 돌파한 최초의 모델로 가장 큰 성과는 실제 해킹 능력을 테스트하기 위해 설계된 복잡한 공격 시뮬레이션인 AISI의 '사이버 레인지(Cyber range)'에서 나타났습니다. 한 훈련장은 인간 전문가가 완료하는 데 약 20시간이 걸리는 기업용 네트워크에 대한 32단계 공격을 시뮬레이션합니다. AISI에 따르면, 최신 미토스 프리뷰 체크포인트(checkpoint)는 10번의 시도 중 6번에서 전체 공격을 완료했습니다. 이 체크포인트는 파트너들에게도 이미 배포된 상태입니다. 이전에 테스트되었던 미토스 버전은 10번 중 3번만 성공에 그쳤습니다.

이 모델은 산업용 제어 시스템(ICS)을 시뮬레이션하는 일명 '쿨링 타워(Cooling Tower)' 공격 시뮬레이션에서도 10번 중 3번의 시도로 해결책을 찾아냈습니다. 초기 미토스 버전을 포함해 그 어느 모델도 이 시뮬레이션을 통과한 적이 없었습니다.

AISI는 "나아가야 할 방향은 명확합니다. 사이버 역량은 빠르게 발전하고 있으며, 최신 모델들은 이전 세대보다 의미 있는 도약을 이루어냈습니다"라고 밝혔습니다. 해당 기관은 기술 발전 속도에 발맞추기 위해 능동적 방어 시스템을 갖춘 더 난이도 높은 평가 환경을 구축하고 있습니다.

XBOW, 소스 코드 분석의 강점을 확인하지만 한계점도 지적해 오펜시브 시큐리티(Offensive security) 기업인 XBOW는 10명의 전문가 팀과 함께 미토스 프리뷰를 독립적으로 테스트했습니다. 이 회사는 이 모델이 "주요한 발전"이며 취약점 탐지에 있어 토큰 대비 토큰(token-for-token) 기준으로 "전례 없는 정확도"를 보여준다고 평가했습니다. 앤스로픽의 기존 모델인 오퍼스 4.6(Opus 4.6)과 비교했을 때, 미토스 프리뷰는 거짓 음성(false negatives) 비율을 42%나 줄였습니다. 소스 코드에 대한 추가 접근 권한이 주어지면 이 감소율은 55%까지 치솟았습니다.

XBOW에 따르면 미토스 프리뷰의 가장 큰 강점은 바로 소스 코드 분석입니다. 보고서는 "이는 앞으로도 계속해서 등장할 주제의 첫 번째 사례였습니다. 미토스 프리뷰는 코드를 작성하는 능력도 훌륭하지만, 코드를 읽고 분석하는 능력은 그보다 훨씬 더 인상적입니다"라고 설명했습니다. 이 모델은 이전 세대 모델들이 거짓 양성(false positives)만을 남발했던 크로미움(Chromium)의 V8 샌드박스 보안 영역에서도 실제 취약점을 찾아냈습니다.

그럼에도 불구하고 XBOW의 평가는 이러한 강점의 한계점도 동시에 드러냈습니다. 많은 취약점이 단순히 코드 자체가 아니라 시스템 설정, 의존성(dependencies) 또는 개별적으로는 안전한 구성 요소 간의 상호작용에서 비롯되기 때문에, 소스 코드 접근 권한보다 실행 중인 실제 라이브 시스템(live system)에 접근하는 것이 훨씬 중요한 경우가 많습니다.

오직 코드 내에만 취약점이 존재하는 벤치마크에서조차, 실제 시스템 접근 권한을 제거하는 것이 소스 코드 접근 권한을 제거하는 것보다 성능에 더 큰 타격을 주었습니다. 미토스 프리뷰는 코드를 읽는 능력이 비할 데 없이 뛰어나지만, 잠재력을 최대한 끌어내기 위해서는 여전히 실시간 시스템과의 상호작용에 의존해야만 합니다.

고성능이지만 비싼 비용: 성능 우위를 둘러싼 비용의 무게 XBOW는 AI 모델의 가격이 급격히 상승하는 현 상황에서 매우 중요한 질문을 던집니다. 이러한 고성능이 과연 비용을 지불할 만큼의 가치가 있는 것일까요? 앤스로픽은 발표한...

원문 보기

원문 보기 (영어)

New Claude Mythos becomes the first AI model to clear all cyberattack simulations from Britain's AI safety agency Matthias Bastian View the LinkedIn Profile of Matthias Bastian May 14, 2026 GPT-Image-2 prompted by THE DECODER Key Points The British AI Safety Institute (AISI) has revised its forecast for AI cyber capabilities, now estimating a doubling time of 4.7 months instead of the previous 8 months. Models like Anthropic's Claude Mythos Preview and OpenAI's GPT-5.5 have already surpassed even this accelerated timeline. In AISI simulations, a new Claude Mythos Preview checkpoint pulled off a 32-stage attack on a simulated corporate network in six out of ten attempts. It was also the first model to crack an industrial control system simulation, succeeding in three out of ten cases. Ask about this article… Search Frontier AI models are gaining cyber capabilities faster than anyone expected. The UK's AI Security Institute (AISI) has revised its estimates upward twice in just a few months. In November 2025, the agency estimated that cyber capabilities were doubling every eight months. By February 2026, it had revised that figure to 4.7 months. Anthropic's Claude Mythos Preview and OpenAI's GPT-5.5 have now "substantially exceeded" even that accelerated timeline, according to AISI. Whether this represents a new trend or a one-time jump remains unclear. Mythos Preview is the first model to clear both AISI cyber ranges The biggest gains showed up in AISI's cyber ranges, complex attack simulations designed to test real-world hacking ability. One range simulates a 32-step attack on a corporate network that human experts would need about 20 hours to complete, according to AISI . The latest Mythos Preview checkpoint finished the full attack in 6 out of 10 attempts. This checkpoint was also rolled out to partners. The previously tested Mythos version managed it in only 3 out of 10. Ad The model also solved "Cooling Tower," a simulation of an industrial control system, in 3 out of 10 attempts. No other model had ever passed this simulation, including the earlier Mythos version. Ad DEC_D_Incontent-1 "The direction of travel is clear: cyber capabilities are advancing rapidly, and recent models represent a meaningful step up from what came before," AISI wrote . The agency is already building harder evaluations with active defenses to keep pace with the technology. XBOW confirms source code analysis strength but sees limits Offensive security firm XBOW independently tested Mythos Preview with a team of ten experts. The model is "a major advance" and shows "token-for-token" an "unprecedented precision" in vulnerability detection, the company said. Compared to Anthropic's Opus 4.6, Mythos Preview cut false negatives by 42 percent. With additional source code access, that reduction hit 55 percent. Ad Mythos Preview's biggest strength is source code analysis, according to XBOW. "This was the first instance of a theme that would surface again and again: Mythos Preview is impressive at writing code, but even more impressive at reading it," the report states. The model even found vulnerabilities in Chromium's V8 sandbox, an area where previous models had produced nothing but false positives. Still, XBOW's evaluation also exposed the limits of that strength. Access to a running system is often more important than access to source code, since many vulnerabilities only emerge from configuration, dependencies, or the interaction between individually secure components. Ad DEC_D_Incontent-2 Even on benchmarks where the vulnerability existed purely in code, removing live system access hurt performance more than removing source code access. Mythos Preview reads code exceptionally well but still depends on interacting with live systems to reach its full potential. Ad Capable but expensive: costs put the lead in perspective XBOW raises a question that matters given the sharp rise in AI model pricing : Is the performance worth the cost? Anthropic has announced that Mythos Preview could cost five times as much as an Opus model. When normalized by estimated operating costs, Mythos Preview "isn't terribly inefficient, at least if you desire high accuracy, but it’s not best-in-class on our benchmarks either," XBOW writes. The alternative would be giving a GPT-5.5-powered agent more time. Often, that delivers equivalent or better results at a lower cost. "The better option depends on the use case; often, it’s the latter," XBOW writes. The company recommends deploying a "cadre of models" rather than betting on a single one. Anthropic: "Within a year, Mythos will probably look quite dumb" Logan Graham, who leads red-teaming around Project Glasswing at Anthropic, put the results in context : Glasswing partners used Mythos Preview to find "many thousands of (estimated) high + critical severity vulnerabilities" in just a few weeks, "sometimes double what they'd normally find in a year." But Graham stressed this isn't about hyping a single model. "Within a year, Mythos will probably look quite dumb (relative to other new models)." The real message, he said, is preparing for a world where models are "better, faster, cheaper, and more creative than some of the best human experts at dual use capabilities." Other providers could release openly available or unsecured models at Mythos-level performance. Cybersecurity is becoming even more political Anthropic introduced Claude Mythos in early April and restricted access to roughly 50 companies, officially for safety reasons. Some critics called the restrictions overblown or dismissed them as a PR move . The truth is probably somewhere in between: Claude Mythos may not be an unprecedented outlier, but it is the first publicly announced model of its kind with significantly advanced cyber capabilities that go well beyond what was previously known. That creates pressure to act across the software industry and in politics alike. The US government is closely examining Claude Mythos and already testing the model , while Anthropic is blocking access for China and apparently the EU as well . OpenAI at least reached out to the EU to discuss early access to GPT-5.5-Cyber . Either way, the situation shows how deeply the European Union depends on the goodwill of major US tech companies , largely because comparable European products don't exist. AI News Without the Hype – Curated by Humans Subscribe to THE DECODER for ad-free reading, a weekly AI newsletter, our exclusive "AI Radar" frontier report six times a year, full archive access, and access to our comment section. Subscribe now Source: Xbow | AISI

AI 보안 사이버 공격 클로드 미토스 영국 AISI 취약점 분석