The Decoder • 67일 전

앤스로픽: AI 취약점 발견 속도가 패치 속도 앞질렀다

IMP

9/10

핵심 요약

앤스로픽의 최신 AI 모델인 '클로드 미토스 프리뷰(Claude Mythos Preview)'가 약 50개 파트너사와 협력하여 단 한 달 만에 핵심 시스템 소프트웨어에서 1만 건 이상의 심각한 보안 취약점을 발견했습니다. AI의 버그 탐지 속도가 기업의 패치 및 검증 속도를 크게 앞지르면서 심각한 보안 격차와 위험이 발생하고 있습니다. 이에 따라 오픈소스 개발자들은 패치를 설계할 시간이 필요하다며 AI의 취약점 공개 속도 조절을 요청하고 있습니다.

번역된 본문

앤스로픽, 개발자의 패치 속도보다 빠른 버그 탐지 경고 작성자: Matthias Bastian 게재일: 2026년 5월 23일

핵심 요약: 앤스로픽과 약 50개 파트너사는 '클로드 미토스 프리뷰(Claude Mythos Preview)' AI 모델을 사용하여 단 한 달 만에 시스템 핵심 소프트웨어에서 1만 건 이상의 치명적인 보안 취약점을 식별했습니다. 이 회사에 따르면, 취약점 발견 속도가 이미 확인 및 수정 능력을 앞지르고 있습니다. 클로드 미토스와 같은 AI 모델이 조직이 패치할 수 있는 것보다 훨씬 빠르게 취약점을 탐지할 수 있어 앤스로픽은 위험한 과도기가 될 수 있다고 경고합니다. 이는 계속 증가하는 보안 격차를 만들어내고 있습니다.

이 기사에 대해 문의하기… 검색

'프로젝트 글래스윙(Project Glasswing)'을 출시한 지 한 달 후, 앤스로픽이 첫 결과를 공유했습니다. 이 회사에 따르면 약 50개 파트너사와 협력한 '클로드 미토스 프리뷰' AI 모델이 시스템 핵심 소프트웨어에서 심각도가 높거나 치명적인 수준의 취약점 1만 건 이상을 찾아냈습니다. 앤스로픽은 블로그 게시물에서 이 모델이 이제 보안팀이 취약점을 확인, 공개 및 패치하는 것보다 더 빠르게 보안 결함을 발견하고 있다고 밝혔습니다.

새로운 취약점을 공개하는 업계 기한이 90일이며, 대부분의 발견 사항은 최종 사용자를 위험에 빠뜨리지 않고는 아직 설명할 수 없기 때문에 앤스로픽은 구체적인 기술적 세부 사항은 보류하고 있습니다.

파트너사들, 버그 발견 건수 10배 급증 보고 앤스로픽은 글래스윙 파트너사들이 인터넷 및 기타 핵심 인프라의 근간이 되는 소프트웨어를 운영하고 구축한다고 말했습니다. 각 파트너사는 수백 건의 치명적인 취약점을 발견했습니다. 또한 여러 기관은 버그 발견율이 10배 이상 급증했다고 보고했습니다.

광고 클라우드플레어(Cloudflare)는 2,000개의 버그를 적발했으며, 그중 400개는 심각도가 높거나 치명적인 수준이었다고 밝혔습니다. 오탐(false positive) 비율은 인간 테스터보다 낮았습니다. 모질라(Mozilla)는 파이어폭스(Firefox) 150에서 271개의 취약점을 발견하고 수정했으며, 이는 전신인 클로드 오퍼스(Claude Opus) 4.6이 파이어폭스 148에서 잡아낸 것보다 10배 이상 많은 수치입니다.

광고 외부 리뷰 역시 이러한 수치를 뒷받침합니다. 영국 AI 보안 연구소(AISI)는 최신 미토스 프리뷰 체크포인트가 사내 모의 다단계 사이버 공격 환경인 '사이버 레인지(cyber ranges)'를 모두 완벽하게 해결한 최초의 모델이라고 밝혔습니다. 독립 보안 플랫폼인 XBOW는 이 모델이 "전례 없는 정확도"를 인용하며 기존의 모든 모델을 뛰어넘는 중대한 도약이라고 평가했습니다.

앤스로픽에 따르면 미토스 프리뷰는 학술적 벤치마크인 '익스플로잇벤치(ExploitBench)'와 '익스플로잇짐(ExploitGym)'에서도 최고의 성능을 기록했습니다. GPT-5.5는 이러한 대부분의 벤치마크에서 근접한 성능을 보이며 이미 공개된 상태입니다.

패치 규모에서도 그 영향이 나타나고 있습니다. 팔로알토 네트웍스(Palo Alto Networks)는 최신 릴리스에서 평소보다 5배나 많은 패치를 배포했습니다. 마이크로소프트(Microsoft)는 새로운 패치의 수가 "당분간 계속해서 늘어나는 추세"일 것이라고 말했습니다. 오라클(Oracle)은 예전보다 몇 배나 더 빠르게 결함을 찾고 수정하고 있다고 주장했습니다.

광고 미토스 프리뷰는 단순한 버그 사냥 그 이상으로도 유용한 것으로 입증되었습니다. 앤스로픽에 따르면 한 파트너 은행에서 이 모델은 150만 달러 이상의 사기성 전신 송금을 적발하고 차단하는 데 도움을 주었습니다.

오픈소스 프로젝트에서 발견된 6,000건 이상의 잠재적 결함 파트너사와의 작업 외에도, 앤스로픽은 미토스 프리뷰를 통해 1,000개 이상의 오픈소스 프로젝트를 스캔했습니다. 이 모델은 6,202개의 심각도가 높거나 치명적인 취약점을 발견한 것으로 추산했으며, 모든 심각도를 통틀어 총 23,019건의 결과를 도출했습니다.

광고 독립 보안 회사들과 부분적으로 앤스로픽 자체적으로 심각도가 높거나 치명적인 것으로 분류된 발견 사항 중 1,752건을 검토했습니다. 그 결과 90.6%가 실제 취약점(true positive)으로 확인되었습니다. 62.4%는 진정한 의미의 높거나 치명적인 수준으로 확인되었습니다.

이러한 분류 비율을 바탕으로, 앤스로픽은 미토스 프리뷰가 오픈소스 코드에서 확인된 심각도가 높거나 치명적인 취약점 약 3,900건을 밝혀냈다고 추정합니다. 이 회사는 스캔을 계속할 계획입니다.

광고 블로그 게시물에 따르면 여러 오픈소스 유지 관리자들은 "패치를 설계할 시간이 더 필요하다"며 앤스로픽에 취약점 공개 속도를 늦춰달라고 요청했습니다. 심각도가 높거나 치명적인 버그를 수정하는 데는 평균적으로 2주가 소요됩니다. 지금까지 530건의 버그가 유지 관리자들에게 보고되었습니다. 그중 75건이 패치되었으며 65건은 공개된 권고문(advisory)을 통해 알려졌습니다.

원문 보기

원문 보기 (영어)

Anthropic warns Claude Mythos Preview finds bugs faster than developers can patch them Matthias Bastian View the LinkedIn Profile of Matthias Bastian May 23, 2026 Anthropic Key Points Anthropic and around 50 partners used the Claude Mythos Preview AI model to identify more than 10,000 critical security vulnerabilities in system-relevant software within just one month. The rate of discovery already outpaces the capacity to verify and fix the identified flaws, according to the company. Anthropic warns of a dangerous transition period, as AI models like Claude Mythos can detect vulnerabilities far faster than organizations can patch them, creating a growing security gap. Ask about this article… Search A month after launching Project Glasswing, Anthropic is sharing its first results. The company says its Claude Mythos Preview AI model, working with about 50 partners, has found more than 10,000 high- or critical-severity vulnerabilities in system-critical software. The model now spots security flaws faster than teams can verify, disclose, and patch them, Anthropic writes in a blog post. The company is holding back specific technical details, since the standard industry deadline for disclosing new vulnerabilities is 90 days and most findings can't be described yet without putting end users at risk. Partners report a tenfold jump in bug discovery Anthropic says the Glasswing partners run and build software that's core to the internet and other critical infrastructure. Each has found hundreds of critical vulnerabilities. Several also say their bug-finding rate jumped more than tenfold. Ad Cloudflare says it flagged 2,000 bugs, 400 of them high or critical severity. Its false positive rate beat human testers. Mozilla found and fixed 271 vulnerabilities in Firefox 150 , more than ten times what its predecessor, Claude Opus 4.6, caught in Firefox 148. Ad DEC_D_Incontent-1 Outside reviews back up these numbers. The UK's AI Security Institute says the latest Mythos Preview checkpoint is the first model to fully solve both of its in-house cyber ranges —simulated multi-stage cyberattacks. Independent security platform XBOW calls the model a major step beyond all prior models, citing "unprecedented precision." Anthropic says Mythos Preview also tops the academic benchmarks ExploitBench and ExploitGym , with GPT-5.5 being close in most of these benchmarks and already openly available. The impact is also showing up in patch volumes, according to Anthropic: Palo Alto Networks shipped five times as many patches as usual in its latest release. Microsoft said the number of new patches will "continue trending larger for some time." Oracle claims it's finding and fixing flaws several times faster than before. Ad Mythos Preview has also proven useful beyond just hunting bugs. At one partner bank, the model helped catch and block a fraudulent wire transfer worth over $1.5 million, Anthropic says. Over 6,000 potential flaws in open-source projects Alongside partner work, Anthropic says it scanned more than 1,000 open-source projects with Mythos Preview. The model estimates it found 6,202 high- or critical-severity vulnerabilities, with 23,019 total findings across all severity levels. Ad DEC_D_Incontent-2 Independent security firms—and partly Anthropic itself—have reviewed 1,752 of the high- or critical-severity findings so far. 90.6 percent turned out to be true positives. 62.4 percent were confirmed as genuinely high or critical. Based on those triage rates, Anthropic estimates Mythos Preview has uncovered close to 3,900 confirmed high- or critical-severity vulnerabilities in open-source code. The company plans to keep scanning. Ad Several open-source maintainers have asked Anthropic to slow down disclosures because "they need more time to design patches," the blog post says. On average, fixing a high- or critical-severity bug takes two weeks. So far, 530 such bugs have been reported to maintainers. Of those, 75 have been patched and 65 got public advisories. Another 827 confirmed vulnerabilities are still waiting to be disclosed. Making things worse, maintainers are already drowning in low-quality, AI-generated bug reports. Anthropic warns of a high-risk transition period Anthropic says models with similar cybersecurity skills will soon be widely available. Some likely already are. OpenAI's GPT-5.5 fits the profile, and there's also a more specialized variant called GPT-5.5 Cyber , though it's unclear what exactly sets the two apart. Either way, these new capabilities create a transition period where vulnerabilities get found fast but patched slowly. That gap brings new risks, Anthropic says. Mythos-class models slash the time and cost of finding and exploiting flaws. No company, Anthropic included, has built safeguards strong enough to stop misuse of these models and prevent serious damage. Over time, these models should help developers build far more secure software by catching bugs before code ships. Currently, Anthropic says software teams should shorten their patch cycles and make updates as easy as possible for users. Network defenders should stick to the basics: multi-factor authentication, hardened configs, and thorough logging. AI News Without the Hype – Curated by Humans Subscribe to THE DECODER for ad-free reading, a weekly AI newsletter, our exclusive "AI Radar" frontier report six times a year, full archive access, and access to our comment section. Subscribe now Source: Anthropic

AI 보안 취약점 탐지 앤스로픽 클로드 미토스 오픈소스