r/singularity • 51일 전

클로드 '미토스 프리뷰'의 등장이 모두에게 위협인 이유

IMP

8/10

핵심 요약

Anthropic이 극비리에 개발한 AI 모델 'Claude Mythos Preview'는 수십 년간 발견되지 않은 운영체제 보안 취약점 수천 개를 단번에 찾아낼 정도로 전례 없는 해킹 능력을 갖췄습니다. 이에 공개를 보류하고 주요 빅테크 기업들에만 보안 목적으로 제한적으로 제공하며, 사이버 보안의 패러다임이 크게 전환되는 시점을 알리고 있습니다.

번역된 본문

잠시 들어보세요. 지난 몇 주 동안 Anthropic은 전 세계 대부분의 컴퓨터 서버를 장악할 수 있는 잠재적 도구를 비밀리에 보유하고 있었다고 밝혔습니다. 이 봇은 풀려나면 은행에 해킹을 가하고, 국가 기밀을 유출하며, 핵심 인프라를 마비시킬 수도 있습니다. 이 회사에 따르면, 이 AI 모델은 이미 수천 개의 주요 사이버 보안 취약점을 식별했으며, 여기에는 모든 주요 운영 체제와 브라우저의 익스플로잇(exploit)이 포함되어 있습니다. 이 수준의 사이버 공격은 일반적으로 중국, 러시아, 미국 등 극소수 국가의 엘리트 국가 후원 해킹 셀에서만 가능한 것입니다. 이제 그 기술이 민간 기업의 손에 들어왔습니다.

화요일, 이 회사는 'Claude Mythos Preview'로 알려진 이 모델의 존재를 공식적으로 발표했습니다. 현재 이 봇은 Apple, Microsoft, Google, Nvidia 등 세계 최대 기술 기업들의 컨소시엄에서만 사용할 수 있습니다. 이 파트너들은 Mythos Preview를 사용해 자사 소프트웨어의 버그와 취약점을 스캔하고 보안을 강화할 수 있습니다. 그 외에 Anthropic은 Mythos Preview를 즉시 대중에게 공개하지 않을 것이라고 밝혔으며, 더 강력한 안전장치 없이 공개하는 것은 너무 위험하다고 판단했습니다.

수년간 사이버 보안 전문가들은 고도화된 해킹 봇이 초래할 혼란에 대해 경고해 왔습니다. AI 모델이 코딩 측면에서 뛰어난 성능을 발휘하게 되면서, 소프트웨어의 취약점을 찾는 능력 또한 극도로 뛰어나졌습니다. Mythos Preview 이전에도 Anthropic, OpenAI, Google 등의 AI 기업들은 범죄자 및 국가 지원 그룹에 의해 자사 AI 모델이 정교한 사이버 공격에 사용된 사례를 보고했습니다. AI 기반 사이버 위협을 연구하는 연방 연구소의 책임자인 Giovanni Vigna가 작년에 저에게 말했듯이, '버튼 하나로 100만 명의 해커를 부릴 수 있게 된 것'입니다.

(관련 기사: 챗봇이 정말, 정말 훌륭한 범죄자가 되어가고 있다)

그럼에도 불구하고 Mythos Preview는 점진적인 변화가 아니라 패러다임 전환의 시작을 알리는 것으로 보입니다. 최근까지 AI 기반 해킹의 가장 큰 장점은 기발한 창의성이라기보다는 속도와 규모에 있었습니다. 이러한 봇들은 수많은 인간 사이버 보안 전문가만큼 훌륭했지만 반드시 더 뛰어난 것은 아니었습니다. 오히려 100만 명의 피로를 모르는 가상 해커 군대를 보유함으로써 이전보다 훨씬 더 많은 표적에 대해 더 많은 공격을 감행할 수 있었습니다. Anthropic 역시 현재의 최첨단 공개 모델인 Claude Opus 4.6이 자율적으로 사이버 취약점을 찾는 능력에서 상당히 뒤처진다고 보고했습니다.

하지만 Mythos Preview는 다릅니다. Anthropic에 따르면, 이 봇은 때로는 수십 년 동안 발견되지 않았던 수천 개의 소프트웨어 버그를 찾아냈으며, 이는 많은 사람들이 이전에는 불가능하다고 생각했던 정교함과 공격 속도입니다. 이 모델은 세계에서 가장 안전한 운영 체제 중 하나에서 거의 30년 된 취약점을 발견했습니다. Anthropic 연구원인 Sam Bowman은 X(예전 트위터)에 자신이 공원에서 샌드위치를 먹고 있을 때 Mythos Preview에게 이메일을 받았다고 게시했습니다. 봇이 회사의 내부 샌드박스(sandbox)에서 빠져나와 인터넷에 접속할 권한을 획득했다는 내용이었습니다.

Anthropic이 모델을 공개하지 않았기 때문에 Mythos Preview의 정확한 능력을 판단하기는 어렵습니다. 취약점을 식별하는 것과 탐지되지 않고 이를 악용할 수 있는 것과는 다릅니다. 은행 강도가 은행 열쇠를 가지고 있어도 여전히 보안 카메라를 피해야 하는 것과 같은 이치입니다. 또한 Anthropic은 이 불투명한 발표를 통해 분명 이득을 볼 것입니다. 초고도화된 모델을 개발했다고 주장하면서도, 최악의 사이버 보안 시나리오를 방지함으로써 책임감 있게 행동하는 것처럼 보일 수 있기 때문입니다. 실제로 Mythos Preview를 공개하지 않기로 한 결정은 AI 업계의 '착한 기업'이라는 Anthropic의 자체 이미지를 강화해 줍니다. (Anthropic은 Mythos Preview에 대한 이메일 질문에 즉시 답변하지 않았습니다.)

물론 이러한 조치는 전략적일 수도 있고 양심적일 수도 있습니다. 만약 Anthropic이 공유한 내용이 조금이라도 사실이라면, 이는 새로운 시대를 예고하는 것입니다.

원문 보기

원문 보기 (영어)

Listen − 1.0 x + Seek 0:00 7:42 For the past several weeks, Anthropic says it secretly possessed a tool potentially capable of commandeering most computer servers in the world. This is a bot that, if unleashed, might be able to hack into banks, exfiltrate state secrets, and fry crucial infrastructure. Already, according to the company, this AI model has identified thousands of major cybersecurity vulnerabilities—including exploits in every single major operating system and browser. This level of cyberattack is typically available only to elite, state-sponsored hacking cells in a very small number of countries including China, Russia, and the United States. Now it’s in the hands of a private company. On Tuesday, the company officially announced the existence of the model, known as Claude Mythos Preview. For now, the bot will be available only to a consortium of many of the world’s biggest tech companies—including Apple, Microsoft, Google, and Nvidia. These partners can use Mythos Preview to scan and secure bugs and exploits in their software. Other than that, Anthropic will not immediately release Mythos Preview to the public, having determined that doing so without more robust safeguards would be too dangerous. For years, cybersecurity experts have been warning about the chaos that highly capable hacking bots could usher in. As a result of how capable AI models have become at coding, they have also become extremely good at finding vulnerabilities in all manner of software. Even before Mythos Preview, AI companies such as Anthropic, OpenAI, and Google all reported instances of their AI models being used in sophisticated cyberattacks by both criminal and state-backed groups. As Giovanni Vigna, who directs a federal research institute dedicated to AI-orchestrated cyberthreats, told me last fa ll : You can have a million hackers at your fingertips “with the push of a button.” Read: Chatbots are becoming really, really good criminals Still, Mythos Preview appears to represent not an incremental change but the beginning of a paradigm shift. Until recently, the biggest advantage of AI-assisted hacking was not ingenuity, per se, so much as speed and scale. These bots could be as good as many human cybersecurity experts, but not necessarily better—rather, having an army of 1 million virtual, tireless hackers allows you to launch more attacks against more targets than ever before. Even Anthropic reports that its current state-of-the-art, public model, Claude Opus 4.6, was significantly less capable at autonomously finding cyber exploits. But Mythos Preview is different. According to Anthropic, the bot has been able to find thousands of software bugs that had gone undetected, sometimes for decades, a sophistication and speed of attack previously thought by many to be impossible. The model has found a nearly 30-year-old vulnerability in one of the world’s most secure operating systems. The Anthropic researcher Sam Bowman posted on X that he was eating a sandwich in the park when he got an email from Mythos Preview : The bot had broken out of the company’s internal sandbox and gained access to the internet. The exact capabilities of Mythos Preview are hard to judge, because Anthropic has not released the model. Identifying a vulnerability is not the same as being able to exploit it undetected—in the same way that a robber can have the keys to a bank but still needs to deal with security cameras. And Anthropic surely stands to benefit from its opaque announcement: The company can claim to have developed an ultra-advanced model, while also appearing to act responsibly by preventing the worst-case cybersecurity scenarios. Indeed, the decision to not release Mythos Preview bolsters Anthropic’s self-styled image as the AI industry’s good guy. (Anthropic did not immediately respond to emailed questions about Mythos Preview.) Of course, a move can be both strategic and conscientious. Should what Anthropic shared be remotely accurate, it heralds a troubling future. Anthropic has a tool that “could damage the operations of critical infrastructure and government services in every country on Earth,” Dean Ball, a former AI adviser to the Trump administration, wrote this week. The ability to defend against such cyberattacks is integral to the basic functioning of society. And the ability to launch such attacks is integral to modern warfare. Anthropic may have just scaled its way into becoming a major geopolitical force. Perhaps more concerning than the reported capabilities of Mythos Preview is that other companies are not far behind. OpenAI is reportedly set to release its own similarly powerful model to a select group of companies. It’s very possible, even likely, that Google DeepMind, xAI, and AI firms in China are next. How scrupulous they will be is less clear. Even cheaper or open-source AI models from smaller companies could soon enable this sort of hacking—which would unsettle the basic security and privacy that undergirds the modern internet. Hacking bots are not the only domain through which a handful of AI companies are gaining tremendous influence. The technology has become crucial to military operations. Even as the Pentagon has engaged in a public feud with Anthropic, Claude was reportedly used in the bombing of Iran and, before that, the Venezuela raid in January. Last month, the Department of Defense signed a contract with OpenAI that very likely allows the government to use the firm’s AI systems to enable unprecedented surveillance of U.S. citizens. (OpenAI has maintained that the Pentagon agreed not to use its products for domestic surveillance.) At the same time, bots from OpenAI, Anthropic, Google DeepMind, and beyond are becoming infrastructure: used by nearly all of the world’s biggest businesses, schools, health-care systems, and public agencies. This is a large part of the reason that Iran has struck or threatened to strike Amazon and OpenAI data centers in the Middle East—the facilities are high-impact targets on par with the oil fields that Iran has also targeted. Meanwhile, so much money is pouring into the AI boom that these companies are functionally holding the global economy hostage . In other words, AI companies are remaking the world. Consider how Elon Musk’s network of Starlink satellites has allowed him to repeatedly tip the scales in Russia’s invasion of Ukraine. Generative AI offers even more possibilities. These companies can or could soon have the capability to launch major cyberattacks, conduct mass surveillance, influence military operations, cause huge swings in financial and labor markets, and reorient global supply chains. In theory, nothing governs these companies other than their own morals and their investors. They are developing the power to upend nations and economies. These are the AI superpowers. About the Author Matteo Wong Follow Matteo Wong is a staff writer at The Atlantic . Explore More Topics Anthropic , Google , Iran , OpenAI , People's Republic of China , United States Department of Defense

보안 취약점 AI 해킹 Anthropic 클로드 사이버 보안