The Decoder • 103일 전

앤스로픽 클로드 4.7, 코딩 대폭 향상 및 보안 능력 축소

IMP

8/10

핵심 요약

앤스로픽이 자율적 코딩 능력과 이미지 해상도를 대폭 향상한 '클로드 오푸스 4.7'을 출시했습니다. 이 모델은 코딩 벤치마크에서 64.3%를 기록하며 전작 대비 큰 성능 향상을 보여주지만, 악용을 막기 위해 사이버 보안 공격 관련 능력은 의도적으로 축소 및 통제했습니다. 단, 새로운 토크나이저 도입으로 인해 실제 사용 비용이 최대 35% 증가할 수 있어 사용자의 주의가 필요합니다.

번역된 본문

앤스로픽의 새로운 플래그십 모델인 클로드 오푸스 4.7(Claude Opus 4.7)은 코딩 작업에서 상당한 발전을 이루었습니다. 이번 훈련 과정에서 회사는 특정 사이버 보안 능력을 의도적으로 축소하려 노력했습니다.

앤스로픽은 전작인 오푸스 4.6의 직접적인 업그레이드 버전인 클로드 오푸스 4.7을 출시했습니다. 회사는 이 모델을 주로 자율적 코딩(Autonomous coding) 분야에서의 도약으로 포지셔닝하고 있습니다.

SWE-bench Pro 코딩 벤치마크에서 오푸스 4.7은 64.3%를 기록했습니다. 이는 전작의 53.4%보다 크게 향상된 수치이며, 오픈AI의 GPT-5.4가 기록한 57.7%보다도 앞서는 성과입니다. 단, 앤스로픽의 최상위 모델인 클로드 미토스 프리뷰(Claude Mythos Preview)는 여전히 77.8%로 압도적인 격차를 유지하고 있습니다.

앤스로픽에 따르면 오푸스 4.7은 전작보다 지시 사항을 훨씬 더 정확하게 따릅니다. 회사는 기존 모델에 맞춰 작성된 프롬프트가 예상치 못한 결과를 낳을 수 있다고 지적했습니다. 오푸스 4.6이 지시 사항을 대략적으로 해석하거나 일부를 건너뛰는 경향이 있었던 반면, 오푸스 4.7은 지시를 더욱 글자 그대로 해석하기 때문입니다.

이미지 해상도 3배 향상으로 시각적 이해력 개선 오푸스 4.7은 긴 변 기준 최대 2,576픽셀까지 이미지를 처리할 수 있으며, 앤스로픽은 이를 약 375만 픽셀(3.75 megapixels) 수준이라고 설명했습니다. 이는 기존 클로드 모델이 처리할 수 있었던 해상도보다 3배 이상 높은 것입니다. 이는 단순한 API 설정이 아닌 모델 수준의 변화로, 이미지가 자동으로 더 높은 해상도로 처리되지만 그만큼 더 많은 토큰을 소비하게 됩니다. 추가적인 디테일이 필요 없는 사용자는 전송 전에 이미지 해상도를 낮출 수 있습니다. 앤스로픽은 이러한 고해상도 처리가 복잡한 스크린샷을 읽어야 하는 컴퓨터 사용 에이전트나 복잡한 다이어그램에서 데이터를 추출할 때 큰 이점이 된다고 봅니다. 문서 추론 벤치마크인 OfficeQA Pro에서는 정확도가 오푸스 4.6의 57.1%에서 80.6%로 크게 향상되었습니다. 또한 생체분자 추론(biomolecular reasoning) 및 시각적 내비게이션(ScreenSpot-Pro) 벤치마크에서도 상당한 성능 향상을 보여주었습니다.

의도적인 사이버 보안 능력 통제 이번 출시의 가장 주목할 만한 측면 중 하나는 앤스로픽이 모델의 사이버 보안 능력을 다루는 방식입니다. 회사는 훈련 과정에서 특정 사이버 보안 능력을 실험적으로 차별화하여 축소하려고 시도했습니다. 새로운 안전장치는 금지되거나 고위험인 사이버 보안 사용을 시사하는 요청을 자동으로 감지하고 차단하도록 설계되었습니다.

이러한 조치의 배경에는 최근 발표된 '프로젝트 글래스윙(Project Glasswing)'이 있습니다. 이 프로젝트에서 앤스로픽은 AI 모델이 사이버 보안에 미치는 위험과 이점을 다루었습니다. 회사는 성능이 더 뛰어난 미토스 프리뷰의 출시를 제한하고, 먼저 성능이 낮은 모델에서 새로운 안전장치를 테스트하겠다고 설명한 바 있습니다. 오푸스 4.7은 이 전략에 대한 첫 번째 테스트 케이스입니다.

모델을 침투 테스트나 레드팀(Red-teaming)에 사용하려는 보안 연구원은 새롭게 마련된 '사이버 검증 프로그램(Cyber Verification Program)'에 신청할 수 있습니다.

환각(Hallucination) 현상은 감소했지만 완전히 사라지지는 않아 시스템 카드에 따르면, 앤스로픽은 환각 현상을 두 가지 유형으로 구분합니다. 조작된 인용문이나 잘못된 데이터처럼 세상에 대한 잘못된 주장인 '사실적 환각(Factual hallucination)'과 모델이 실제로는 존재하지 않는 도구나 첨부 파일에 액세스할 수 있는 것처럼 행동하는 '입력 환각(Input hallucination)'이 그것입니다. 사실적 환각의 경우, 오푸스 4.7은 네 가지 벤치마크에서 오푸스 4.6과 동등하거나 더 나은 성능을 보여줍니다.

원문 보기

원문 보기 (영어)

Anthropic's Claude Opus 4.7 makes a big leap in coding, while deliberately scaling back cyber capabilities Maximilian Schreiner View the LinkedIn Profile of Maximilian Schreiner Apr 16, 2026 Anthropic Key Points Anthropic has released Claude Opus 4.7, which delivers a major jump in autonomous coding: it scores 64.3 percent on the SWE-bench Pro coding benchmark, up from 53.4 percent for its predecessor Opus 4.6. New features include triple the image resolution and deliberately throttled cybersecurity capabilities - Anthropic tried to reduce risky cyber capabilities during training and automatically blocks related requests. Per-token prices stay the same, but a new tokenizer maps the same text to up to 35 percent more tokens - meaning the actual cost per request can rise significantly. Ask about this article… Search Anthropic's new flagship model Claude Opus 4.7 delivers major improvements in coding tasks. During training, the company deliberately tried to reduce certain cybersecurity capabilities. Anthropic has released Claude Opus 4.7 , a direct upgrade to its predecessor, Opus 4.6. The company positions the model primarily as a step forward in autonomous coding. On the SWE-bench Pro coding benchmark, Opus 4.7 scores 64.3 percent, up from 53.4 percent for its predecessor and ahead of OpenAI's GPT-5.4 at 57.7 percent. Anthropic's own top model, Claude Mythos Preview, still leads by a wide margin at 77.8 percent. Anthropic says Opus 4.7 follows instructions more precisely than its predecessor. The company notes that prompts written for older models may now produce unexpected results, as Opus 4.7 interprets instructions more literally than Opus 4.6, which sometimes loosely interpreted or skipped parts of them entirely. Ad Ad DEC_D_Incontent-1 Image resolution triples for better visual understanding Opus 4.7 processes images at up to 2,576 pixels on the long edge, which Anthropic says works out to roughly 3.75 megapixels, more than three times what earlier Claude models could handle. This isn't an API setting but a model-level change: images are automatically processed at higher resolution, though they consume more tokens as a result. Users who don't need the extra detail can downscale images before sending them. Anthropic sees this as a major advantage for computer-use agents that need to read dense screenshots and for extracting data from complex diagrams. On the Document Reasoning benchmark (OfficeQA Pro), the company reports 80.6 percent accuracy, up from 57.1 percent with Opus 4.6. The benchmarks also show significant gains in biomolecular reasoning and visual navigation (ScreenSpot-Pro). Ad Anthropic deliberately throttles cyber capabilities One of the more notable aspects of this release is how Anthropic handles the model's cybersecurity capabilities. The company says it experimentally tried to reduce certain cyber capabilities differentially during training. New safeguards are designed to automatically detect and block requests that suggest prohibited or high-risk cybersecurity use. The background here is the recently announced Project Glasswing , in which Anthropic addressed the risks and benefits of AI models for cybersecurity. The company had explained that it would restrict the release of the more capable Mythos Preview and first test new safeguards on less capable models. Opus 4.7 is the first test case for this strategy. Ad DEC_D_Incontent-2 Security researchers who want to use the model for penetration testing or red-teaming can sign up for a new Cyber Verification Program . Ad Hallucinations drop but don't disappear According to the system card , Anthropic distinguishes between two types of hallucinations: factual hallucinations - wrong claims about the world, like fabricated quotes or incorrect data - and input hallucinations, where the model acts as if it has access to a tool or attachment that doesn't actually exist. For factual hallucinations, Opus 4.7 performs better than or on par with Opus 4.6 across four benchmarks but falls short of Mythos Preview. Anthropic says the gap comes mainly from Mythos Preview's higher hit rate on obscure facts, not from a higher error rate in Opus 4.7. For input hallucinations, Opus 4.7 achieves the lowest hallucination rate of all tested models when users request a tool that isn't available. When context information is missing, it comes close to Mythos Preview and sits well ahead of older models. Anthropic acknowledges, however, that the test cases for the tool set were tailored to Opus 4.6's weaknesses, which skews that model's results. When dealing with questions based on made-up facts Opus 4.7 performs on par with Opus 4.6 and below Mythos Preview. Under pressure, such as when users or system prompts push the model to contradict its own assessment, Opus 4.7 is more honest than Opus 4.6 but less firm than Mythos Preview. Alignment results are a mixed bag Overall, Anthropic describes Opus 4.7's safety profile as similar to Opus 4.6, with low rates of deception, sycophancy, and cooperation with misuse. The model is more resistant to prompt injection attacks. A known issue from earlier Claude models partially persists: refusing to help with legitimate AI safety research. According to the system card, Opus 4.7 still refuses to assist in 33 percent of simulated safety research tasks. That's a significant drop from 88 percent with Opus 4.6, but still a substantial share. Same per-token prices, potentially much higher real-world costs Pricing stays at $5 per million input tokens and $25 per million output tokens. However, Opus 4.7 uses a new tokenizer that can map the same text to up to 1.35 times as many tokens. The model also generates more output tokens at higher effort levels. In practice, the cost per request can rise significantly even though the per-token prices remain unchanged. A new effort level called "xhigh" slots in between "high" and "max." Claude Code also gets a new "/ultrareview" command for dedicated code reviews and an expanded "Auto Mode" for Max users, where Claude makes decisions on its own. Opus 4.7 is available through the Claude API, Amazon Bedrock, Google Cloud Vertex AI, and Microsoft Foundry. More details and tips are available in the migration guide for Opus 4.7 . AI News Without the Hype – Curated by Humans Subscribe to THE DECODER for ad-free reading, a weekly AI newsletter, our exclusive "AI Radar" frontier report six times a year, full archive access, and access to our comment section. Subscribe now Source: Anthropic / Claude Opus 4.7 announcement | Anthropic / Cyber Verification Program | Anthropic / System Card | Anthropic / Migration Guide

앤스로픽 클로드 코딩 AI AI 보안 신모델 출시