Wired AI • 116일 전

메타, 보안 사태로 데이터 제공업체 머코어와 업무 일시 중단

IMP

8/10

핵심 요약

데이터 하청 기업 머코어(Mercor)가 해킹 공격을 받으면서 오픈AI, 메타 등 주요 AI 기업들의 핵심인 '맞춤형 모델 학습 데이터'가 유출될 위기에 처했습니다. 이에 따라 메타는 머코어와의 전면적인 업무를 무기한 중단했으며, 오픈AI 역시 자사 데이터 노출 여부에 대한 내부 조사에 착수했습니다. 이번 사태는 공급망 해킹 공격으로 인해 경쟁사에 절대 노출되어서는 안 될 각 AI 모델의 핵심 훈련 방식과 데이터가 타격을 입을 수 있다는 점에서 산업계에 큰 충격을 주고 있습니다.

번역된 본문

WIRED(와이어드)가 입수한 두 소식통에 따르면, 메타(Meta)는 스타트업 머코어(Mercor)에 영향을 미친 대규모 보안 침해 사태를 조사하는 동안 데이터 하청 기업인 머코어와의 모든 업무를 일시 중단했습니다. 소식통들은 이 중단 조치가 무기한이라고 밝혔습니다. 사건의 범위를 평가하면서 다른 주요 AI 연구소들 역시 머코어와의 협력 관계를 재검토하고 있다고 이 사건에 익숙한 인사들이 전했습니다.

머코어는 오픈AI(OpenAI), 앤스로픽(Anthropic) 및 기타 AI 연구소들이 자사 모델용 훈련 데이터를 생성하기 위해 의존하는 몇 안 되는 기업 중 하나입니다. 이 회사는 대규모 인적 도급업체 네트워크를 고용해 이 연구소들을 위한 맞춤형 독자 데이터셋을 생성합니다. 이 데이터셋은 일반적으로 챗GPT(ChatGPT)나 클로드 코드(Claude Code)와 같은 제품을 구동하는 가치 있는 AI 모델을 만드는 핵심 요소이기 때문에 극비로 유지됩니다. AI 연구소들은 이 데이터가 경쟁사(미국 및 중국의 다른 AI 연구소 포함)에게 AI 모델 훈련 방식에 대한 핵심 세부 정보를 드러낼 수 있기 때문에 매우 민감하게 반응합니다.

현재로서는 머코어의 유출 사태로 노출된 데이터가 경쟁사에게 실질적인 도움을 줄 수 있는지 여부가 불분명합니다. 오픈AI의 대변인이 WIRED에 확인한 바에 따르면, 오픈AI는 머코어와의 현재 프로젝트를 중단하지는 않았지만, 자사의 독점적인 훈련 데이터가 어떻게 노출되었을 수 있는지 조사하기 위해 이 스타트업의 보안 사고를 내부 검토하고 있습니다. 그러나 이번 사태가 결코 오픈AI 사용자 데이터에는 영향을 미치지 않는다고 대변인은 덧붙였습니다. 앤스로픽은 WIRED의 코멘트 요청에 즉각적인 답변을 하지 않았습니다.

머코어는 3월 31일 직원들에게 보낸 이메일에서 이번 공격을 확인했습니다. 회사 측은 "전 세계 수천 개의 다른 조직과 함께 우리 시스템에 영향을 미친 최근의 보안 사고가 있었다"고 적었습니다. WIRED가 파악한 바에 따르면, 머코어의 한 직원은 목요일 도급업체들에게 보낸 메시지에서 이러한 점들을 반복해 언급했습니다. 이 소식통에 따르면 메타 프로젝트에 투입된 계약직 근로자들은 프로젝트가 재개될 경우에나 다시 근무 시간을 기록할 수 있으며, 이는 실질적으로 일자리를 잃게 될 수도 있음을 의미합니다. WIRED가 열람한 내부 대화에 따르면, 회사는 영향을 받은 근로자들을 위해 추가적인 프로젝트를 찾고 있습니다. 머코어의 계약직 근로자들은 자신들의 메타 프로젝트가 중단된 정확한 이유를 듣지 못했습니다. '코더스(Chordus)' 이니셔티브(AI 모델이 여러 인터넷 출처를 사용하여 사용자 쿼리에 대한 응답을 검증하도록 가르치는 메타 전용 프로젝트)와 관련된 슬랙(Slack) 채널에서 프로젝트 리더는 직원들에게 머코어가 "현재 프로젝트 범위를 재평가하고 있다"고 말했습니다.

TeamPCP로 알려진 공격자는 최근 AI API 도구인 LiteLLM의 두 가지 버전을 손상시킨 것으로 보입니다. 이번 침해 사건은 LiteLLM을 통합한 기업 및 서비스를 노출시켰고, 오염된 업데이트를 설치하게 만들었습니다. 다른 주요 AI 기업들을 포함해 수천 명의 피해자가 있을 수 있지만, 머코어에서 발생한 침해 사건은 손상된 데이터가 얼마나 민감한지를 잘 보여줍니다. 머코어와 그 경쟁사인 서지(Surge), 핸드셰이크(Handshake), 튜링(Turing), 레이블박스(Labelbox), 스케일 AI(Scale AI) 등은 주요 AI 연구소에 제공하는 서비스에 대해 극비를 유지하는 것으로 정평이 나 있습니다. 이 회사들의 CEO가 제공하는 구체적인 업무에 대해 공개적으로 이야기하는 것은 드물며, 내부적으로는 프로젝트를 설명하기 위해 코드명을 사용합니다.

해킹을 둘러싼 혼란을 더한 것은, 잘 알려진 이름인 랩서스(Lapsus$)를 사칭한 한 그룹이 이번 주 자신들이 머코어를 해킹했다고 주장한 것입니다. 텔레그램(Telegram) 계정과 브리치포럼스(BreachForums) 복제 사이트에서 이 해킹 범은 200GB 이상의 데이터베이스, 거의 1TB의 소스 코드, 그리고 3TB의 비디오 및 기타 정보를 포함하는 다양한 것으로 추정되는 머코어 데이터를 판매하겠다고 제안했습니다. 하지만 연구원들은 많은 사이버 범죄 그룹이 현재 주기적으로 랩서스라는 이름을 사용한다고 말합니다. 머코어가 LiteLLM과의 연관성을 확인한 것은 공격자가 TeamPCP이거나 해당 그룹과 연결된 인물일 가능성이 높다는 것을 의미합니다. TeamPCP는 최근 몇 달 동안 기세를 더해가면서 훨씬 더 대규모의 공급망 해킹 공세의 일환으로 두 가지 LiteLLM 업데이트를 손상시킨 것으로 보이며, 이로 인해 TeamPCP는 업계의 주목을 받고 있습니다. 그리고 데이터 협박 공격을 감행하고 랜섬웨어를 다루는 등...

원문 보기

원문 보기 (영어)

Comment Loader Save Story Save this story Comment Loader Save Story Save this story Meta has paused all its work with the data contracting firm Mercor while it investigates a major security breach that impacted the startup, two sources confirmed to WIRED. The pause is indefinite, the sources said. Other major AI labs are also reevaluating their work with Mercor as they assess the scope of the incident, according to people familiar with the matter. Mercor is one of a few firms that OpenAI , Anthropic , and other AI labs rely on to generate training data for their models. The company hires massive networks of human contractors to generate bespoke, proprietary datasets for these labs, which are typically kept highly secret as they’re a core ingredient in the recipe to generate valuable AI models that power products like ChatGPT and Claude Code . AI labs are sensitive about this data because it can reveal to competitors—including other AI labs in the US and China—key details about the ways they train AI models. It’s unclear at this time whether the data exposed in Mercor’s breach would meaningfully help a competitor. While OpenAI has not stopped its current projects with Mercor, it is investigating the startup’s security incident to see how its proprietary training data may have been exposed, a spokesperson for the company confirmed to WIRED. The spokesperson says that the incident in no way affects OpenAI user data, however. Anthropic did not immediately respond to WIRED’s request for comment. Mercor confirmed the attack in an email to staff on March 31. “There was a recent security incident that affected our systems along with thousands of other organizations worldwide,” the company wrote. A Mercor employee echoed these points in a message to contractors on Thursday, WIRED has learned. Contractors who were staffed on Meta projects cannot log hours until—and if—the project resumes, meaning they could functionally be out of work, a source familiar claims. The company is working to find additional projects for those impacted, according to internal conversations viewed by WIRED. Mercor contractors were not told exactly why their Meta projects were being paused. In a Slack channel related to the Chordus initiative—a Meta-specific project to teach AI models to use multiple internet sources to verify their responses to user queries—a project lead told staff that Mercor was “currently reassessing the project scope.” An attacker known as TeamPCP appears to have recently compromised two versions of the AI API tool LiteLLM. The breach exposed companies and services that incorporate LiteLLM and installed the tainted updates. There could be thousands of victims, including other major AI companies, but the breach at Mercor illustrates the sensitivity of the compromised data. Mercor and its competitors—such as Surge, Handshake, Turing, Labelbox, and Scale AI—have developed a reputation for being incredibly secretive about the services they offer to major AI labs. It’s rare to see the CEOs of these firms speaking publicly about the specific work they offer, and they internally use codenames to describe their projects. Adding to the confusion around the hack, a group going by the well-known name Lapsus$ claimed this week that it had breached Mercor. In a Telegram account and on a BreachForums clone, the actor offered to sell an array of alleged Mercor data, including a 200-plus GB database, nearly 1 TB of source code, and 3 TBs of video and other information. But researchers say that many cybercriminal groups now periodically take up the Lapsus$ name and that Mercor’s confirmation of the LiteLLM connection means that the attacker is likely TeamPCP or an actor connected to the group. TeamPCP appears to have compromised the two LiteLLM updates as part of an even larger supply chain hacking spree in recent months that has been gaining momentum, catapulting TeamPCP to prominence. And while launching data extortion attacks and working with ransomware groups, such as the group known as Vect, TeamPCP has also strayed into political territory, spreading a data wiping worm known as “CanisterWorm” through vulnerable cloud instances with Farsi as their default language or clocks set to Iran’s time zone. “TeamPCP is definitely financially motivated,” says Allan Liska, an analyst for the security firm Recorded Future who specializes in ransomware. “There might be some geopolitical stuff as well, but it’s hard to determine what’s real and what’s bluster, especially with a group this new.” Looking at the dark-web posts of the alleged Mercor data, Liska adds, “There is absolutely nothing that connects this to the original Lapsus$.”

메타 데이터유출 보안사고 오픈AI AI학습데이터