Hacker News • 112일 전

오픈AI, 너무 위험해 공개 불가한 GPT-2 발표 (2019)

IMP

7/10

핵심 요약

2019년 오픈AI는 특정 주제를 주면 문맥에 맞는 글을 매우 자연스럽게 생성하는 텍스트 생성 모델 GPT-2를 발표했습니다. 그러나 가짜 뉴스나 악의적 콘텐츠 생성 등 안전 및 보안상의 우려로 인해 원본 모델이 아닌 축소된 버전만 공개하기로 결정했습니다. 이 결정은 AI 알고리즘의 위험성과 윤리적 공개 기준에 대한 업계 전문가들의 치열한 논쟁을 촉발하는 계기가 되었습니다.

번역된 본문

지난주 비영리 연구 단체인 오픈AI는 특정 주제에 대한 프롬프트가 주어지면 일관성 있고 다양한 산문을 작성할 수 있는 새로운 텍스트 생성 모델을 개발했다고 밝혔습니다. 그러나 이 단체는 '안전 및 보안상의 우려'로 인해 전체 알고리즘을 공개하지 않겠다고 말했습니다. 대신 오픈AI는 '훨씬 더 작은' 버전의 모델만 공개하기로 결정하고, 모델 개발에 사용된 데이터셋과 학습 코드는 공개하지 않기로 했습니다.

GPT-2라는 이 모델에 대한 지식이 관련 뉴스 보도의 헤드라인에서만 얻은 것이라면, 오픈AI가 무기급의 챗봇을 만들었다고 생각할 수도 있습니다. 영국 매트로 지의 헤드라인은 '일론 머스크가 설립한 오픈AI, 인류를 위해 반드시 잠가두어야 할 만큼 강력한 AI를 개발'이라고 보도했습니다. CNET의 또 다른 기사는 '머스크 후원 AI 그룹: 우리의 텍스트 생성기는 너무 뛰어나서 무섭다'고 전했습니다. 가디언 지의 칼럼은 아이러니하게도 'AI가 나처럼 글을 쓸 수 있다. 로봇 종말론에 대비하라'는 제목이었습니다. 이는 꽤 경악할 만한 내용입니다. 하지만 머신러닝 분야의 전문가들은 오픈AI의 주장이 다소 과장되었을 수 있다는 점에 대해 논쟁을 벌이고 있습니다. 이 발표는 잠재적으로 위험한 AI 알고리즘의 확산을 어떻게 처리할 것인가에 대한 논쟁도 촉발시켰습니다.

오픈AI는 스페이스X와 테슬라의 창립자 일론 머스크, 벤처 캐피털리스트 피터 틸, 링크드인 공동 창립자 레이드 호프만 같은 거장들의 초기 자금 지원을 받아 설립된 인공지능 연구 분야의 선구자입니다. 이 비영리 단체의 사명은 남용되거나 유해한 애플리케이션으로부터 벗어나 AI 개발을 책임감 있게 이끄는 것입니다. 텍스트 생성 외에도 오픈AI는 스스로 간단한 작업을 학습할 수 있는 로봇 팔, 전략 비디오 게임인 Dota 2의 프로 게이머를 이길 수 있는 시스템, 그리고 학습 과정에 인간의 입력을 통합할 수 있는 알고리즘 등을 개발했습니다.

2월 14일, 오픈AI는 연구원들이 800만 개의 웹페이지 텍스트를 사용해 글의 다음 단어를 예측하도록 언어 모델을 학습시키는 방법을 상세히 설명하는 블로그 게시물을 통해 또 다른 머신러닝의 경이로움을 발표했습니다. 이 단체에 따르면 결과물로 나온 알고리즘은 놀라웠습니다. 이는 '조건부 텍스트의 스타일과 내용에 적응'할 수 있었고 사용자가 '선택한 주제에 대해 현실적이고 일관된 뒷이야기를 생성'할 수 있게 해주었습니다. 이 성과를 입증하기 위해 오픈AI는 특정 인간이 작성한 프롬프트를 바탕으로 GPT-2가 생성한 텍스트 샘플을 제공했습니다.

예를 들어, 연구원들은 생성기에 다음과 같은 시나리오를 입력했습니다: 놀라운 발견으로, 과학자들은 안데스 산맥의 외지고 이전에 탐험되지 않은 계곡에서 유니콘 떼가 살고 있는 것을 발견했습니다. 연구원들에게 더욱 놀라운 사실은 유니콘이 완벽한 영어를 구사했다는 것입니다.

이에 GPT-2 알고리즘은 뉴스 기사 형식의 글을 작성했습니다: 과학자들은 이 집단의 특징적인 뿔을 따라 '오비드의 유니콘'이라고 이름 붙였습니다. 이 네 개의 뿔을 가진 은백색 유니콘은 과거에 과학계에 알려지지 않았습니다. 이제 거의 두 세기가 지난 후, 이 기이한 현상을 촉발한 미스터리가 마침내 해결되었습니다. 라파즈 대학의 진화생물학자인 호르헤 페레즈 박사와 여러 동료들은 안데스 산맥을 탐험하다가 다른 동물이나 인간이 없는 작은 계곡을 발견했습니다. 페레즈는 계곡에 바위와 은빛 눈으로 둘러싸인 천연 분수처럼 보이는 것이 있다는 것을 알아챘습니다. 페레즈와 일행은 계곡의 더 깊은 곳으로 모험을 떠났습니다. "우리가 한 봉우리의 정상에 도달했을 때, 물은 파란색을 띠었고 그 위에 약간의 결정체가 있었습니다."라고 페레즈는 말했습니다.

다른 샘플들에서는 GPT-2가 반지의 제왕의 또 다른 전투 장면을 작성하는 소설가, 재활용에 대해 분노하는 칼럼니스트, 그리고 가상의 사이보그로 부활한 후 미국 국민을 향해 연설문을 작성하는 존 F. 케네디의 대필작가 등 다양한 역할을 소화하는 모습을 보여주었습니다. 최근의 역사가 보여주듯, 이를 억누르거나 통제하려는 시도는...

원문 보기

원문 보기 (영어)

Copy Link Share Share Comment Copy Link Share Share Comment Last week, the nonprofit research group OpenAI revealed that it had developed a new text-generation model that can write coherent, versatile prose given a certain subject matter prompt. However, the organization said, it would not be releasing the full algorithm due to “safety and security concerns.” Instead, OpenAI decided to release a “much smaller” version of the model and withhold the data sets and training codes that were used to develop it. If your knowledge of the model, called GPT-2, came solely on headlines from the resulting news coverage, you might think that OpenAI had built a weapons-grade chatbot. A headline from Metro U.K. read , “Elon Musk-Founded OpenAI Builds Artificial Intelligence So Powerful That It Must Be Kept Locked Up for the Good of Humanity.” Another from CNET reported , “Musk-Backed AI Group: Our Text Generator Is So Good It’s Scary.” A column from the Guardian was titled, apparently without irony, “AI Can Write Just Like Me. Brace for the Robot Apocalypse.” That sounds alarming. Experts in the machine learning field, however, are debating whether OpenAI’s claims may have been a bit exaggerated. The announcement has also sparked a debate about how to handle the proliferation of potentially dangerous A.I. algorithms. OpenAI is a pioneer in artificial intelligence research that was initially funded by titans like SpaceX and Tesla founder Elon Musk, venture capitalist Peter Thiel, and LinkedIn co-founder Reid Hoffman. The nonprofit’s mission is to guide A.I. development responsibly, away from abusive and harmful applications. Besides text generation, OpenAI has also developed a robotic hand that can teach itself simple tasks, systems that can beat pro players of the strategy video game Dota 2 , and algorithms that can incorporate human input into their learning processes. On Feb. 14, OpenAI announced yet another feat of machine learning ingenuity in a blog post detailing how its researchers had trained a language model using text from 8 million webpages to predict the next word in a piece of writing. The resulting algorithm, according to the nonprofit, was stunning: It could “[adapt] to the style and content of the conditioning text” and allow users to “generate realistic and coherent continuations about a topic of their choosing.” To demonstrate the feat, OpenAI provided samples of text that GPT-2 had produced given a particular human-written prompt. For example, researchers fed the generator the following scenario: Advertisement Advertisement Advertisement Advertisement In a shocking finding, scientist discovered a herd of unicorns living in a remote, previously unexplored valley, in the Andes Mountains. Even more surprising to the researchers was the fact that the unicorns spoke perfect English. The GPT-2 algorithm produced a news article in response: The scientist named the population, after their distinctive horn, Ovid’s Unicorn. These four-horned, silver-white unicorns were previously unknown to science. Now, after almost two centuries, the mystery of what sparked this odd phenomenon is finally solved. Dr. Jorge Pérez, an evolutionary biologist from the University of La Paz, and several companions, were exploring the Andes Mountains when they found a small valley, with no other animals or humans. Pérez noticed that the valley had what appeared to be a natural fountain, surrounded by two peaks of rock and silver snow. Pérez and the others then ventured further into the valley. “By the time we reached the top of one peak, the water looked blue, with some crystals on top,” said Pérez. Advertisement Other samples exhibited GPT-2’s turns as a novelist writing another battle passage of The Lord of the Rings , a columnist railing against recycling, and a speechwriter composing John F. Kennedy’s address to the American people in the wake of his hypothetical resurrection as a cyborg. If recent history is any indication, trying to suppress or control the proliferation of AI tools may be a losing battle. While researchers admit that the algorithm’s prose can be a bit sloppy—it often rambles, uses repetitive language, can’t quite nail topic transitions, and inexplicably mentions “fires happening under water”—OpenAI nevertheless contends that GPT-2 is far more sophisticated than any other text generator that it has developed. That’s a bit self-referential, but most in the A.I. field seem to agree that GPT-2 is truly at the cutting edge of what’s currently possible with text generation. Most A.I. tech is only equipped to handle specific tasks and tends to fumble anything else outside a very narrow range. Training the GPT-2 algorithm to adapt nimbly to various modes of writing is a significant achievement. The model also stands out from older text generators in that it can distinguish between multiple definitions of a single word based on context clues and has a deeper knowledge of more obscure usages. These enhanced capabilities allow the algorithm to compose longer and more coherent passages, which could be used to improve translation services, chatbots, and A.I. writing assistants. That doesn’t mean it will necessarily revolutionize the field. Advertisement Advertisement Nevertheless, OpenAI said that it would only be publishing a “much smaller version” of the model due to concerns that it could be abused. The blog post fretted that it could be used to generate false news articles, impersonate people online, and generally flood the internet with spam and vitriol. While people can, of course, create such malicious content themselves, the implementation of sophisticated A.I. text generation may augment the scale at which it’s generated. What GPT-2 lacks in elegant prose stylings it could more than make up for in its prolificacy. Advertisement Advertisement Yet the prevailing notion among most A.I. experts, including those at OpenAI, was that withholding the algorithm is a stopgap measure at best. Plus, “It’s not clear that there’s any, like, stunningly new technique they [OpenAI] are using. They’re just doing a good job of taking the next step,” says Robert Frederking, the principal systems scientist at Carnegie Mellon’s Language Technologies Institute. “A lot of people are wondering if you actually achieve anything by embargoing your results when everybody else can figure out how to do it anyway.” Advertisement An entity with enough capital and knowledge of A.I. research that’s already out in the public could build a text generator comparable to GPT-2, even by renting servers from Amazon Web Services. If OpenAI had released the algorithm, you perhaps would not have to spend as much time and computing power developing your own text generator. But the process by which it built the model isn’t exactly a mystery. (OpenAI did not respond to Slate’s requests for comment by publication.) Some in the machine learning community have accused OpenAI of exaggerating the risks of its algorithm for media attention and depriving academics, who may not have the resources to build such a model themselves, the opportunity to conduct research with GPT-2. However, David Bau, a researcher at MIT’s Computer Science and Artificial Intelligence Laboratory, sees this decision more of a gesture intended to start a debate about ethics in A.I. “One organization pausing one particular project isn’t really going to change anything long term,” says Bau. “But OpenAI gets a lot of attention for anything they do … and I think they should be applauded for turning a spotlight on this issue.” Advertisement Advertisement Advertisement It’s worth considering, as OpenAI seems to be encouraging us to do, how researchers and society in general should approach powerful A.I. models. The dangers that come with the proliferation of A.I. won’t necessarily involve insubordinate killer robots . Let’s say, hypothetically, that OpenAI had managed to create a truly unprecedented text generator that could be easily download

오픈AI GPT-2 안전 및 보안 텍스트 생성 AI 윤리

GPT-2에서 클로드 미토스까지: '출시엔 위험'했던 AI의 귀환

과거 OpenAI가 GPT-2의 전면 공개를 미뤘던 논쟁이 Anthropic의 신규 모델 'Claude Mythos'를 통해 다시금 주목받고 있습니다. 이번에는 가드레일과 안전성 평가를 거친 후 공개하는 업계의 기존 방식을 넘어, 보안 취약점 발견에 특화된 모델을 통제된 환경에서만 방어적 목적으로 배포하는 'Project Glasswing'이 소개되었습니다. 글로벌 테크 기업들이 연합에 참여하며, 단순히 모델을 보류하는 것이 아닌 철저히 통제하며 활용하는 새로운 안전 기준을 제시하고 있습니다.

AI 안전성 Anthropic 사이버 보안