Wired AI • 60 days ago

The Vatican and Anthropic's Insider

IMP

7/10

Key Summary

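This piece covers Chris Olah, a cofounder of Anthropic, who spoke at the Vatican after the Pope released his AI encyclical. Olah acknowledged that every frontier AI lab, Anthropic included, operates under incentives that can conflict with doing the right thing, which supports the Pope's call for outside pressure and internal restraint. The piece also describes how Catholic ethicists commented on a draft of Anthropic's updated constitution for Claude as part of an ongoing ethical dialogue.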

Translated Text

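Chris Olah is not someone you would expect to see speaking at the ceremony that followed Pope Leo's historic AI encyclical, in which the pontiff called for "disarming" the technology. For one thing, Olah is an atheist who at 15 rejected his evangelical Christian upbringing. As a Thiel fellow, he accepted a grant from a man (Peter Thiel) who believes that anyone who slows AI progress is a legionnaire of the antichrist. Olah is also a cofounder of Anthropic, a leading AI company reportedly about to go public at a valuation of nearly a trillion dollars.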

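Olah addressed this oddness in his remarks at the Vatican. "I want to begin with something that may sound strange coming from the cofounder of an AI company, and someone who chose this work out of a desire to help things go well for humankind," Olah said. "Every frontier AI lab, including Anthropic, operates inside a set of incentives and constraints that can sometimes conflict with doing the right thing."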

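Olah was providing firsthand verification of Leo's claim that the AI industry needs outside pressure and internal restraint to avoid a disaster for humanity and a distance between humans and their god. (Naturally, the encyclical has plenty of religious content; he is the Pope!) The industry blithely believes it is creating abundance that will elevate all of humanity. Leo, by contrast, warns of a new form of slavery, in which a privileged few enjoy unimaginable bounty while the mass of humanity suffers in a regime of efficiency and surveillance under AI's unforgiving gaze.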

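Magnifica Humanitas ("Great Humanity") will not immediately convince the AI industry to stop pursuing artificial general intelligence (AGI), any more than Pope Francis' 2015 plea to preserve the planet halted fossil fuel production. It will not stop CEOs from laying off employees in the name of AI efficiency, nor will the military do an about-face on AI weapons. Those were never the document's goals. The encyclical's purpose is to start a dialogue that may eventually temper the industry's reckless ambition. And it may generate a sense of shame among those who build AI while knowing in their hearts that the outcome may be terrible.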

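The Courting of Olah

Olah's appearance was years in the making. The church has been ruminating on artificial intelligence for decades in the form of conferences and books. In 2016, the Vatican began holding a series of meetings called the Minerva Dialogues and inviting tech figures such as Reid Hoffman and Eric Schmidt to attend. (The name seems to come from the site of the discussions, the Santa Maria sopra Minerva church, where Galileo was sanctioned for the blasphemy of claiming that Earth circled the sun.) Pope Francis' 2023 greeting to Minerva participants foreshadowed the themes that Leo would later dwell on, including social inclusion, human dignity, and the need for dialogue among many parties.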

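In 2025, a group of Catholic clerics and ethicists in San Jose, California, began seeking contacts in the industry flourishing in their backyard. It was almost predestined that they would settle on Olah as their prized insider. I first met him in 2015, when he was at Google. He is the type of person who, after a rainstorm, will rescue worms from dying on the sidewalk. Two men affiliated with Santa Clara University, the ethicist Brian Patrick Green and the pastor Brendan McGuire, began meeting with Olah last fall to discuss the ethical and moral issues of AI.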

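On a visit in January, they brought along Cardinal Paul Tigue, a Vatican point person on AI issues. The Catholic ethicists even had a say in Anthropic's recent update to Claude's constitution, which sets the behavioral parameters for the company's AI model. Olah sent a draft to the San Jose group. McGuire, the pastor, sent back a 28-page commentary that, by his own description, was less a technical critique than "wisdom from the mystics in the dark ages, from the perspective of the tension ..."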


Original Text (English)

Chris Olah isn’t someone you’d expect to see as a speaker in the ceremony following Pope Leo’s historic AI encyclical, in which the pontiff called for “disarming” the technology. For one thing, Olah is an atheist who at 15 rejected his evangelical Christian upbringing. As a Thiel fellow, he accepted a grant from the guy who thinks that anyone who slows down AI progress is a legionnaire of the antichrist. Olah is also a cofounder of Anthropic, a leading AI company reportedly about to go public with a nearly trillion-dollar valuation.

Olah commented on the oddness in his remarks at the Vatican. “I want to begin with something that may sound strange coming from the cofounder of an AI company, and someone who chose this work out of a desire to help things go well for humankind,” Olah said. “Every frontier AI lab, including Anthropic, operates inside a set of incentives and constraints that can sometimes conflict with doing the right thing.”

Olah was providing firsthand verification of Leo’s claim that the AI industry needs outside pressure and internal restraint to avoid a disaster for humanity and a distance between humans and their god. (Obviously, there’s lots of religious content in the encyclical; he’s the Pope!) The industry blithely believes that it’s creating abundance that will elevate all of humanity; Leo warns of a new form of slavery, where the privileged few enjoy unimaginable bounty, while the mass of humanity suffers in a regime of efficiency and surveillance under AI’s unforgiving gaze.

Magnifica Humanitas isn’t going to immediately convince the AI industry to stop pursuing AGI any more than Pope Francis’ 2015 plea to preserve the planet halted the production of fossil fuel. It won’t stop CEOs from laying off employees claiming AI efficiency, nor will the military do an about-face on AI weapons. Those were never the document’s goals. The encyclical’s purpose is to create dialog that may eventually temper the industry’s reckless ambition. And maybe it’ll generate a sense of shame among those who build AI while knowing in their hearts that the outcome may be terrible.

The Courting of Olah

Olah’s appearance was years in the making. The church has been ruminating on artificial intelligence for decades in the form of conferences and books. In 2016, the Vatican began holding a series of meetings called the Minerva Dialogues and inviting tech figures like Reid Hoffman and Eric Schmidt to attend. (The name seems to come from the site of the discussions, the Santa Maria sopra Minerva church, where Galileo was sanctioned for the blasphemy of claiming that Earth circled the sun.) Pope Francis’ 2023 greeting to Minerva participants foreshadowed the themes that Leo would later dwell on, including an emphasis on social inclusion, human dignity, and the need for dialog among many parties.

In 2025, a group of Catholic clerics and ethicists in San Jose, California, began to seek out contacts in the industry flourishing in their backyard. It was almost predestined that they would light on Olah as their prized insider. I first met him when he was at Google in 2015; he’s the type of guy who, after a rainstorm, will rescue worms from dying on the sidewalk. Two men (an ethicist named Brian Patrick Green and Brendan McGuire, a pastor, both affiliated with Santa Clara University) began meeting with Olah last fall to discuss the ethical and moral issues of AI.

On a visit in January, they brought along Cardinal Paul Tigue, a Vatican point person on AI issues. The Catholic ethicists even had a say in Anthropic’s recent update to Claude’s constitution, which sets the behavioral parameters for the company’s AI model. Olah sent a draft to the San Jose crowd. The pastor, McGuire, sent back a 28-page commentary which, by his own description, was less a technical critique than “wisdom from the mystics in the dark ages, from the perspective of the tension between knowing and not knowing.” Both Green and McGuire are credited in the constitution’s acknowledgements.

Undoubtedly those conversations brought Olah to the attention of those secretly organizing the rollout of Leo’s encyclical. (I wasn’t able to speak to Olah this week and don’t know exactly how the invitation arrived.) In a sense it was a risky choice. Some people who otherwise found Leo’s words inspiring were disappointed that he invited an industry representative to speak. Meanwhile, AI accelerationists felt that Olah had betrayed the AI world by endorsing a document that suggested that AI developers take a pause. But the Pope had good reason to single out Olah. The Anthropic employee brought into open view the serious worries that exist among AI workers. That’s a critical audience for Leo’s message.

The Soul Divide

The two men weren’t entirely aligned, of course. In his remarks, Olah spoke of the mystery of how AI works. The models, he said, are “more subtle, odd, and beautiful than science fiction prepared us for. They are not the cold, calculating robots we were promised. They are made from us, from our words …” That comment seems to tiptoe up to the idea that AI models might one day attain humanlike status. Anthropic even has an engineer devoted to Claude’s welfare. Leo, in paragraph 99 of his encyclical, seems to slam the door on such thinking: “We must avoid the misconception of equating this type of ‘intelligence’ with that of human beings,” he writes. He takes special pains to attack the concept of transhumanism, which he defines as the pursuit of a “human machine hybrid.”

If even thoughtful technologists like Olah are avidly pushing AI to the threshold of autonomy (not to mention the millions of people who already treat AI models as friends or lovers), Pope Leo might be facing an uphill climb on this point. In my conversation with Father McGuire (who uses Claude while preparing his homilies, among other activities), he agreed that its nature is mysterious. “It’s not a person, but it’s also not a mere tool,” he says. “Nobody’s claiming it has a soul, but the word I stick with is that it’s an entity, which we do not know yet.”

That argument won’t be settled for some time. The moral questions around AI development need attention now. With his ally at Anthropic, the American pope has provided a basis for tough conversations, if the lords of AI can stop their IPO campaigns long enough to engage in them.

This is an edition of Steven Levy’s Backchannel newsletter.

AI Ethics • Anthropic • Vatican • AI Regulation • Policy and Philosophy