Wired AI • 63일 전

바티칸이 앤스로픽을 교황의 AI 칙서 발표에 초청한 이유

IMP

7/10

핵심 요약

교황 레오 14세의 첫 AI 칙서 발표에 앤스로픽(Anthropic)의 공동 창립자인 크리스토퍼 올라가 참석하여 가톨릭교회와 실리콘밸리 간의 전례 없는 연합을 시사했습니다. 이는 바티칸이 기술적 윤리를 넘어 인류의 미래를 논의하기 위해 '안전과 정렬(Alignment)'을 핵심 가치로 삼는 앤스로픽을 핵심 대화 파트너로 삼았음을 의미합니다. 이번 행사는 AI 기술의 통제권과 철학적 해석 가능성에 대한 심도 있는 논의가 종교계와 산업계 사이에서 본격적으로 이루어지고 있음을 보여줍니다.

번역된 본문

월요일 바티칸에서 교황 레오 14세가 인공지능에 관한 그의 첫 칙서를 발표했을 때, 그는 앤스로픽(Anthropic)의 공동 창립자인 크리스토퍼 올라(Christopher Olah)를 연설자로 초청했습니다. 이러한 움직임은 가톨릭교회와 실리콘밸리 간의 전례 없는 동맹을 알리는 신호탄이었습니다. 하지만 이 파트너십이 어떻게 성사되었는지 이해하려면 앤스로픽의 설립 배경으로 거슬러 올라가야 합니다.

왜 앤스로픽인가? 다리오 아모데이(Dario Amodei)와 다니엘라 아모데이(Daniela Amodei)를 포함한 일단의 오픈AI(OpenAI) 연구원들이 독자적인 연구소를 세우기 위해 퇴사한 후, 앤스로픽은 2021년에 설립되었습니다. 그들은 인공지능 모델이 단순히 경쟁과 속도의 논리에 따라서만 개발되기에는 너무 강력해지고 있다는 확고한 신념을 가지고 있었습니다. 그 이후로 앤스로픽은 'AI 안전성(AI safety)'이라는 개념을 중심으로 대중적인 이미지를 구축해 왔습니다. 이 회사는 단순히 강력한 모델을 구축하는 것뿐만 아니라, 통제 가능하고 윤리적 원칙에 의해 안내되는 모델을 만드는 것을 목표로 합니다. 바로 여기에서 '헌법적 AI(Constitutional AI)'라는 개념이 등장하게 되었습니다. 이는 위험하고 해로운 응답을 수동으로 수정하는 대신, 원칙과 규칙으로 구성된 일종의 헌법을 사용하여 시스템을 훈련시키는 아이디어입니다.

바티칸과의 융합은 어떻게 시작되었나? 바티칸에서 열린 올라의 연설은 결코 우연이 아니었으며, 마지막 순간의 상징적인 제스처의 결과도 아니었습니다. 이는 바티칸이 점진적으로 기술에 대한 도덕적 관찰자에서 AI 산업의 직접적인 대화 상대로 변모하려는 의도적이고 장기적인 노력의 결실이었습니다. 첫 번째 주요 이정표는 2020년 교황청 생명학술원이 마이크로소프트, IBM 및 기타 국제기구와 함께 추진한 이니셔티브인 'AI 윤리를 위한 로마 콜(Rome Call for AI Ethics)'이었습니다. 그 목표는 투명성, 포용성, 책임성을 포함하여 AI 개발을 위한 윤리적 원칙의 공유된 기반을 확립하는 것이었습니다. 당시 바티칸은 주로 생명윤리와 도덕적 질문의 영역에서 활동하는 것으로 보였습니다. 그러나 그 후 몇 년 동안 상황은 극적으로 변했습니다. 챗GPT(ChatGPT)의 등장, 미국과 중국 간의 기술 패권 경쟁, 그리고 빅테크(Big Tech)의 영향력 증가는 교황청이 이 문제가 더 이상 단순한 기술 윤리의 문제가 아니라 인류의 미래 그 자체에 관한 것임을 확신하게 만들었습니다. 이러한 맥락에서 앤스로픽은 바티칸이 보기에 특히 중요한 대화 상대로 여겨지게 되었습니다. 혁신과 성장을 중심으로 명성을 쌓은 다른 실리콘밸리 기업들과 달리, 앤스로픽은 AI 안전을 핵심 정체성으로 삼았기 때문입니다. 최근 몇 년 동안 바티칸은 기술 논쟁 중에서도 특별히 'AI 모델의 정렬(Alignment)'이라는 특정 분야에 각별한 주의를 기울여 왔습니다.

올라의 역할. 바로 이 지점에서 크리스토퍼 올라가 등장합니다. 언론에 더 자주 노출되는 아모데이 남매와 달리, 올라는 AI 연구의 보다 이론적이고 거의 철학적인 측면을 대표합니다. 그는 모델의 해석 가능성(interpretability), 즉 점점 더 복잡해지는 신경망 내부에서 실제로 어떤 일이 일어나는지 이해하려는 노력과 관련하여 세계에서 가장 잘 알려진 연구원 중 한 명입니다. 크리스토퍼 올라는 자신의 개인 웹사이트에서 자신을 '신경망을 인간이 이해할 수 있는 알고리즘으로 변환하려고 노력하는 사람'이라고 설명합니다. 그리고 너무 강력해져서 이해, 통제 또는 관리할 수 없게 되는 기술을 구축하는 위험성을 중심으로 한 성찰인 교황 레오 14세의 칙서 핵심과 더욱 완벽하게 일치하는 인물을 상상하기 어렵습니다. 여러 언론 보도에 따르면, 교황청 측과 앤스로픽 간의 접촉은 글로벌 AI 안전 정상회의 기간 중에 집중적으로 이루어진 것으로 보입니다. 바티칭은 앤스로픽을 통해 적어도 인공지능의 문제를 기술 산업만의 힘으로는 해결할 수 없다는 점을 공개적으로 인정할 의지가 있는 기업을 발견했습니다. 이는 칙서의 본문과 발표 과정 전반에 걸쳐 강조된 지점이기도 합니다. 이 문서는 기술 산업이 단독으로 해결할 수 없다는 아이디어를 반복적으로 강조하고 있습니다.

원문 보기

원문 보기 (영어)

Comment Loader Save Story Save this story Comment Loader Save Story Save this story When Pope Leo XIV presented his first encyclical on artificial intelligence at the Vatican on Monday, he invited Christopher Olah, cofounder of Anthropic, to speak. The move signaled an unprecedented alliance between the Catholic church and Silicon Valley. But to understand how this partnership came about, we need to go back to Anthropic's founding. Why Anthropic? Anthropic launched in 2021 after a group of OpenAI researchers, including Dario and Daniela Amodei, left to form a rival lab. They did so with a clear conviction: Artificial intelligence models were becoming too powerful to be developed exclusively according to the logic of competition and speed. Since then, Anthropic has built its public image around the concept of AI safety. The company aims to build not just powerful models, but ones that are controllable and guided by ethical principles. This is where the concept of Constitutional AI comes from: the idea of training systems using a kind of constitution composed of principles and rules, instead of just manually correcting the most risky and dangerous responses. How the Convergence With the Vatican Began Olah's presence at the Vatican was obviously not accidental, nor the result of a last-minute symbolic gesture. It was the outcome of a deliberate, long-term effort in which the Vatican has progressively sought to transform itself from a moral observer of technology into a direct interlocutor with the AI industry. The first major step came in 2020 with the Rome Call for AI Ethics , an initiative promoted by the Pontifical Academy for Life together with Microsoft, IBM, and other international organizations. The goal was to establish a shared foundation of ethical principles for the development of AI, including transparency, inclusion, and accountability. At the time, the Vatican appeared to be operating primarily in the realm of bioethics and moral questions. In the years that followed, however, the context changed dramatically. The rise of ChatGPT, the struggle for technological leadership between the United States and China, and the growing power of Big Tech gradually convinced the Holy See that the issue was no longer just about tech ethics, but about the very future of humanity. In this sense, Anthropic has come to be seen by the Vatican as a particularly important interlocutor. Unlike other Silicon Valley companies that have built their reputations primarily around innovation and growth, Anthropic has made AI safety a core part of its identity. In recent years, the Vatican has followed one specific strand of the technology debate with particular attention: the alignment of AI models. Olah’s Role This is where Christopher Olah comes in. Unlike the Amodei siblings, who are more exposed to the media, Olah represents the more theoretical and almost philosophical side of AI research. He is one of the world's best-known researchers on the topic of model interpretability, or the effort to understand what really happens inside increasingly complex neural networks. On his personal website, Christopher Olah describes himself as someone trying to “transform neural networks into algorithms understandable to human beings.” And it is difficult to imagine a figure more aligned with the core of Pope Leo XIV’s encyclical: a reflection centered on the risk of building technologies that become too powerful to be understood, controlled, or governed. According to various journalistic sources, the contacts between circles close to the Holy See and Anthropic may have intensified right during the global summits on AI safety. The Vatican saw in Anthropic a company at least willing to publicly acknowledge that the problem of artificial intelligence cannot be solved by the technology industry alone. This is a point emphasized both in the text of the encyclical and during its presentation. The document repeatedly insists on the idea that technology is not neutral, and that algorithms inevitably embody a particular worldview. Through its Constitutional AI project, Anthropic is attempting to do precisely this: explicitly introducing values, rules, and principles into the behavior of AI models. In essence, the connection between the Vatican and Anthropic lies in a shared fear: that increasingly powerful systems could end up shaped solely by the economic, geopolitical, and competitive incentives driving the global AI race. Reputation as Product The story also has an industrial dimension, not just a spiritual one: For Anthropic, its relationship with the Vatican inevitably carries enormous reputational value. At a moment when artificial intelligence is becoming increasingly central to debates about labor, national security, surveillance, and military power, the image of an “ethical AI company” is a significant advantage. Claude, Anthropic’s chatbot, was built precisely around the idea of trust with users. The model “responds” to an ethical constitution, the company’s public language continually revolves around responsibility and safety, and ethics itself becomes part of the product’s symbolic infrastructure. “Magnificent” Humanity The very title of the encyclical revolves around a specific tension: The human being is described as “magnificent,” yet at the same time capable of creating new forms of dehumanization. Seen from this perspective, artificial intelligence is not evil in itself. It is a mirror of the people who build it. For this reason, the Vatican repeatedly warns about the risk of a new, “digital Babylon”: a society that reduces people, relationships, and even truth itself to data, performance, and efficiency. In the text, Pope Leo XIV speaks explicitly about the concentration of technological power in the hands of a small number of transnational private actors. It is a critique that echoes many contemporary debates around AI governance: Who controls the models? Who decides the criteria by which they are trained? Who owns the infrastructure of the future? During the presentation of the encyclical, Christopher Olah said something unusual for a tech executive: He openly acknowledged that even companies most attentive to ethics remain immersed in economic, geopolitical, and competitive incentives that can conflict with “doing the right thing.” In doing so, he publicly recognized that the problem of artificial intelligence cannot be left to the self-regulation of technology companies alone. The “Hiroshima of the 21st Century” In debates about AI, comparisons to the atomic bomb return again and again. The difference is that nuclear technology was controlled by states, whereas AI is being developed primarily within private companies. This is one of the central points of the encyclical: Today, “technological power takes on a new face, one that is predominantly private.” And this is where a shared fear emerges—expressed in very different language both by the Vatican and by the parts of the AI world most focused on safety: that increasingly powerful systems, like those now being developed, could end up guided by distorted human incentives. From this perspective, the “Hiroshima of the 21st century” might not be a single catastrophic event, but rather a slow process of social automation in which human beings begin delegating to machines the ways they think, choose, inform themselves, and relate to one another. In other words, the risk is that Magnifica humanitas could transform into something terribilis .

앤스로픽 AI 안전 바티칸 AI 윤리 AI 정책