The Decoder • 96일 전

오픈AI, 개인정보 자동 삭제 오픈소스 모델 공개

IMP

8/10

핵심 요약

오픈AI가 텍스트 내 개인정보를 자동으로 탐지하고 삭제하는 오픈소스 모델 '프라이버시 필터(Privacy Filter)'를 공개했습니다. 이 모델은 15억 파라미터(요청당 5천만 개 활성)의 가벼운 크기로, 클라우드 연결 없이 로컬 노트북이나 브라우저에서도 빠르고 안전하게 구동되어 상업적 활용이 가능합니다. 단, 영어 외 언어 처리 성능이 다소 떨어지며 법적 준법을 완벽히 보장하지 않으므로 의료, 법무 등 민감한 분야에서는 반드시 인간의 검토가 병행되어야 합니다.

번역된 본문

오픈AI, 텍스트 내 개인정보를 제거하는 오픈소스 모델 공개 막시밀리안 슈라이너 (Maximilian Schreiner) / 2026년 4월 23일

핵심 요약:

오픈AI는 로컬에서 구동되어 추가 처리 전 텍스트 내 개인정보를 자동으로 삭제하는 오픈소스 AI 모델 '프라이버시 필터(Privacy Filter)'를 공개했습니다.
이 모델은 이름, 주소, 비밀번호 등 8개 데이터 카테고리를 탐지하며, 긴 문서 처리를 지원하고 삭제 민감도를 조절할 수 있습니다. 상업적 사용도 허용됩니다.
이 모델은 법적으로 완벽한 익명화를 보장하지 않으며 영어 외 텍스트에서는 오류가 발생할 수 있어, 민감한 사용 사례의 경우 오픈AI는 인간의 검토를 권장합니다.

오픈AI는 텍스트 내 개인정보를 탐지하고 삭제하기 위해 설계된 오픈소스 모델인 '프라이버시 필터(Privacy Filter)'를 공개했습니다.

오픈AI에 따르면, 프라이버시 필터는 자체 AI 모델 학습이나 제3자와의 데이터 공유 등 추가 처리를 진행하기 전 대량의 텍스트를 정제해야 하는 팀을 위해 개발되었습니다.

이 모델은 15억 파라미터(1.5 billion parameters)의 비교적 작은 크기이며, 요청당 5천만 개의 활성 파라미터(50 million active parameters)만 사용합니다. 또한 클라우드 연결 없이 노트북이나 브라우저에서 직접 구동하는 것을 명시적으로 지원하여 로컬 하드웨어 환경에서도 실행 가능합니다.

이 모델은 총 8가지 카테고리의 민감한 콘텐츠(이름, 주소, 이메일 주소, 전화번호, URL, 날짜, 계좌 번호 및 비밀번호나 API 키와 같은 기타 보안 정보)를 탐지합니다. 기존의 챗봇과 달리 새로운 텍스트를 생성하지 않으며, 입력된 내용을 단 한 번 읽어 들여 각 부분이 어느 카테고리에 속하는지 레이블을 지정하는 방식으로 작동합니다. 오픈AI에 따르면 128,000 토큰의 컨텍스트 윈도우(context window) 덕분에 긴 문서를 분할하지 않고도 처리할 수 있습니다.

사용자는 설정을 조정하여 모델이 민감하게 삭제할지(재현율이 높고 오탐지가 많음), 아니면 보수적으로 삭제할지(오탐지는 적지만 누락되는 항목이 많음) 제어할 수 있습니다. 자체 데이터셋을 보유한 팀은 추가로 모델 파인튜닝(fine-tuning)도 진행할 수 있습니다.

프라이버시 필터는 깃허브(GitHub)와 허깅페이스(Hugging Face)에서 아파치 2.0(Apache 2.0) 라이선스로 제공되며, 상업적 사용이 허용됩니다.

민감한 사용 사례에 대한 명확한 한계점

오픈AI는 프라이버시 필터가 익명화나 법적 준수에 대해 어떠한 법적 보장도 제공하지 않는다는 점을 명확히 밝혔습니다. 이 모델은 포괄적인 데이터 보호 전략의 한 레이어로만 사용되도록 의도되었습니다.

오픈AI는 몇 가지 약점도 직접 나열했습니다. 희귀하거나 특정 지역에서만 쓰이는 이름은 탐지되지 않고 누락될 가능성이 높으며, 잘 알려진 공인이나 조직의 이름이 잘못 삭제되는 경우도 가끔 발생합니다. 또한 영어가 아닌 텍스트나 비 라틴 문자(Non-Latin scripts) 환경에서는 성능이 저하됩니다.

의료, 법률, 금융, 인사(HR)와 같은 민감한 분야의 경우 오픈AI는 프로세스 내에 반드시 인간의 검토(human review)를 포함할 것을 명시적으로 권장하고 있습니다. 레이블 카테고리는 런타임 중에 변경할 수 없으므로, 다른 정책이 필요한 팀은 모델을 직접 파인튜닝해야 합니다.

원문 보기

원문 보기 (영어)

OpenAI releases open-source model that strips personal data from text Maximilian Schreiner View the LinkedIn Profile of Maximilian Schreiner Apr 23, 2026 Nano Banana Pro prompted by THE DECODER Key Points OpenAI has published "Privacy Filter," an open-source AI model that runs locally and automatically redacts personal data from text before further processing. The model detects eight data categories including names, addresses, and passwords, handles long documents, and offers adjustable redaction sensitivity. Commercial use is permitted. Since the model doesn't guarantee legally compliant anonymization and struggles with non-English text, OpenAI recommends human review for sensitive use cases. Ask about this article… Search OpenAI has released Privacy Filter, an open-source model designed to detect and redact personal data in text. According to OpenAI, Privacy Filter is built for teams that need to clean large volumes of text before processing it further, whether for training their own AI models or sharing data with third parties. The model is relatively small at 1.5 billion parameters, uses only 50 million active parameters per request, and runs on a laptop or even directly in a browser, OpenAI says. Running it on local hardware without any cloud connection is explicitly supported. The model detects eight categories of sensitive content: names, addresses, email addresses, phone numbers, URLs, dates, account numbers, and other secrets like passwords or API keys. Unlike traditional chatbots, it doesn't generate new text. Instead, it makes a single pass through the input and labels which parts belong to which category. A 128,000-token context window lets it process long documents without splitting them up, according to OpenAI. Ad Users can adjust settings to control whether the model redacts aggressively (high recall, more false positives) or conservatively (fewer false positives, but more missed items). Teams with their own datasets can also fine-tune the model further. Ad DEC_D_Incontent-1 Privacy Filter is available under the Apache 2.0 license on GitHub and Hugging Face , and commercial use is permitted. Clear limits for sensitive use cases OpenAI is clear that Privacy Filter does not provide any legal guarantee of anonymization or compliance. The model is meant to be just one layer in a broader data protection strategy. OpenAI itself lists several weaknesses: rare or regionally uncommon names are more likely to be missed, well-known public figures or organizations sometimes get incorrectly redacted, and performance drops with non-English text or non-Latin scripts. Ad For sensitive fields like healthcare, law, finance, or human resources, OpenAI explicitly recommends keeping human review in the loop. The label categories also can't be changed at runtime, meaning that teams that need a different policy will have to fine-tune the model. Ad DEC_D_Incontent-2 AI News Without the Hype – Curated by Humans Subscribe to THE DECODER for ad-free reading, a weekly AI newsletter, our exclusive "AI Radar" frontier report six times a year, full archive access, and access to our comment section. Subscribe now Source: GitHub / Privacy Filter | Hugging Face / Privacy Filter

오픈AI 개인정보보호 오픈소스 로컬AI