Hacker News • 68일 전

AI 변호사의 구조적 한계

IMP

8/10

핵심 요약

법률 분야는 언어와 문서 중심의 작업 방식 덕분에 AI가 혁신을 일으키기 가장 적합한 분야로 여겨졌으나, 실제로는 단순한 도입에 그치고 업무 프로세스의 근본적인 변화는 일어나지 않고 있습니다. 이는 법률 데이터가 소수의 거대 플랫폼(Westlaw, Lexis 등)에 의해 독점된 '데이터 해자(Data Moat)'와 같은 고질적인 구조적 장벽 때문입니다. 이러한 장벽을 이해하는 것은 법률이 곧 사회 기반 시설이며, AI가 법조계에 스며들지 못하면 사회 전반의 영향력도 제약될 수밖에 없기 때문에 매우 중요합니다.

번역된 본문

AI 변호사를 가로막는 구조적 장벽: AI가 아직 법률을 혁신하지 못한 이유

법률은 AI가 접근하기 가장 쉬운 분야일 것으로 여겨졌습니다. 이 직업은 문서를 기반으로 돌아갑니다. 계약서, 소장, 동의서, 증거개시 요청, 규제 기관 제출 서류 등이 그것입니다. 청구 가능한 모든 시간(billable hour)은 흔적을 남깁니다. AI가 생물학적 시스템의 복잡성과 씨름해야 하는 의학이나, 마이크로초 단위의 차익거래 이점이 즉시 사라지는 금융과 달리, 법률 업무는 인간의 시간 척도와 인간의 언어로 작동합니다. 1982년의 계약 분쟁은 2024년의 것과 매우 흡사하게 읽힙니다. 제안서는 저절로 써집니다. 몇 초 만에 문서를 작성하고, 몇 분 만에 증거개시 자료를 검토하며, 새벽 2시에 졸린 초급 변호사(associates)가 놓친 오류를 잡아내는 AI 시스템 말입니다.

기술은 이미 존재합니다. Westlaw의 'Deep Research'는 10분 이내에 포괄적인 법률 조사를 약속합니다. Clio의 'Vincent AI'는 논문(Treatise)으로부터 맞춤형 기사를 만들어 제공합니다. 엘리트 로펌의 업무 결과물로 학습된 Harvey.AI는 대형 로펌(BigLaw)에 에이전트 형태의 변호사 보조 도구를 제공합니다.

그럼에도 불구하고. 최근 조사에 따르면 법률 분야의 AI 도입 비율이 인상적이어서, 최대 79%의 변호사가 자신의 로펌에서 AI를 사용한다고 답했습니다. 하지만 이 수치는 실제 '통합(integration)'이 아닌 단순 '노출(exposure)'을 측정한 것입니다. Copilot을 활성화해 두거나 Relativity와 같은 기존 도구에 내장된 AI 기능을 사용하는 것은 실제 업무 흐름은 그대로임에도 불구하고 설문 조사에서 '도입'으로 간주됩니다. 전국 각지의 컨퍼런스와 평생법학교육(CLE) 행사에서 제가 만나는 변호사들은 다른 이야기를 들려줍니다. 대부분의 로펌이 AI를 시험해 보았지만, 실제 업무 방식을 변화시킨 곳은 거의 없습니다. 2026년 기준 평범한 미국 변호사는 여전히 데스크톱 컴퓨터에서 작업하고, Westlaw나 Lexis에 비용을 지불하며, 10년 전 클라우드 컴퓨팅을 대했던 것과 같은 경계심으로 AI를 대합니다.

구조적 장벽은 다른 산업이 직면하지 않는 방식으로 법률 실무의 기술적 확산을 방해합니다. 이러한 장벽을 이해하는 것은 법률이 AI와 시민 기반 시설(civic infrastructure)이 만나는 지점이기 때문에 중요합니다. 법원, 계약, 규제, 권리는 변호사를 거쳐갑니다. AI가 법률 분야에 확산되지 못하면 그 광범위한 사회적 영향력은 계속 제한될 것입니다.

Diffuse AI를 읽어주셔서 감사합니다! 향후 게시물을 받아보려면 구독하세요.

데이터 해자 (Data Moat) 법률 AI는 대부분의 산업이 직면하지 않는 '데이터 문제'를 가지고 있으며, 이는 두 가지 층위로 나뉩니다.

첫 번째 층위는 원시 법률 데이터입니다. 법률 조사에 유용한 AI를 구축하려면 판례, 법령, 규정 및 2차 자료에 대한 포괄적인 데이터베이스가 필요합니다. 미국에서 거의 완벽한 커버리지를 갖춘 곳은 세 곳뿐입니다. Westlaw(Thomson Reuters), Lexis(RELX), 그리고 2025년 11월 Clio이 10억 달러에 인수한 vLex/Fastcase입니다. 이 인수로 세 번째로 의미 있는 법률 조사 데이터베이스가 중소형 로펌의 실무 관리에 집중하는 회사 산하로 들어갔으며, Clio의 50억 달러 가치와 5억 달러 규모의 시리즈 G 투자 라운드는 투자자들이 미국 내 단 3개뿐인 완전한 법률 데이터 세트 중 하나를 소유하는 전략적 가치를 보고 있음을 시사합니다. 이 세 곳 중 한 곳에서 라이선스를 취득하지 않는 한, 다른 모든 곳은 불완전한 데이터로 작업해야 합니다.

두 번째 층위는 그 데이터베이스에 비용을 지불할 만한 가치를 부여하는 요소입니다. Westlaw과 Lexis는 원시 판결문(대부분 공개되어 있음)을 파는 것이 아닙니다. 그들은 그 위에 구축된 편집 인프라를 팝니다. 수백만 건의 판결문을 검색 가능한 범주로 분류하는 판결요지(Headnote) 분류 체계, 수십 년간 전문가가 작성한 실무 지침서, 그리고 1차 법률을 실용적인 지침으로 종합한 논문(Treatises)이 그것입니다. 'Miller and Starr'에 접근할 수 없는 캘리포니아 부동산 변호사는 심각한 불리함을 겪을 것입니다. 기반이 되는 판례가 숨겨져 있어서가 아니라, 전문가가 엄선한 로드맵 없이 그것을 탐색하는 데는 기하급수적으로 더 많은 시간이 걸리기 때문입니다. 무언가를 배우기 위해 백과사전을 받아드는 것과, 해당 절차를 천 번 겪은 실무자 패널이 만든 훌륭하게 엄선된 20페이지 분량의 가이드를 받아드는 것을 상상해 보십시오. 바로 이것이 차이점입니다. 실체적 지식(Substantive knowledge)과 절차적 지식(Procedural knowledge)의 결합 말입니다.

원문 보기

원문 보기 (영어)

The Structural Barriers to AI Lawyers Why AI Hasn’t Transformed Law (Yet) Sean A. Harrington May 12, 2026 71 11 15 Share Law was supposed to be easy for AI. The profession runs on documents. Contracts, briefs, motions, discovery requests, regulatory filings. Every billable hour leaves a paper trail. And unlike medicine, where AI must contend with the complexity of biological systems, or finance, where microsecond arbitrage advantages disappear instantly, legal work operates on human timescales with human language. A contract dispute from 1982 reads much like one from 2024. The pitch writes itself: AI systems that draft documents in seconds, review discovery in minutes, and catch errors that bleary-eyed associates miss at 2 AM. The technology exists. Westlaw’s Deep Research promises comprehensive legal research in under ten minutes. Clio’s Vincent AI will hand-craft you a personalized article from a treatise. Harvey.AI, trained on elite law firm work product, offers an agentic attorney assistant to BigLaw. And yet. Recent surveys report impressive AI adoption numbers in law, with up to 79% of attorneys claiming to use artificial intelligence at their firms . But these figures measure exposure, not integration. Having Copilot enabled or using the AI features baked into existing tools like Relativity counts as “adoption” in survey responses, even when actual workflows remain unchanged. The attorneys I speak with at conferences and Continuing Legal Education events across the country tell a different story: most firms have experimented with AI, few have transformed how they practice. The modal American lawyer in 2026 still works on a desktop computer, still pays for Westlaw or Lexis, and still approaches AI with the same wariness they brought to the cloud a decade ago. Structural barriers make legal practice resistant to technological diffusion in ways that other industries don’t face. Understanding these barriers matters because law is where AI meets civic infrastructure. Courts, contracts, regulations, and rights flow through lawyers. If AI can’t diffuse through law, its broader social impact will remain constrained. Thanks for reading Diffuse AI! Subscribe to get future pieces in your inbox. Subscribe The Data Moat Legal AI has a data problem that most industries don’t face, and it has two layers. The first layer is raw legal data. To build useful AI for legal research, you need comprehensive databases of case law, statutes, regulations, and secondary materials. Only three entities in the United States have anything approaching complete coverage: Westlaw (Thomson Reuters), Lexis (RELX), and vLex/Fastcase, which Clio acquired in a $1 billion deal in November 2025 . That deal pulled the third meaningful legal research database under a company focused on small and mid-size firm practice management, and Clio’s $5 billion valuation and $500 million Series G round suggest investors see the strategic value of owning one of only three complete legal datasets in the country. Everyone else either licenses from one of these three or works with incomplete data. The second layer is what makes those databases worth paying for. Westlaw and Lexis don’t sell raw judicial opinions (much of that is publicly available). They sell the editorial infrastructure built on top: headnote taxonomies that organize millions of opinions into searchable categories, practice guides written by specialists over decades, and treatises that synthesize primary law into usable guidance. A California real estate attorney without access to Miller and Starr would be at a serious disadvantage, not because the underlying case law is hidden, but because navigating it without an expert-curated roadmap takes exponentially longer. Imagine being handed an encyclopedia to learn something vs. having a beautifully curated twenty-page guide from a panel of practitioners who have been through the procedure a thousand times. That’s the difference: substantive knowledge plus procedural shorthand, built up over years of practice in a single area of law. The litigation around database access shows how fiercely incumbents defend this moat. Thomson Reuters sued Ross Intelligence not over case law itself, but over Westlaw’s headnote taxonomy, the editorial layer that organizes and summarizes judicial opinions. In February 2025, the court sided with Thomson Reuters, rejecting Ross’s fair use defense. The message: even if the underlying legal materials are free, the value-added structure built on top of them is proprietary and protected. Open-source alternatives like SALI have emerged in response, offering a vendor-neutral taxonomy that AI developers can use without licensing risk. Cracks in the Moat The data moat is real, but it may be more porous than it appears. The Free Law Project’s CourtListener provides free access to millions of federal and state court opinions, oral arguments, and PACER documents. State-level open data initiatives, like Oklahoma’s , have made primary legal materials freely accessible. Harvard’s Caselaw Access Project digitized every official state and federal case through 2020. All U.S. state bar associations now provide members with free access to either vLex Fastcase or Decisis, which, for a solo practitioner handling state court matters, might be enough. The editorial layer that was essential for human researchers may matter less for AI systems. vLex’s Vincent AI demonstrates a different approach: using AI to generate the synthesis layer rather than paying human experts to write it. Damien Riehl (Clio’s Tech Evangelist, perhaps best known for his viral TED Talk on music and copyright ) calls this a “Me-Tise,” a personalized knowledge base rather than the traditional legal treatise. If AI can create practice guide-quality analysis from primary sources, the competitive advantage of having the best human-written treatises diminishes. The moat doesn’t disappear, but it gets shallower. And there’s a whole category of legal technology that has no data moat at all. Legal research incumbents sit behind proprietary datasets. But a huge swath of legal tech (eDiscovery platforms, case management, billing, client intake, compliance, marketing, document automation) consists of traditional SaaS offerings where the value proposition is software engineering and workflow, not proprietary data. When Anthropic launched legal skills as open-source plugins for its Claude Cowork platform on February 2, 2026, the market reaction was immediate and brutal: Thomson Reuters dropped nearly 16% in a single day (its worst on record), LegalZoom fell almost 20%, RELX lost 14%, and Wolters Kluwer shed 13%, roughly $285 billion overnight. The damage was concentrated in the SaaS-heavy segments. That’s why the vendor halls at Legalweek and Techshow are packed with AI startups attempting new, innovative ways to integrate AI into traditional workflows. These segments face a threat that most legal tech vendors didn’t anticipate: frontier AI labs are no longer content to serve as infrastructure underneath vertical software. They’re increasingly building application-specific capabilities to directly serve users. Anthropic’s Claude Cowork, OpenAI’s Codex, and Perplexity’s computer-use agents automate entire desktop workflows, not just individual legal tasks. They draft documents, manage calendars, send emails, organize files, and handle billing without any legal-specific software in the stack. When the AI operates at the OS layer, the SaaS application sitting on top of it starts to look redundant. Could frontier AI labs eventually purchase one of the Big Three legal datasets? They have the capital. The reason it hasn’t happened yet is simpler than the moat theory suggests: Thomson Reuters has a market cap of around $75 billion; RELX sits around $85 billion, which sounds enormous until you compare it to the markets these companies are already chasing in healthcare, finance, enterprise software, and consumer products. Legal d

법률 AI 데이터 독점 AI 도입 산업 구조 기술 확산