Hacker News • 67일 전

사망한 조종사 음성 AI 복원에 여객기 사고 DB 접근 중단

IMP

8/10

핵심 요약

인터넷 커뮤니티 사용자들이 스펙트로그램(PDF)과 AI·Griffin‑Lim 알고리즘으로 UPS 2976 사고의 조종석 음성을 복원해 소셜 미디어에 유포한 사건이 발생했습니다. 이에 미국 국가교통안전위원회(NTSB)는 연방법(조종석 음성 공개 금지) 위반 소지와 사생활 침해 우려로 즉시 공개 자료 접근을 전면 일시 중단했습니다. AI·신호처리 기술이 기존의 익명화·보안 조치를 무력화할 수 있음을 보여주는 중요한 사례입니다.

번역된 본문

인터넷 탐정들이 소프트웨어와 AI 도구를 사용해 치명적인 화물기 추락 사고 마지막 순간의 조종사 음성을 복원했습니다. 복원된 음성 녹음이 퍼지자 미국 정부 기관은 연방법이 조종실 음성 기록장치(CVR) 오디오의 공개를 금지하고 있어 민간 교통사고 데이터베이스에 대한 모든 공개 접근을 일시 중단했습니다.

미국 국가교통안전위원회(NTSB)는 항공기 추락 및 기타 민간 교통 사고 조사에서 수집된 사실 보고서와 증거를 공유합니다. 그러나 NTSB는 5월 21일, 해당 정보가 포함된 온라인 기록 시스템(docket)이 조사의 일환으로 공개된 자료를 통해 조종석 음성 기록을 재구성할 수 있게 된 것을 검토하면서 “일시적으로 이용할 수 없다”고 발표했습니다.

NTSB 성명에 따르면, “NTSB는 이미지 인식과 컴퓨팅 기법의 발전으로 인해 NTSB 조사의 일환으로 공개된 소리 스펙트럼 이미지에서 조종실 음성 기록 오디오의 근사치를 재구성할 수 있게 된 것을 인지하고 있다. 여기에는 켄터키주 루이빌에서 작년에 발생한 UPS 2976편 추락 사고에 대한 진행 중인 조사도 포함된다. NTSB는 조종실 음성 녹음을 공개하지 않는다.”

UPS 2976편은 2025년 11월 4일 켄터키주 루이빌에서 이륙 직후 구조적 결함으로 엔진이 물리적으로 분리되어 추락한 유나이티드 택배 서비스(United Parcel Service) MD-11F 화물기였습니다. 탑승했던 3명의 조종사(교대 조종사 포함)가 사망했습니다. 지상의 다른 12명이 사망했으며 23명이 부상했습니다.

미국 의회는 항공 승무원의 프라이버시를 보호하기 위해 NTSB가 조종실 음성 또는 영상 기록장치의 어떠한 부분도 공개하는 것을 금지하는 연방법을 1990년에 제정했습니다. 이 법은 1988년 8월 댈러스-포트워스 국제공항에서 발생한 델타항공 1141편 추락과 관련된 조종석 대화가 방송국을 통해 방영된 것에 대해 항공기 조종사들이 반발한 이후에 만들어졌습니다.

이전에 NTSB에서 일하고 유나이티드 항공에서 보잉 737을 조종했던 사고 조사관이자 분석가인 벤 버먼(Ben Berman)은 다음과 같이 말했습니다. “수십 년 동안 항공기 조종사들이 일상적인 작업 공간에서 하루 종일 녹음되는 것을 기꺼이 수용하게 만든 중요한 요인은, 업무 중 사망할 수 있다는 위협 속에서도 프라이버시가 보호된다는 점이었습니다. 사람들은 자신의 마지막 순간이 공개되어 사고 조사 이외의 목적으로 사용되는 것을 끔찍하게 생각하며, 이것이 연방법의 근거입니다.”

결과적으로 NTSB는 민간 항공 사고 조사 기간 동안 조종실 음성 기록장치를 보호하기 위해 여러 가지 예방 조치를 취합니다. 전 NTSB 의장 로버트 섬월트(Robert Sumwalt)는 연방 기관이 접근 권한을 극소수의 사람으로 제한하며, 이들은 먼저 로그와 비공개 계약에 서명해야 하고, 휴대전화를 밖에 두고 들어가며, 필기한 메모는 이후 파기된다고 설명했습니다. 오디오 전사는 지속적인 재생과 그룹 토론을 통해 수동으로 이루어집니다.

인터넷에서의 반응

NTSB는 5월 19일과 20일에 걸쳐 진행된 이틀간의 조사 청문회에서 UPS 2976편 추락 사고의 조종석 음성 기록에 대한 서면 전사본을 공개했습니다. 그러나 해당 기관은 소리 신호를 시각적으로 나타낸 스펙트로그램(spectrogram)이 포함된 PDF도 공개했는데, 이는 조종석 음성 기록의 마지막 30초를 보여주었습니다. 이 스펙트로그램 덕분에 여러 사람이 조종실 음성 녹음에서 조종사의 목소리와 다른 소리를 오디오 버전으로 복원할 수 있었으며, 이러한 사례가 X(옛 트위터) 및 레딧(Reddit)과 같은 소셜 미디어 사이트에 등장했습니다. 스펙트로그램 자체도 소셜 미디어에 게시되어 유포되었습니다. 이러한 오디오 재구성은 종종 원래 출판된 그리핀-림(Griffin-Lim) 알고리즘에 의존합니다.

원문 보기

원문 보기 (영어)

Text settings Story text Size Small Standard Large Width * Standard Wide Links Standard Orange * Subscribers only Learn more Minimize to nav Pilots’ voices from the last seconds of a fatal cargo plane crash have been re-created by Internet sleuths using software and AI tools. The spread of reconstructed audio recordings has prompted a US government agency to suspend all public access to its database of civil transportation accidents—because federal law prohibits investigators from publicly releasing audio from cockpit voice recorders. The US National Transportation Safety Board (NTSB) usually shares factual reports and evidence gathered from investigations of aircraft crashes and other civil transportation incidents. But on May 21, the NTSB announced that the online docket system containing such information was “temporarily unavailable” as it reviewed the publicly available materials that had enabled people to re-create cockpit audio recordings from aircraft disasters. “The NTSB is aware that advances in image recognition and computational methods have enabled individuals to reconstruct approximations of cockpit voice recorder audio from sound spectrum imagery released as part of NTSB investigations, including the ongoing investigation of the crash last year of UPS flight 2976 in Louisville, Kentucky,” according to an NTSB statement . “The NTSB does not release cockpit audio recordings.” UPS flight 2976 was a United Parcel Service MD-11F cargo aircraft that crashed shortly after takeoff from Louisville, Kentucky, on November 4, 2025, following a structural failure that led to an engine physically detaching as the aircraft left the ground. The three pilots aboard the aircraft, including a relief pilot, were killed. Another 12 people on the ground were killed, with 23 people being injured. The US Congress enacted a federal law in 1990 prohibiting the NTSB from publicly sharing any part of a cockpit voice or video recorder to protect the privacy of air crews. That law followed airline pilots’ pushback over the controversial TV station airing of a cockpit conversation relating to the August 1988 crash of Delta Air Lines Flight 1141 at Dallas-Fort Worth International Airport. “It’s been an important factor for decades in having airline pilots be willing to have their voices recorded at their normal workplace, day in and day out, with the threat of being killed during their workday,” Ben Berman , an accident investigator and analyst who previously worked for the NTSB and flew a Boeing 737 for United Airlines, told Ars. “People are horrified with the idea of their last moments being made public and used for anything other than accident investigation, which is why the federal law supports that.” As a result, the NTSB takes multiple precautions for securing cockpit voice recorders during investigations of civil aviation incidents. Robert Sumwalt , former chairman of NTSB, has described how the federal agency restricts listening access to a handful of people who must first sign a log and nondisclosure agreement, with cellphones being left outside and handwritten notes being destroyed afterward. Transcription of the audio is done manually through constant replays and group discussions. The Internet does its thing The NTSB released written transcripts of the cockpit audio recordings from the crash of UPS flight 2976 during a two-day investigative hearing held on May 19 and May 20. But the agency also publicly shared a PDF with a spectrogram—a visual representation of sound signals—showing the last 30 seconds of cockpit audio recording. That spectrogram apparently enabled a number of individuals to reconstruct audio versions of the pilots’ voices and other sounds from the cockpit voice recording, with examples appearing on social media sites such as X and Reddit. The spectrogram itself has also been posted and distributed across social media. Such audio re-creations frequently rely on the Griffin-Lim algorithm that was originally published in a 1984 paper by Daniel Griffin and Jae Lim. Updated versions of the method have since been incorporated into speech processing algorithms and implemented through programming languages such as Python. Various Python implementations of the algorithm are available on GitHub . More recently, the widespread availability of AI models capable of retrieving the necessary information and writing code has made it easier for people to re-create cockpit audio recordings. One account on X mentioned taking just 10 minutes with OpenAI’s Codex model to “reconstruct rough audio from the spectrogram” that was initially shared by NTSB. “I was shocked to hear about this, because I hadn’t imagined that it was possible to do something like this,” Berman told Ars. “But all kinds of things are possible now.” Some cockpit audio recordings have been released after NTSB investigations concluded because of lawsuits related to aviation incidents, Berman said. He pointed out that cockpit voice recorder transcripts have also been dramatized through Broadway plays and TV program reenactments. But those examples don’t bother Berman as much as the AI-assisted re-creations of pilots’ voices based on spectrogram waveforms. It may be too late to stop the spread of re-created audio in the case of UPS flight 2976. But the NTSB’s decision to shut down public access to its entire investigations database while it reviews the materials within suggests that the agency is trying to prevent future incidents. The NTSB declined to provide additional comment when contacted by Ars, but said it would share any updates on its website or through its X account . “The NTSB docket system is temporarily unavailable as we examine the scope of the issue and evaluate solutions,” according to the NTSB statement. “We hope to restore access to the docket system as soon as possible.” Jeremy Hsu Tech Reporter Jeremy Hsu Tech Reporter Jeremy Hsu is a reporter exploring a wide range of topics across deep tech and AI. He has previously written for New Scientist, Scientific American, IEEE Spectrum, Wired, Undark Magazine and MIT Tech Review, among many other publications, about topics such as deepfakes, data centers, drones, battery tech, robotics, and GPS jamming. He also has a Master of Arts in Journalism from NYU, and a bachelor's degree from University of Pennsylvania in History and Sociology of Science, with a minor in English. 36 Comments

음성 복원 프라이버시 스펙트로그램 교통안전 AI 규제