TechCrunch AI • 70일 전

구글 지니(Genie), 스트리트뷰로 실제 거리 시뮬레이션 가능

IMP

8/10

핵심 요약

구글 딥마인드가 AI 세계 모델인 '프로젝트 지니(Project Genie)'에 구글 스트리트뷰 데이터를 통합하여, 실제 장소를 기반으로 날씨 등 다양한 환경을 조성해 상호작용할 수 있는 3D 환경 시뮬레이션을 선보였습니다. 이 기술은 로봇 훈련 및 웨이모(Waymo) 자율주행 시뮬레이션에 활용되어 극희귀 상황 대비나 신규 지역 진출을 돕는 등 산업적 가치가 높습니다. 아직 완벽한 물리 법칙을 이해하거나 사실적인 수준에는 미치지 못하지만, AI 기반 공간 시뮬레이션의 가능성을 확장했다는 점에서 매우 중요합니다.

번역된 본문

우리 모두는 구글 지도에서 스트리트뷰를 켜서 어릴 적 살았던 집의 모습을 친구에게 보여주거나, 파리의 거리에 그 작은 노란 사람 아이콘을 올려놓고 우리가 예약한 호텔이 멋진 동네에 있는지 확인해 본 경험이 있을 것입니다. 이제 이런 경험을 더 몰입적이고 인터랙티브한 방식으로, 즉 거리와 주변 환경을 실제로 시뮬레이션하고 심지어 날씨를 조절하거나 영화 '투모로우' 같은 시나리오에서 그곳이 어떻게 보일지 확인할 수 있다고 상상해 보십시오. 이것이 바로 구글의 최신 통합 목표 중 하나입니다.

오늘부터 구글 딥마인드(Google DeepMind)는 다양하고 인터랙티브한 환경을 생성할 수 있는 회사의 범용 세계 모델(World Model)인 '프로젝트 지니(Project Genie)'에 스트리트뷰를 연동하기 시작했습니다. 이 새로운 기능은 구글 I/O 개발자 컨퍼런스 기간 중에 공개되었습니다.

딥마인드의 오픈엔디드니스(Open-endedness) 팀 소속 연구원인 잭 파커-홀더(Jack Parker-Holder)는 테크크런치(TechCrunch)와의 인터뷰에서 "이 기능은 에이전트(Agent) 및 로봇 공학 사용 사례뿐만 아니라 사람들이 즐기는 용도로도 매우 강력하며, 이는 항상 지니(Genie)의 기본 전제였다"고 말했습니다. 그는 햇빛을 보기 힘든 런던에 새로운 로봇이 배치된다면 어떨지에 대한 예를 들었습니다. 파커-홀더는 지니가 빅토리아 양식 주택에 햇빛이 비치는 귀한 순간을 시뮬레이션하여, 실제 햇빛이 비칠 때 로봇이 충격을 받지 않도록 미리 훈련할 수 있다고 설명했습니다.

그는 이어서 "동시에 '나는 뉴욕시에 갈 건데 지금은 이맘때가 아니야. 눈이 올 거야. 눈 오는 날 그 거리가 어떻게 보일지 보고 싶어'라고 말할 수도 있을 것"이라고 덧붙였습니다.

구글은 카메라가 장착된 자동차나 '트래커 백팩'을 착용한 사람들을 통해 지난 20년 동안 스트리트뷰 데이터를 수집해 왔습니다. 이 기술 거인은 7개 대륙, 110개국에 걸쳐 2800억 장 이상의 이미지를 수집했습니다.

잭은 "스트리트뷰를 통해 전 세계 상당 부분의 이미지를 보유하고 있다"며 "이 풍부한 실제 세계의 정보 및 데이터 출처와 세계를 시뮬레이션하는 능력을 결합하는 것이 잠재적으로 얼마나 강력해질 수 있는지 상상해 볼 수 있다"고 말했습니다.

구글은 지난 8월 최신 세계 모델인 '지니 3(Genie 3)'를 연구 프리뷰용으로 공개했고, 1월에는 미국의 구글 AI 울트라(Ultra) 구독자들에게 이 도구에 대한 접근 권한을 열어주어 고객이 텍스트 프롬프트나 이미지로 인터랙티브한 게임 세계를 만들 수 있도록 했습니다. 궁극적인 목표는 지니를 교육적 경험, 게임, 로봇 훈련에 사용하는 것입니다.

지니 3는 이미 웨이모(Waymo)의 시뮬레이터 중 하나를 구동하는 데 사용되어 자율주행차가 토네이도나 우연한 코끼리 조우와 같은 '극히 희귀한 상황'에서도 훈련할 수 있도록 돕고 있습니다. 여기에 스트리트뷰 데이터를 추가하면 웨이모가 전 세계 더 많은 도시에 상륙할 준비를 하는 데 큰 도움이 될 수 있습니다. 웨이모는 자체적인 시뮬레이터를 활용해 미국 11개 도시로 서비스를 확장하고 여러 다른 도시에서도 AI 운전자를 테스트해 왔습니다.

파커-홀더는 지니만의 차이점에 대해, 기존의 시뮬레이션 데이터는 모두 자동차 관점에서 출발한 반면 스트리트뷰를 활용하면 실제 장소를 기반으로 세계를 시뮬레이션할 수 있을 뿐만 아니라 관점을 사람이나 로봇 같은 다른 유형의 에이전트로 전환할 수 있다고 설명했습니다.

구글은 오늘부터 미국의 일부 울트라 사용자를 대상으로 지니에서 스트리트뷰 기능을 출시하며, 앞으로 점진적으로 대규모로 접근 권한을 확대할 예정입니다. 회사 측에 따르면 전 세계 울트라 사용자들은 앞으로 몇 주 내에 이 기능을 이용할 수 있게 됩니다.

딥마인드의 제품 관리자인 디에고 리바스(Diego Rivas)는 연구진의 목표가 이 새로운 기능을 가능한 한 많은 사람들에게 제공하는 것이라고 밝혔습니다. 하지만 그는 스트리트뷰는 물론 지니 전반이 여전히 실험 단계에 있으므로 정확도 측면에서 개선해야 할 부분이 많다고 경고했습니다. 구글팀이 제가 과거에 거주했던 동네의 수중 시뮬레이션 등을 시연해 준 샘플들을 보면 결과는 인상적이고 알아볼 수 있는 수준이었지만, 여전히 사진과 같은 실사 수준이라기보다는 비디오 게임 수준의 퀄리티였습니다.

또한 이 모델들은 아직 물리학적 인지(Physics-aware) 능력이 없습니다. 즉, 인과관계를 아직 이해하지 못한다는 뜻입니다. 예를 들어, 눈 쌓인 조슈아 트리(Joshua Tree)에서 여성이 달려가는 시뮬레이션에서 그녀는 선인장과 관목을 그대로 통과해 달렸습니다. 이를 이제 완벽한 텍스트를 생성하는 구글의 이미지 생성기 '나노 바나나(Nano Banana)'나 종이배가 떠내려가는 것을 이해하는 비디오 생성기 'Veo'와 비교해 보십시오.

원문 보기

원문 보기 (영어)

We’ve all pulled up Street View on Google Maps to show a friend what our childhood home looked like, or dropped that little person icon onto the streets of Paris to see if we booked a hotel in a cool neighborhood. Imagine being able to do that, but in a more immersive, interactive way that allows you to really simulate the street and its environs, and even do things like adjust the weather or see what it would look like in a “Day After Tomorrow” scenario. That’s one of the goals of Google’s latest integration. Starting today, Google DeepMind is connecting Street View to Project Genie, the company’s general-purpose world model that can generate diverse, interactive environments. The new feature launched during the Google I/O developer conference. “It’s really powerful for both the agent [and robotics] use case and for humans to play with, and that’s always been the thesis of Genie,” Jack Parker-Holder, a research scientist on DeepMind’s open-endedness team, told TechCrunch. He gave the example of a new robot being deployed in London, which rarely sees the sun. Genie could, Parker-Holder says, simulate those scarce occasions when the sun glints off the Victorian housing, so the rays don't shock the robot when it happens. Loading the player… “Simultaneously, you might say, ‘I’m going to New York City, but not this time of year,’” he continued. “‘It’s going to be snowy. I want to see what that block looks like in the snow.’” Google has been collecting Street View data for 20 years via cars with cameras and individuals strapped with “tracker backpacks.” The tech giant has collected north of 280 billion images across 110 countries and seven continents. “With Street View, we have imagery from a large quantity of the world,” Jack said. “You can imagine how potentially powerful it is to combine this rich source of real-world information and data with an ability to simulate worlds.” Google released its latest world model Genie 3 for research preview last August and opened up access to the tool to Google AI Ultra subscribers in the U.S. in January, allowing customers to create interactive game worlds from text prompts or images. The goal is to use Genie for educational experiences, gaming, and robotics training. Genie 3 is already helping to power one of Waymo’s simulators to train its self-driving cars on “exceedingly rare events” like tornadoes or casual elephant encounters. Adding Street View data to that could help Waymo prepare to launch in more cities around the globe. Waymo has its own simulator that it relied on to scale to 11 U.S. cities and test its AI driver in several more. The difference with Genie, says Parker-Holder, is that those are all from the car’s point of view. Street View allows for not only simulating a world anchored to a real place, but also shifting the point of view to other types of agents, like a human or a robot. Google is launching Street View in Genie to some Ultra users in the United States starting today, with access rolling out at scale over time. Global Ultra users will gain access over the next few weeks, per the company. The researchers’ goal is to put this new capability into as many hands as possible, per Diego Rivas, a product manager at DeepMind. He cautioned that Street View in particular and Genie in general is still an experiment, so there’s much to improve upon in terms of accuracy. In the samples the Google team showed me — including an underwater simulation of a neighborhood I used to live in — the results are impressive and recognizable, but still video game quality rather than photorealistic. The models are also not yet physics-aware, meaning they don’t yet understand cause and effect. For example, in a simulation of a woman running through a snowy Joshua Tree, she ran right through cacti and bushes. Compare that to, say, Google’s image generator Nano Banana — which can now generate perfect text in infographics — or its video generator Veo — which understands that paper boats drift on water currents, smoke disperses into the air, and fabric drapes over forms. Physics isn’t hard-coded into these models; they learn it intuitively over time through passive observation, as a living being would. “I think for this kind of model, it’s maybe six to 12 months behind video in terms of the accuracy and quality, so I think it’s something we will solve,” Parker-Holder said. Jonathan Herbert, director of Google Maps who started on the Street View team as an intern 12 years ago, said that Genie can’t yet create a faithful reconstruction of a street. He thinks the real breakthrough is the AI’s spatial continuity. If you turn 360 degrees, the AI correctly remembers and simulates the environment behind you. From that point on, the model can build a new environment on top of that. “We have long thought about how we can build out the best and richest model of the world on top of Street View data," Herbert said. “It’s definitely been an idea of ours to use Maps Data in new ways and for new kinds of AI research for a pretty long time.” Topics AI , genie 3 , Google , Google I/O , google io 2026 , Google Maps , Robotics , Street View When you purchase through links in our articles, we may earn a small commission . This doesn’t affect our editorial independence. Rebecca Bellan Senior Reporter Rebecca Bellan is a senior reporter at TechCrunch where she covers the business, policy, and emerging trends shaping artificial intelligence. Her work has also appeared in Forbes, Bloomberg, The Atlantic, The Daily Beast, and other publications. You can contact or verify outreach from Rebecca by emailing rebecca.bellan@techcrunch.com or via encrypted message at rebeccabellan.491 on Signal. View Bio May 27 Athens, Greece StrictlyVC Athens is up next. Hear unfiltered insights straight from Europe’s tech leaders and connect with the people shaping what’s ahead. Lock in your spot before it’s gone. REGISTER NOW Most Popular Elon Musk has lost his lawsuit against Sam Altman and OpenAI Tim Fernholz Users turn to jailbreaking their older Kindles as Amazon ends support Lauren Forristal OpenAI launches ChatGPT for personal finance, will let you connect bank accounts Ivan Mehta US orders travelers on Air Force One to throw away gifts, pins, and burner phones after China trip Lorenzo Franceschi-Bicchierai OpenAI is reportedly preparing legal action against Apple; it wouldn't be the first partner to feel burned Connie Loizos How to turn off Instagram's new Instants feature and retract photos you accidentally shared Aisha Malik Musk’s xAI is running nearly 50 gas turbines unchecked at its Mississippi data center Tim De Chant

구글 딥마인스 프로젝트 지니 월드 모델 스트리트뷰 자율주행