The Decoder • 70일 전

구글 I/O: 신규 모델, 24시작동 클라우드 AI 에이전트

IMP

9/10

핵심 요약

구글이 I/O 개발자 컨퍼런스에서 새로운 AI 모델인 제미나이 3.5 플래시와 통합 멀티모달 모델 Omni, 그리고 클라우드에서 24시간 돌아가는 개인형 에이전트 스파크(Spark)를 공개했습니다. 특히 3.5 플래시는 기존 모델을 성능으로 압도하면서도 비용을 절반 수준으로 낮춰 기업의 AI 도입 가성비를 획기적으로 개선할 수 있어 주목됩니다. 또한 개발자를 위한 멀티 에이전트 관리 플랫폼 업데이트와 함께 제미나이 앱의 대대적인 UI 개편도 진행되었습니다.

번역된 본문

구글의 I/O 발표: 새로운 모델, 쉬지 않는 클라우드 에이전트, 그리고 리디자인된 제미나이(Gemini) 앱

작성자: 막시밀리안 슈라이너(Maximilian Schreiner) | 2026년 5월 19일

핵심 요약 구글은 더 빠르고 비용 효율적인 AI 모델인 제미나이 3.5 플래시(Flash)와 비디오, 이미지, 텍스트 생성을 위한 새로운 멀티모달 시스템인 제미나이 옴니(Omni)를 소개했습니다. 또한 클라우드에서 지속적으로 실행되는 개인형 AI 에이전트인 제미나이 스파크(Spark)를 출시했으며, 제미나이 앱의 대대적인 시각적 개편을 단행했습니다. 개발자를 위해 구글은 여러 자율 에이전트를 관리할 수 있는 안티그라비티(Antigravity) 플랫폼을 업데이트했고, AI 검색 기능과 SynthID 워터마크 도구 역시 확대했습니다.

구글은 I/O 개발자 컨퍼런스를 활용해 수많은 새로운 AI 제품을 공개했습니다. 주요 발표 내용은 새로운 모델인 제미나이 3.5 플래시, 멀티모달 모델인 제미나이 옴니, 그리고 클라우드에서 24시간 내내 작동하는 개인형 에이전트인 제미나이 스파크입니다. 또한 제미나이 앱 역시 대대적인 리프레시를 진행했습니다.

구글의 순다르 피차이(Sundar Pichai) CEO에 따르면, 제미나이 3.5 플래시는 약 4개월 전에 출시된 제미나이 3.1 프로를 거의 모든 벤치마크에서 능가합니다. 구글은 특히 경제적으로 가치 있는 작업을 측정하는 GDP Val 벤치마크에서 그 성능 향상 폭이 매우 뚜렷하다고 밝혔습니다.

인공지능 분석 기관 Artificial Analysis의 분석에 따르면, 플래시는 지능 대 속도 그래프의 우측 상단 영역에 위치한 유일한 모델이었으며, 피차이 CEO는 기존 최첨단 모델들보다 4배 빠르다고 설명했습니다. 구글의 자체 에이전트 플랫폼인 안티그라비티에서 최적화된 버전으로 구동할 경우 속도는 무려 12배나 빠르다고 회사측은 주장합니다.

비용 측면에서 구글은 동급 모델 대비 약 1/3에서 절반 수준에 불과하다고 전했습니다. 피차이 CEO는 간단한 계산을 시연했는데, 기업들이 자사 워크로드의 80%를 3.5 플래시와 프로 모델 조합으로 전환할 경우 연간 10억 달러 이상을 절약할 수 있을 것이라고 밝혔습니다. 참고로 제미나이 3.5 프로는 다음 달에 출시될 예정입니다.

구글의 최고 AI 설계자인 코라이 카부크쿨루(Koray Kavukcuoglu)는 3.5 시리즈가 에이전트 작업(Agentic work)을 위해 설계되었다고 강조했습니다. 이 모델은 몇 시간 동안 자율 세션을 유지하고 복잡한 코딩 파이프라인을 독자적으로 실행할 수 있다고 덧붙였습니다. 내부 테스트에서는 이 모델이 제로부터 작동하는 운영 체제를 스스로 구축하기도 했습니다.

구글, 제미나이 옴니(Omni)를 비디오 생성 모델 'Veo'의 멀티모달 후속작으로 포지셔닝

제미나이 옴니(Omni)는 모든 입력을 받아 모든 형태의 출력을 생성하도록 설계된 구글의 새로운 모델입니다. 비디오 생성 기능이 먼저 출시되며, 이후 이미지와 텍스트 생성 기능이 추가될 예정입니다. 순수 텍스트-비디오 변환 모델이었던 기존 Veo와 달리, 옴니는 제미나이 아키텍처를 기반으로 처음부터 멀티모달로 학습되었다고 카부크쿨루는 설명했습니다.

이 모델은 자신이 생성한 출력을 다시 입력으로 사용하는 피드백 루프가 가능하여 반복적인 편집을 수행할 수 있습니다. 사용자는 자신의 비디오를 업로드하고 캐릭터를 교체하거나 스타일을 변경할 수 있습니다. 옴니가 Veo를 대체하는 것이냐는 질문에 카부크쿨루는 옴니가 Veo의 확장된 일반화(Generalization) 버전이라고 답했습니다. 그는 이제 진정한 멀티모달을 향해 일관되게 나아가고 있다고 덧붙였습니다.

빠르고 가벼운 첫 번째 변형인 '옴니 플래시'는 화요일부터 제미나이 앱, Flow, 유튜브 쇼츠(YouTube Shorts)를 통해 구글 AI+, 프로(Pro), 울트라(Ultra) 구독자에게 제공됩니다. 구글에 따르면 향후 API 버전과 더 강력한 성능의 옴니 프로(Omni Pro)도 출시될 예정입니다. 모든 생성 콘텐츠에는 구글의 보이지 않는 워터마크인 SynthID가 삽입됩니다.

구글 "제미나이 스파크(Spark)는 노트북을 닫아도 계속 작동합니다"

제미나이 스파크(Spark)는 일반 사용자를 위한 구글의 개인형 에이전트입니다. 제미나이 앱과 AI 스튜디오 팀을 이끄는 조시 우드워드(Josh Woodward)에 따르면, 이 에이전트는 구글 클라우드의 전용 가상 머신에서 실행되며 24시간 내내 대기 상태를 유지하고, 사용자의 기기가 꺼져 있어도 백그라운드에서 계속 작업을 수행합니다.

스파크는 코딩 작업을 위해 안티그라비티 파이프라인이 통합된 제미나이 3.5를 사용합니다. 지메일(Gmail), 문서(Docs) 등 기타 워크스페이스(Workspace) 제품과의 연동 기능도 이번 출시에 포함됩니다. 회사에 따르면 향후 몇 주 내로 서드파티 서비스와 연결할 수 있는 MCP(Model Context Protocol) 연결 기능이 추가될 예정입니다.

이후 크롬(Chrome) 브라우저와의 연동이 뒤를 이을 것이며, 안드로이드에서는 화면 상단에 에이전트가 현재 수행 중인 작업을 보여주는 '안드로이드 헤일로(Android Halo)'라는 새로운 UI 요소가 도입됩니다. 안드로이드 헤일로는 올해 하반기 출시 예정입니다.

원문 보기

원문 보기 (영어)

Google's I/O announcements: new models, a cloud agent that never sleeps, and a redesigned Gemini app Maximilian Schreiner View the LinkedIn Profile of Maximilian Schreiner May 19, 2026 Google Key Points Google has introduced Gemini 3.5 Flash, a faster and more cost-effective AI model, alongside Gemini Omni, a new multimodal system for video, image, and text generation. The company also launched Gemini Spark, a personal AI agent that runs continuously in the cloud, and rolled out a major visual redesign for the Gemini app. For developers, Google updated its Antigravity platform to manage multiple autonomous agents, while also expanding its AI search features and the SynthID watermarking tool. Ask about this article… Search Google used its I/O developer conference to unveil a wave of new AI products. The highlights: a new model called Gemini 3.5 Flash, a multimodal model called Gemini Omni, and a personal agent named Gemini Spark that runs around the clock in the cloud. The Gemini app also gets a major refresh. According to Google CEO Sundar Pichai, Gemini 3.5 Flash beats the roughly four-month-old Gemini 3.1 Pro on nearly every benchmark. The jump is especially sharp on GDP Val, a benchmark for economically relevant tasks, Google says. In an analysis by Artificial Analysis, Flash was the only model in the upper-right quadrant of intelligence versus speed—four times faster than other frontier models, according to Pichai. An optimized version running on Google's in-house agent platform Antigravity is even twelve times faster, the company claims. Costs come in at roughly a third to half of comparable models, according to Google. Pichai did the math: companies that shift 80 percent of their workloads to a mix of 3.5 Flash and Pro could save over a billion dollars a year. Gemini 3.5 Pro is expected next month. Ad Chief AI Architect Koray Kavukcuoglu stressed that the 3.5 series was built for agentic work. The model can sustain autonomous sessions for several hours and run complex coding pipelines on its own, he said. Internally, Google had it build a working operating system from scratch as a test. Ad DEC_D_Incontent-1 Google positions Gemini Omni as Veo's multimodal successor Gemini Omni is Google's new model designed to produce any output from any input. Video ships first, with image and text coming later. Unlike Veo, which is a pure text-to-video model, Omni is built on the Gemini architecture and trained as multimodal from the ground up, according to Kavukcuoglu. The model can take its own generated output and feed it back in as input, allowing iterative editing, Google says. Users can upload their own videos, swap out characters, or change the style. Asked whether Omni replaces Veo, Kavukcuoglu said Omni is a generalization of Veo. The path now leads consistently toward true multimodality, he added. Ad The first, faster variant named Omni Flash launches Tuesday for Google AI+, Pro, and Ultra subscribers in the Gemini app, in Flow, and in YouTube Shorts. An API version is coming, along with a more powerful Omni Pro, according to Google. All generated content carries Google's invisible SynthID watermark. Google says Gemini Spark keeps working even when your laptop is closed Gemini Spark is Google's personal agent for end users. According to Josh Woodward, who leads the Gemini app and AI Studio teams, it runs on dedicated virtual machines in Google Cloud, stays available 24/7, and keeps working in the background even when the user's device is off. Spark uses Gemini 3.5 with the full Antigravity pipeline for coding tasks, Google says. Ad DEC_D_Incontent-2 Integration with Gmail, Docs, and other Workspace products is part of the launch. MCP connections to third-party services are coming in the next few weeks, according to the company. Chrome integration follows later, and on Android, a new UI surface called Android Halo will show what the agent is doing right at the top of the screen. Android Halo is expected later this year and will work not just with Spark but with other supported agents too, Google says. Devices running Gemini Nano will get extra features. Ad Spark rolls out this week to select testers and next week as a beta for Ultra subscribers in the US. Google is introducing a new Ultra plan at $100 per month and dropping the previous top-tier Ultra plan from $250 to $200. Alongside Spark, a simpler agent called Daily Brief is coming to the app. It analyzes and prioritizes emails, calendar entries, and tasks overnight, according to Google. The feature builds on the Labs experiment "CC" from December. Google says the Gemini app is getting its biggest visual overhaul yet The Gemini app itself is getting what's probably the most visible update for end users. Google redesigned the app "from the ground up," according to Woodward. The new design language is called "Neural Expressive" and leans on fluid animations, new typography, and haptic feedback. Answers no longer show up as walls of text, Google says. Key information appears bold and at the top; scrolling reveals embedded images, timelines, or visualizations. Gemini Live now opens directly inline with no more switching between modes. The rollout starts today globally on Android, iOS, and the web, according to Google. Gemini 3.5 Flash and Gemini Omni ship with the update. Through Omni, users can upload their own photos and videos from their gallery, apply pre-built templates, or drop themselves into generated scenes via an AI avatar, Google says. The app is now available in over 230 countries and more than 70 languages, according to Pichai. Monthly users grew from 400 million a year ago to over 900 million, and daily queries increased sevenfold, he said. Google is also releasing a Gemini desktop app for macOS, available for download now. This summer, Spark will move into the desktop app, where it can access local files and automate workflows on the machine, according to the company. New voice features are planned for the summer too, turning free-form dictation directly into polished text drafts. Google says Antigravity 2.0 gives developers a full agent orchestration platform The developer platform Antigravity is getting a major update. Antigravity 2.0 is a standalone desktop app where users can orchestrate multiple autonomous agents in parallel, according to Google. It also comes with a CLI for terminal users and an SDK that provides the same agent harness Google uses to build its own products. Internal growth has been massive, according to Pichai: in March, Google processed 500 billion tokens per day. That number now tops three trillion per day, he said. Google also introduced Codemender, a tool that uses Gemini reasoning to find vulnerabilities in code and patch them automatically, the company claims. Select experts get API access first, with a broader release to follow. Google expands search, shopping, and watermarks Search chief Liz Reid announced that AI Mode is switching to Gemini 3.5 Flash. The feature hit over a billion monthly users in its first year, according to Google; AI Overviews sits at 2.5 billion. The search box is being redesigned to accept longer, multimodal queries. In the coming months, users can build their own mini-apps within Search initially for AI Pro and Ultra subscribers in the US, Google says. On the shopping side, Google is expanding the Universal Commerce Protocol (UCP) to cover hotels and delivery services. Amazon, Meta, Microsoft, Salesforce, and Stripe have joined the UCP Tech Council, according to the company. The Agent Payments Protocol (AP2) is designed to make sure agents only execute purchases within clearly defined limits. On the transparency front, Google is extending SynthID to Search and Chrome. A right-click in the browser will let users check whether an image was AI-generated or edited, Google says. OpenAI, Kakao, and Levin Labs are joining the watermark standard as new partners, following NVIDIA, which signed on last year. Asked whether competitors like

제미나이3.5 AI에이전트 멀티모달 구글클라우드 개발자도구