回顾2026年I/O大会的12个重要时刻

qimuai 发布于 阅读:3 一手编译

回顾2026年I/O大会的12个重要时刻

内容来源:https://blog.google/innovation-and-ai/technology/ai/io-2026-keynote-moment-videos/

内容总结:

谷歌I/O 2026大会12大亮点回顾:全能AI模型、智能搜索与个人AI代理全面升级

在2026年谷歌I/O开发者大会上,一系列重磅技术突破与产品更新成为焦点。以下是本届大会最受瞩目的12项发布:

1. Gemini Omni全能模型
全新多模态模型Gemini Omni可基于任意输入(率先支持视频)生成内容。用户可混合图像、音频、视频和文本作为输入,生成基于Gemini真实世界知识的高质量视频,并通过对话直接编辑视频。首款产品Gemini Omni Flash已面向全球Google AI Plus、Pro和Ultra订阅用户开放,YouTube Shorts和Create App用户可免费使用。

2. Gemini 3.5 Flash前沿模型
新一代Gemini 3.5系列模型首款产品Flash,在智能代理和编程任务上表现卓越,擅长处理复杂长周期任务。已通过Google Antigravity、AI Studio等平台全面上线,搜索AI模式及Gemini应用全球用户均可使用。3.5 Pro版本预计下月推出。

3. 搜索信息代理
搜索正式进入代理时代。用户可在搜索中创建、定制和管理多个AI代理,它们能7×24小时后台智能分析网页、新闻和社交内容,实时追踪金融、购物、体育等信息,并在适当时刻推送精准更新。今夏起面向Pro和Ultra订阅用户开放。

4. 搜索驱动的生成式UI体验
借助Antigravity技术,搜索现可动态生成定制化交互界面,包括图文混排、可操作图表、互动视频及完整小应用(如婚礼策划工具、搬家管理面板)。今夏起免费开放基础功能,高级定制服务率先在美国Pro/Ultra用户中测试。

5. 每日简报代理
Gemini应用新增“每日简报”功能,自动整合Gmail紧急邮件、日历事件及待办事项,生成个性化晨间简报,并主动推荐后续步骤。用户可通过点赞/点踩逐步优化推送内容。面向美国18岁以上AI订阅用户开放。

6. 通用购物车
智能购物车“Universal Cart”成为跨平台购物中枢,用户可在搜索、Gemini对话、YouTube甚至Gmail中一键添加商品。购物车自动追踪优惠、价格历史,并在补货时发出提醒。今夏在美国搜索和Gemini应用上线,后续支持YouTube和Gmail。

7. 神经表达设计语言
Gemini应用界面全面革新,采用Neural Expressive设计语言,包含流畅动画、鲜艳配色、新字体和触觉反馈。模型回复告别纯文本,转而实时设计包含丰富图像、互动时间轴、视频旁白和动态图表的定制化内容。安卓、iOS及网页版已全量推送。

8. Gemini Spark个人代理
全天候运行的AI代理Spark可管理数字生活、代行操作,集成Gmail、Docs、Slides等谷歌工具,支持云端后台持续运行。用户可设定重复任务、教授新技能、创建完整工作流。重大操作(如支付、发邮件)需先征得用户同意。已面向美国Ultra订阅用户公测。

9. macOS版Gemini应用
Mac版Gemini将引入Spark代理,支持本地文件操作和桌面端工作流自动化。同时创新语音交互,能实时将口语转化为精确草稿,并自动填充至光标位置。应用已开放下载,Spark和语音功能今夏上线。

10. 智能眼镜
Android XR平台推出两类智能眼镜:音频眼镜(播报信息、通话、点咖啡)和显示眼镜(叠加实时信息)。首批两款设计将于今秋发布,支持免提听音乐、拍照、调用手机应用等操作。

11. SynthID内容溯源
AI水印技术SynthID已标记超1000亿张图像/视频和6万年音频资产,OpenAI等企业正采用该技术。新增AI检测API上线谷歌云平台。同时扩大“内容凭证”覆盖范围,Pixel 10为首款原生支持图像凭证的手机,后续将扩展至视频,Gemini应用、搜索和Chrome也将加入内容来源验证功能。

12. Gemini for Science科研工具
全新科学工具集Gemini for Science整合深度推理与研究能力,可通过Labs平台体验新实验,并通过Science Skills将Antigravity等代理平台连接至30多个生命科学数据库和工具。代码已开源至GitHub。

中文翻译:

回顾2026年谷歌I/O大会的12大精彩时刻
在2026年谷歌I/O大会上,我们最宏大、最大胆的新成果占据了舞台中心。我们宣布了技术突破,例如Gemini Omni能够以任意输入(从视频开始)创造一切。同时,我们还分享了旨在助力日常生活的产品更新,比如全新的智能搜索框,它允许你跨模态进行搜索,使用文本、图片、文件、视频或Chrome标签页作为输入。(此外,还有大量其他I/O重大发布,远不止于此!)
若你错过了,以下是我们今年I/O主题演讲中一些最激动人心的发布。

  1. Gemini Omni
    Gemini Omni是我们的新模型,能够以任意输入(从视频开始)创造一切。借助Omni,你可以将图像、音频、视频和文本作为输入组合,并基于Gemini对现实世界的知识生成高质量视频。你还能通过对话轻松编辑视频。
    首先,我们推出Omni系列的首个模型:Gemini Omni Flash。Gemini Omni Flash正在通过Gemini应用和Google Flow向全球所有Google AI Plus、Pro和Ultra订阅用户推送。同时,它也在YouTube Shorts和YouTube Create应用中免费向用户开放。

  2. Gemini 3.5 Flash
    我们全新的Gemini 3.5模型系列将前沿智能与行动能力相结合。该系列的首发型号是Gemini 3.5 Flash,它为智能体和编程提供了领先性能,尤其在处理复杂的长周期任务方面表现出色,能带来实际应用价值。
    Gemini 3.5 Flash已通过Google Antigravity、Google AI Studio和Android Studio中的Gemini API、Gemini企业智能体平台以及Gemini企业版全面可用。它还在搜索的AI模式中向所有人开放,并正在全球范围内的Gemini应用中逐步推送。我们也在紧锣密鼓地开发Gemini 3.5 Pro,它已在内部使用,我们期待下个月向用户推出。

  3. 搜索中的信息智能体
    我们正迈入搜索智能体的时代。你可以在搜索中轻松创建、定制和管理多个AI智能体,以处理各种任务。我们从信息智能体开始入手,它们能在后台全天候运行,智能地分析整个网络(如博客、新闻网站和社交媒体帖子),并结合最新数据(如金融、购物和体育的实时信息)。信息智能体将帮助你及时了解最重要的事情,在恰当时刻向你发送包含所需内容的全面更新,并提供便于进一步探索网络的有用链接。
    信息智能体将于今夏开始推送,首先面向Google AI Pro和Ultra订阅用户。只需在搜索中添加“保持更新”即可创建信息智能体,并通过搜索AI模式中的侧边栏查看活跃的智能体。

  4. 搜索中由Google Antigravity驱动的体验
    我们将Antigravity和Gemini 3.5 Flash的智能编程能力直接引入搜索,使搜索能根据你的问题即时打造完全定制化的理想格式。你将获得动态布局、交互式视觉效果以及完整体验,全部为你量身定制。这些生成式UI功能将于今夏向所有搜索用户免费开放。
    有些项目并非一次性问题,而是持续性任务。借助Antigravity,搜索还能为你编写完整的定制化体验,如工具、仪表盘或追踪器。这就像用搜索构建你自己的迷你应用。它们特别适合那些需要反复查看的长期任务,比如规划婚礼或管理搬家。未来几个月内,你将能在搜索中直接使用Antigravity构建定制体验,首先面向美国地区的Google AI Pro和Ultra订阅用户。

  5. 每日简报
    Gemini应用中的每日简报是一个新智能体,它能为你提供个性化的晨间简报,整理好你开启一天所需了解的内容。这份个性化摘要旨在成为你每天清晨的第一站。
    一旦你选择启用,Gemini会在后台跨你连接的各类应用运行。它从你的Gmail收件箱中收集紧急更新,追踪日历中的即将发生的事件,并将相关的后续细节整理成易于浏览的简报。这远不止是简单的摘要。每日简报会根据你的具体目标主动整理和排序,甚至建议下一步行动。你可以通过给反馈点“赞”或“踩”来轻松引导它。
    每日简报正在Gemini应用中向所有Google AI订阅用户(18岁以上)推送,首先从美国开始。要使用每日简报,Google AI订阅用户必须选择连接其Google应用。

  6. 通用购物车
    我们的全新通用购物车是一个真正智能的购物车,也是你在Google上购物的新中心。它跨商家和跨服务运行,因此你可以在浏览搜索、与Gemini聊天、观看YouTube或阅读Gmail时,随时将商品加入购物车。一旦添加商品,购物车就会在后台为你工作。它会寻找优惠和降价信息,提供价格历史洞察,并在商品重新上架时提醒你。
    通用购物车将于今夏在美国的搜索和Gemini应用中推出,随后将覆盖YouTube和Gmail。

  7. Neural Expressive
    我们利用Neural Expressive彻底重新设计了Gemini体验。这是一种令人惊艳的新设计语言,从你打开Gemini应用或访问网站时就能感受到。界面包含流畅的动画、鲜艳的色彩、全新的字体以及遍布各处的触觉反馈。模型响应是Neural Expressive真正展现魅力的地方。Gemini现在不再是单调的文本墙,而是实时设计定制化响应——融入丰富的图像、互动时间线、带旁白的视频和动态图形。
    Neural Expressive正在Android、iOS和网页版的Gemini应用中向所有人推送。

  8. Gemini Spark
    这是Gemini应用中的一个全天候个人AI智能体,能帮助你在数字生活中导航,代表你执行操作,并完全听从你的指令。它与Gmail、Docs、Slides等Google工具套件集成。由于它是云端智能体,即使你合上笔记本电脑或锁定手机,它也能在后台继续工作。借助Spark,你可以设置重复任务、教它新技能并创建完整工作流程。你可以自行选择是否启用它以及它连接哪些应用,并且它在执行高风险操作(如花钱或发送邮件)前会先征得你的同意。
    Gemini Spark正在向可信测试者推送,同时作为Beta版面向美国地区的Google AI Ultra订阅用户推出。

  9. macOS版Gemini应用
    我们正在为macOS版Gemini应用进行重大更新。今夏,我们将把Gemini Spark引入桌面版Gemini应用,使其能够处理涉及本地文件的任务,并在桌面上自动化工作流程。
    我们还在macOS应用中创新语音体验,类似于我们在Android Show上预览的功能。你无需担心思考时脱口而出的“嗯”或“那个”。利用屏幕上的上下文,Gemini能将你自由的语音转化为精确的草稿,并即时调整文本格式以捕捉你的意图,直接呈现在光标位置。
    macOS应用现已可供所有用户下载,Gemini Spark和新语音功能将于今年夏末推送。

  10. 智能眼镜
    Android XR的下一个重要里程碑是智能眼镜。将有两种类型的智能眼镜:音频眼镜,可在耳边提供语音帮助;以及显示眼镜,能在你需要时立即向你显示所需信息。
    音频眼镜将于今年秋季晚些时候推出。在2026年I/O大会上,我们展示了首批两款设计。这些眼镜让你无需动手、无需低头,即可听音乐、拍照、打电话、下日常咖啡订单或访问手机应用,无需从口袋掏出手机。

  11. SynthID
    三年前,我们推出了SynthID,这是业界领先的数字水印技术,能将不可察觉的信号嵌入AI生成的内容中。自那时起,我们将SynthID集成到生成式媒体模型和产品中,为超过1000亿张图片和视频以及6万年的音频资产添加了水印,并将SynthID验证功能引入Gemini应用。现在,我们正在将此验证能力扩展到搜索,并在未来几周内扩展到Chrome。
    OpenAI、Kakao和ElevenLabs等公司正在采用SynthID为其更多AI生成内容添加水印。我们还在Google Cloud的Gemini企业智能体平台上推出新的AI内容检测API,为企业提供强大的工具来识别其运营中的合成媒体。
    此外,我们正在跨产品扩展内容凭证。Pixel 10是首款在其原生相机应用中为图片提供内容凭证的智能手机,我们将在未来几周内将此技术扩展到Pixel 8、9和10手机的视频中。我们还在未来几个月内将内容凭证验证添加到Gemini应用、搜索和Chrome中。这将向你显示内容的来源是AI还是相机,以及是否使用生成式AI工具进行过编辑。

  12. 面向科学的Gemini
    面向科学的Gemini是一套全新的科学工具和实验集合,旨在扩大科学探索的规模和精度。它基于Gemini的深度推理和研究能力以及Deep Think和Deep Research,包括在Labs上的新实验以及科学技能,后者可将Google Antigravity等智能体平台连接到30多个主要生命科学数据库和工具。
    你可以在Google Labs上表达尝试面向科学的Gemini实验的兴趣,而科学技能现已在GitHub上以及直接通过Google Antigravity提供。

英文来源:

Catch up on 12 major I/O 2026 moments
Our biggest, boldest new developments took center stage at Google I/O 2026. We announced technical breakthroughs, like Gemini Omni’s ability to create anything from any input, starting with video. And we shared product updates to help you day-to-day, like the brand new, intelligent Search box that will let you search across modalities, using text, images, files, videos or Chrome tabs as inputs. (And with plenty of other big I/O announcements, there’s a lot more where that came from!)
In case you missed it, here are some of our most exciting I/O keynote reveals this year.

  1. Gemini Omni
    Gemini Omni is our new model that can create anything from any input — starting with video. With Omni, you can combine images, audio, video and text as input and generate high-quality videos grounded in Gemini's real-world knowledge. You can also easily edit your videos through conversation.
    First, we’re launching the first model in the Omni family: Gemini Omni Flash. Gemini Omni Flash is rolling out to all Google AI Plus, Pro and Ultra subscribers globally through the Gemini app and Google Flow. It’s also rolling out at no cost to users on YouTube Shorts and YouTube Create App.
  2. Gemini 3.5 Flash
    Our new Gemini 3.5 family of models combines frontier intelligence with action. We’re kicking off the series by releasing Gemini 3.5 Flash, which delivers frontier performance for agents and coding, excelling at complex long-horizon tasks that deliver real-world utility.
    Gemini 3.5 Flash is generally available via Google Antigravity, the Gemini API in Google AI Studio and Android Studio, Gemini Enterprise Agent Platform and Gemini Enterprise. It’s also available for everyone in AI Mode in Search and now rolling out to everyone globally in the Gemini app. We’re also hard at work on Gemini 3.5 Pro. It’s already being used internally, and we look forward to rolling it out next month.
  3. Information agents in Search
    We’re entering the era of Search agents, where you can easily create, customize and manage multiple AI agents for your many tasks, right in Search. We’re starting with information agents, which operate in the background, 24/7, to intelligently reason across the web, like blogs, news sites and social posts (plus our freshest data, such as real-time info on finance, shopping and sports). Information agents will help you stay updated on whatever matters most to you, sending a comprehensive update with exactly what you need at exactly the right moment, along with helpful links to explore further on the web.
    Information agents are rolling out this summer, starting first with Google AI Pro and Ultra subscribers. Simply add “keep me updated” to your search to create an information agent, and view your active agents via the side panel in AI Mode in Search.
  4. Google Antigravity-powered experiences in Search
    We’re bringing Antigravity and the agentic coding capabilities of Gemini 3.5 Flash right into Search, so Search can build you the ideal format exactly for your question, completely custom, on the fly. You can get dynamic layouts, interactive visuals and entire experiences, all created just for you. These generative UI capabilities will be available for everyone in Search this summer, free of charge.
    Some projects aren’t one-off questions — they're ongoing tasks. Also with Antigravity, Search will also code entire custom experiences, like tools, dashboards or trackers, just for you. It’s like building your own mini apps with Search. They’re especially awesome for those long-running tasks where you want to keep coming back, like planning a wedding or managing your home move. You’ll be able to build custom experiences with Antigravity, right in Search in the coming months, starting first for Google AI Pro and Ultra subscribers in the U.S.
  5. Daily Brief
    Daily Brief in the Gemini app is a new agent that gives you a personalized morning brief and organizes exactly what you need to know to start your day. This personalized digest is designed to be your first stop every morning.
    Once you opt in, Gemini works across your connected apps in the background. It gathers urgent updates from your Gmail inbox, tracks upcoming events from your Calendar and compiles relevant follow-up details into a skimmable briefing. It goes far beyond a simple summary. Daily Brief actively organizes and prioritizes based on your specific goals, even suggesting immediate next steps. You can easily steer it by giving responses a quick thumbs up or down over time.
    Daily Brief is rolling out to all Google AI subscribers (18+) in the Gemini app, starting in the U.S. In order to use Daily Brief, Google AI subscribers must have chosen to connect their Google apps.
  6. Universal Cart
    Our new Universal Cart is a truly intelligent shopping cart and your new hub for shopping on Google. It works across merchants and across services, so you can add things to your cart while you’re browsing Search, chatting with Gemini, watching YouTube or even reading your Gmail. The moment you add a product, your cart goes to work for you in the background. It finds deals and price drops, gives you insights on price history and alerts you when something comes back in stock.
    Universal Cart is rolling out across Search and the Gemini app in the U.S. this summer, with YouTube and Gmail to follow.
  7. Neural Expressive
    We've completely redesigned the Gemini experience from the ground up with Neural Expressive, our stunning new design language you’ll see from the moment you open the Gemini app or visit the site. The interface features fluid animations, vibrant colors, new typography and haptic feedback throughout. Model responses are where Neural Expressive truly comes alive. Instead of a wall of text, Gemini now designs tailored responses in real time — incorporating rich imagery, interactive timelines, narrated videos and dynamic graphics.
    Neural Expressive is now rolling out in the Gemini app on Android, iOS and the web to everyone.
  8. Gemini Spark
    This 24/7 personal AI agent in the Gemini app helps you navigate your digital life, takes action on your behalf and is under your direction. It’s integrated with Google’s suite of tools, like Gmail, Docs, Slides and more, and because it’s a cloud-based agent, it’s able to keep working in the background, even when you close your laptop or lock your phone. With Spark, you can set recurring tasks, teach it new skills and create complete workflows. You choose whether to turn it on and what apps it connects to, and it’s designed to ask you first before performing high-stakes actions like spending money or sending emails.
    Gemini Spark is rolling out to trusted testers, and we’re also rolling it out as a Beta for Google AI Ultra subscribers in the U.S.
  9. Gemini app for macOS
    We’re working on big updates to the Gemini app for macOS. We’ll be bringing Gemini Spark to the Gemini desktop app this summer so it can help with tasks involving your local files and automate workflows across your desktop.
    We’re also innovating on new voice experiences in the macOS app, similar to what we previewed at The Android Show. You won’t have to worry about all the “ums” or “what abouts” that happen as you think aloud. Using the context from your screen, Gemini can turn your free-flowing speech into precise drafts, instantly reformatting the text to capture your intent, right where your cursor is.
    The macOS app is available to download for all users, with Gemini Spark and the new voice features will roll out later this summer.
  10. Intelligent eyewear
    Our next big milestone for Android XR is intelligent eyewear. There will be two types of intelligent eyewear: audio glasses that offer spoken help in your ear, and display glasses that show you the information you need, right when you need it.
    Audio glasses are launching later this fall, and at I/O 2026, we revealed the first two designs. These glasses let you stay hands-free and heads-up for things like listening to music, taking photos, making calls, placing your usual coffee order or tapping into your phone apps without reaching into your pocket.
  11. SynthID
    Three years ago, we introduced SynthID, our industry-leading digital watermarking technology that embeds imperceptible signals into AI-generated content. Since then, we've integrated SynthID into our generative media models and products, watermarking over 100 billion images and videos and 60,000 years of audio assets, and brought SynthID verification to the Gemini app. We’re now expanding this verification capability to Search and also to Chrome in the coming weeks.
    Companies like OpenAI, Kakao and ElevenLabs are adopting SynthID to watermark more of their own AI-generated content. We’re also launching a new AI content detection API on Google Cloud’s Gemini Enterprise Agent Platform, giving businesses a robust tool to identify synthetic media across their operations.
    Additionally, we’re expanding Content Credentials across products. Pixel 10 was the first smartphone to provide Content Credentials for images in its native camera app, and we are expanding this technology to video on Pixel 8, 9 and 10 phones in the coming weeks. We’re also adding Content Credentials verification to the Gemini app, and to Search and Chrome in the coming months. This will show you if the origin of the content was AI or a camera, and if it’s been edited with generative AI tools.
  12. Gemini for Science
    Gemini for Science is a new collection of science tools and experiments designed to expand the scale and precision of scientific exploration. Building on the deep reasoning and research capabilities of Gemini as well as Deep Think and Deep Research, it includes new experiments on Labs as well as Science Skills to connect agentic platforms like Google Antigravity to over 30 major life science databases and tools.
    You can express interest to try Gemini for Science experiments on Google Labs, and Science Skills is available today on GitHub and directly in Google Antigravity.

谷歌新消息

文章目录


    扫描二维码,在手机上阅读