{"id":24161,"date":"2025-05-21T15:22:13","date_gmt":"2025-05-21T12:22:13","guid":{"rendered":"https:\/\/forklog.com\/en\/google-i-o-2025-a-249-99-ai-agent-video-generators-and-other-innovations\/"},"modified":"2025-05-21T15:22:13","modified_gmt":"2025-05-21T12:22:13","slug":"google-i-o-2025-a-249-99-ai-agent-video-generators-and-other-innovations","status":"publish","type":"post","link":"https:\/\/forklog.com\/en\/google-i-o-2025-a-249-99-ai-agent-video-generators-and-other-innovations\/","title":{"rendered":"Google I\/O 2025: a $249.99 AI agent, video generators and other innovations"},"content":{"rendered":"<p>On May 20 at the Google I\/O 2025 conference, the company unveiled a raft of new AI products, including an image generator, video tools, a filmmaking app, a translator in Google Meet and more.<\/p>\n<p><iframe loading=\"lazy\" width=\"560\" height=\"315\" src=\"https:\/\/www.youtube.com\/embed\/x_x-JAAKSvU?si=6yde8BR43fnJeV9N\" title=\"YouTube video player\" frameborder=\"0\" allow=\"accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share\" referrerpolicy=\"strict-origin-when-cross-origin\" allowfullscreen><\/iframe><\/p>\n<h2 class=\"wp-block-heading\"><strong>$249.99 for Google AI Ultra <\/strong><\/h2>\n<p>Google launched a new AI Ultra plan at $249.99 a month. It provides \u201cthe highest level of access\u201d to the company\u2019s AI apps and services. The subscription includes the new Google Veo 3 video generator, the Flow filmmaking app and the powerful Gemini 2.5 Pro Deep Think model (not yet launched). <\/p>\n<p>Other Google AI Ultra options:<\/p>\n<ul class=\"wp-block-list\">\n<li>higher limits on the NotebookLM and Whisk platforms;<\/li>\n<li>access to the Gemini chatbot in Chrome;<\/li>\n<li>agent tools based on Project Mariner technology;<\/li>\n<li>YouTube Premium;<\/li>\n<li>30 TB of storage across Google Drive, Google Photos and Gmail.<\/li>\n<\/ul>\n<p>One of the agent tools is Agent Mode. It can browse web pages, conduct research and integrate with Google apps to execute specific tasks. Its launch is expected \u201csoon\u201d.<\/p>\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p>\u201cUltra is a programme for those who want to be on the front line of artificial intelligence from Google,\u201d said Josh Woodward, vice president of Google Labs and Gemini.<\/p>\n<\/blockquote>\n<p>The AI Ultra subscription is available for now only in the US. <\/p>\n<p>Google joins a growing list of firms launching pricey plans. In December 2024, OpenAI <a href=\"https:\/\/forklog.com\/en\/news\/openai-unveils-200-a-month-pro-version-of-o1-amid-concerns-over-deceptive-behaviour\">released<\/a> ChatGPT Pro at $200 a month. In April, AI startup Anthropic set the same price for Max.<\/p>\n<p><script async src=\"https:\/\/telegram.org\/js\/telegram-widget.js?22\" data-telegram-post=\"forklogAI\/5861\" data-width=\"100%\"><\/script><\/p>\n<h2 class=\"wp-block-heading\"><strong>Veo 3 \u2014 video with sound<\/strong><\/h2>\n<p>Veo 3 is a new AI model for generating video and audio accompaniment such as effects, noise and dialogue. The company stressed the product\u2019s superiority over the previous Veo 2 in the quality of its output. <\/p>\n<p><iframe loading=\"lazy\" width=\"560\" height=\"315\" src=\"https:\/\/www.youtube.com\/embed\/QYnJ3qJ5qJQ?si=luIxB6wwQueYa8hJ\" title=\"YouTube video player\" frameborder=\"0\" allow=\"accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share\" referrerpolicy=\"strict-origin-when-cross-origin\" allowfullscreen><\/iframe><\/p>\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p>\u201cFor the first time we are coming out of the age of silence in video creation. [You can give Veo 3] a prompt for character and environment characteristics and propose dialogue with a description of how it should sound,\u201d said Google DeepMind CEO Demis Hassabis. <\/p>\n<\/blockquote>\n<blockquote class=\"twitter-tweet\">\n<p lang=\"en\" dir=\"ltr\">cooking up something tasty for tomorrow\u2026 <a href=\"https:\/\/t.co\/wyIRMsXkFG\">pic.twitter.com\/wyIRMsXkFG<\/a><\/p>\n<p>\u2014 Demis Hassabis (@demishassabis) <a href=\"https:\/\/twitter.com\/demishassabis\/status\/1924501631972057186?ref_src=twsrc%5Etfw\">May 19, 2025<\/a><\/p><\/blockquote>\n<p> <script async src=\"https:\/\/platform.twitter.com\/widgets.js\" charset=\"utf-8\"><\/script><\/p>\n<p>The model is available in the Gemini app for subscribers to the Google AI Ultra plan. <\/p>\n<p>The appearance of Veo 3 was likely made possible by DeepMind\u2019s work in the field. In June last year, Google\u2019s AI division <a href=\"https:\/\/forklog.com\/en\/news\/google-deepmind-develops-ai-model-for-video-soundtrack-generation\">began developing<\/a> a technology based on artificial intelligence for generating soundtracks for video.<\/p>\n<p>Improvements were also presented for <a href=\"https:\/\/forklog.com\/en\/news\/google-deepmind-unveils-advanced-ai-video-and-image-generators\">Veo 2<\/a> \u2014 it can now be given images of characters, scenes, objects and styles to improve consistency. It understands camera motion, can add or remove objects from a clip and can expand frames \u2014 for example, turning vertical video into horizontal.<\/p>\n<p>The new Veo 2 features will become available on the Vertex AI platform.<\/p>\n<h2 class=\"wp-block-heading\"><strong>Imagen 4 \u2014 image generator<\/strong><\/h2>\n<p>Google brought to market a new AI model for creating images \u2014 Imagen 4. It can visualise fine details, such as fabrics, water droplets and animal fur, and work with photorealistic and abstract styles. <\/p>\n<blockquote class=\"twitter-tweet\">\n<p lang=\"en\" dir=\"ltr\">Imagen 4 delivers visuals that pop with richer details, more nuanced color, and better text outputs.<\/p>\n<p>Everyone can make images for free in the Gemini App today: <a href=\"https:\/\/t.co\/awhPeHZIqm\">https:\/\/t.co\/awhPeHZIqm<\/a><a href=\"https:\/\/twitter.com\/hashtag\/GoogleIO?src=hash&#038;ref_src=twsrc%5Etfw\">#GoogleIO<\/a> <a href=\"https:\/\/t.co\/nnI8ZGIELv\">pic.twitter.com\/nnI8ZGIELv<\/a><\/p>\n<p>\u2014 Google Gemini App (@GeminiApp) <a href=\"https:\/\/twitter.com\/GeminiApp\/status\/1924893484760367226?ref_src=twsrc%5Etfw\">May 20, 2025<\/a><\/p><\/blockquote>\n<p> <script async src=\"https:\/\/platform.twitter.com\/widgets.js\" charset=\"utf-8\"><\/script><\/p>\n<p>The model delivers higher-quality results than Imagen 3 and can create illustrations in different aspect ratios at resolutions up to 2K.<\/p>\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p>\u201cWe also put a lot of emphasis on improving text generation and typography, so the model is great for creating slides, invitations or any other materials where you need to combine images and text,\u201d Woodward stressed.<\/p>\n<\/blockquote>\n<p>The tool is available in the Gemini app, on the Google Whisk and Vertex AI platforms, and in Google Slides, Vids, Docs and other Google Workspace products.<\/p>\n<h2 class=\"wp-block-heading\"><strong>Flow \u2014 film generator<\/strong><\/h2>\n<p>At Google I\/O 2025 the company announced Flow, a new AI model for creating films. It integrates three tools: <\/p>\n<ul class=\"wp-block-list\">\n<li>Veo for generating video; <\/li>\n<li>Imagen for generating images;<\/li>\n<li>Gemini for working with text and prompts.<\/li>\n<\/ul>\n<blockquote class=\"twitter-tweet\">\n<p lang=\"en\" dir=\"ltr\">Introducing Flow: a new type of AI filmmaking tool that combines the best of Veo, Imagen and Gemini \u2014 built with and for creatives.<\/p>\n<p>Flow helps you maintain character and visual consistency from one clip to the next.<\/p>\n<p>See how emerging filmmakers are using it ? <a href=\"https:\/\/t.co\/H0cBv6IGs1\">pic.twitter.com\/H0cBv6IGs1<\/a><\/p>\n<p>\u2014 Google (@Google) <a href=\"https:\/\/twitter.com\/Google\/status\/1924896843441336440?ref_src=twsrc%5Etfw\">May 20, 2025<\/a><\/p><\/blockquote>\n<p> <script async src=\"https:\/\/platform.twitter.com\/widgets.js\" charset=\"utf-8\"><\/script><\/p>\n<p>Flow lets you import characters or scenes, or create these elements directly inside the tool. It offers camera controls for changing angle or perspective, a scene builder and asset-management features. <\/p>\n<p>In addition, the company is launching Flow TV \u2014 a feed of video clips and content with the exact prompts used to create them. The service will help users understand creators\u2019 process. <\/p>\n<h2 class=\"wp-block-heading\"><strong>Smart glasses<\/strong><\/h2>\n<p>Google is joining the smart-glasses race, announcing partnerships with Gentle Monster and Warby Parker to build an Android XR-based gadget.<\/p>\n<p>Android XR is a platform for extended-reality (XR) devices launched last year in partnership with Qualcomm and Samsung. <\/p>\n<p>The company said it is deepening its partnership with Samsung to develop XR glasses. The two firms are building the software and hardware platform.<\/p>\n<p>At the conference, Google showed a concept of Android XR glasses with Gemini artificial intelligence. They are equipped with a camera, microphone, speakers and a display for viewing notifications. <\/p>\n<blockquote class=\"twitter-tweet\">\n<p lang=\"en\" dir=\"ltr\">Google Android XR Glasses ? Live Demo<a href=\"https:\/\/twitter.com\/hashtag\/GoogleIO?src=hash&#038;ref_src=twsrc%5Etfw\">#GoogleIO<\/a> <a href=\"https:\/\/t.co\/qoGK4rs2z4\">pic.twitter.com\/qoGK4rs2z4<\/a><\/p>\n<p>\u2014 Ben Geskin (@BenGeskin) <a href=\"https:\/\/twitter.com\/BenGeskin\/status\/1924903309065723970?ref_src=twsrc%5Etfw\">May 20, 2025<\/a><\/p><\/blockquote>\n<p> <script async src=\"https:\/\/platform.twitter.com\/widgets.js\" charset=\"utf-8\"><\/script><\/p>\n<p>Google plans to allocate up to $150 million to co-develop AI glasses with Warby Parker. $75 million has already been sent. <\/p>\n<h2 class=\"wp-block-heading\"><strong>Gemini integration in Chrome<\/strong><\/h2>\n<p>The company announced the launch of Gemini integration in Chrome. Users will get an AI assistant for working in the browser. It can understand page context and perform various tasks. <\/p>\n<p>Gemini in Chrome is available via text input and voice command. You can start chatting with the assistant by clicking the Gemini icon in the top-right corner of the Chrome window.<\/p>\n<p>Example: a user can open a banana-bread recipe page and ask Gemini to make it gluten-free. Or use the digital assistant to choose a plant for a bedroom depending on lighting conditions. <\/p>\n<p>In future, Gemini will be able to work with multiple tabs at once \u2014 enabling, among other things, comparison of two similar items across pages or online shops. <\/p>\n<h2 class=\"wp-block-heading\"><strong>Translator in Google Meet<\/strong><\/h2>\n<p>Google Meet has added real-time speech translation. The company uses a large audio language model from DeepMind to enable natural conversation with a counterpart in another language. <\/p>\n<p>During translation, voice, intonation and facial expression are preserved. The new feature has many use cases. For example, English-speaking grandchildren will be able to talk to Spanish-speaking grandparents, as will employees of a large company across regions. <\/p>\n<p>The company claims translation latency is very low, allowing conversations with several people at once. <\/p>\n<figure class=\"wp-block-image\"><img decoding=\"async\" src=\"https:\/\/lh7-qw.googleusercontent.com\/docsz\/AD_4nXfrmosfnTelJaJq9ZIV1FUgWofj-JcvpLTdTO_KJL600CYNgBYU63tJCh29YSln4uDbkta71OFY1on3g1KAKlBHqRVLTBWA4WmgDiKrJL28mDs7y8jk8DKeNd1cD3ybqzG_zbADDg?key=2h8LMU7bFaqcEedzvx5t0A\" alt=\"Google I\/O 2025: $249.99 for an AI agent, video generators and other innovations\"\/><figcaption class=\"wp-element-caption\">Data: Google.<\/figcaption><\/figure>\n<p>During the conversation, the original speech from the interlocutor is preserved. The translation is overlaid on top. <\/p>\n<h2 class=\"wp-block-heading\"><strong>Gemini chatbot improvements<\/strong><\/h2>\n<p>Google announced several updates to the Gemini chatbot. Among them:<\/p>\n<ul class=\"wp-block-list\">\n<li>broader availability of multimodal capabilities;<\/li>\n<li>updated AI models;<\/li>\n<li>streaming video from the phone\u2019s camera or screen while holding voice conversations in parallel;<\/li>\n<li>routing in Google Maps, creating events in Google Calendar and to-do lists in Google Tasks.<\/li>\n<\/ul>\n<p>At the conference, Google said Gemini now has 400 million monthly active users.<\/p>\n<p>The company also updated Deep Research \u2014 a tool for generating detailed research reports. Users can upload PDFs and images, and the service will match them with public information to provide more personalised answers. <\/p>\n<p>In future, Drive and Gmail will be integrable into Deep Research. <\/p>\n<h2 class=\"wp-block-heading\"><strong>Project Mariner \u2014 an AI agent for browsing web pages<\/strong><\/h2>\n<p>Google opened the experimental AI agent Project Mariner to American users with a Google AI Ultra subscription. Its operating principle has also been updated \u2014 the assistant can now perform up to ten tasks simultaneously. <\/p>\n<p>Examples of Project Mariner\u2019s capabilities include buying tickets to a baseball game or groceries online. Users chat with the agent; it then visits sites and performs the required actions. They can get on with other things while the assistant completes tasks in the background.<\/p>\n<h2 class=\"wp-block-heading\"><strong>Other solutions<\/strong><\/h2>\n<p>Alongside the products above, Google also presented many others. Among them:<\/p>\n<ul class=\"wp-block-list\">\n<li>Gemma 3n \u2014 an AI model for \u201csmooth\u201d operation on phones, laptops and tablets, able to interact with audio, text, images and video;<\/li>\n<li>Stitch \u2014 a tool to assist front-end development of web and mobile applications; it can generate the required interface elements and code;<\/li>\n<li>adding a video-overviews feature in NotebookLM \u2014 users will be able to turn multimedia materials into easy-to-digest visual presentations;<\/li>\n<li>new features in Google Search\u2019s AI mode for online shoppers, including a visual panel, personalised price alerts and virtual try-ons;<\/li>\n<li>SynthID Detector \u2014 can determine whether an image, video, audio or a text fragment was created with the company\u2019s AI models;<\/li>\n<li>an improved Deep Think reasoning mode for the flagship Gemini 2.5 Pro model;<\/li>\n<li>new AI features in Gmail, Google Docs and Google Vids for cleaning up messages, composing personalised emails or creating and editing content.<\/li>\n<\/ul>\n<p>In April, <a href=\"https:\/\/forklog.com\/en\/news\/openai-expresses-interest-in-acquiring-google-chrome\">it emerged that<\/a> OpenAI wanted to acquire the Chrome browser.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>On May 20 at the Google I\/O 2025 conference, the company unveiled a raft of new AI products, including an image generator, video tools, a filmmaking app, a translator in Google Meet and more. $249.99 for Google AI Ultra Google launched a new AI Ultra plan at $249.99 a month. It provides \u201cthe highest level [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":24160,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"select":"","news_style_id":"","cryptorium_level":"","_short_excerpt_text":"","creation_source":"","_metatest_mainpost_news_update":false,"footnotes":""},"categories":[3],"tags":[1751,438,719,738,1150],"class_list":["post-24161","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-news-and-analysis","tag-ai-agents","tag-artificial-intelligence","tag-gemini","tag-google","tag-news-plus"],"aioseo_notices":[],"amp_enabled":true,"views":"93","promo_type":"","layout_type":"","short_excerpt":"","is_update":"","_links":{"self":[{"href":"https:\/\/forklog.com\/en\/wp-json\/wp\/v2\/posts\/24161","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/forklog.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/forklog.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/forklog.com\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/forklog.com\/en\/wp-json\/wp\/v2\/comments?post=24161"}],"version-history":[{"count":0,"href":"https:\/\/forklog.com\/en\/wp-json\/wp\/v2\/posts\/24161\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/forklog.com\/en\/wp-json\/wp\/v2\/media\/24160"}],"wp:attachment":[{"href":"https:\/\/forklog.com\/en\/wp-json\/wp\/v2\/media?parent=24161"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/forklog.com\/en\/wp-json\/wp\/v2\/categories?post=24161"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/forklog.com\/en\/wp-json\/wp\/v2\/tags?post=24161"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}