<?xml version="1.0" encoding="UTF-8"?>
<!-- AUTOGENERATED FILE. DO NOT EDIT. -->
<feed xmlns="http://www.w3.org/2005/Atom">
  <id>tag:google.com,2016:generative-ai-on-vertex-ai-release-notes</id>
  <title>Generative AI on Vertex AI - Release notes</title>
  <link rel="self" href="https://docs.cloud.google.com/feeds/generative-ai-on-vertex-ai-release-notes.xml"/>
  <author>
    <name>Google Cloud Platform</name>
  </author>
  <updated>2026-04-06T00:00:00-07:00</updated>

  <entry>
    <title>April 06, 2026</title>
    <id>tag:google.com,2016:generative-ai-on-vertex-ai-release-notes#April_06_2026</id>
    <updated>2026-04-06T00:00:00-07:00</updated>
    <link rel="alternate" href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/release-notes#April_06_2026"/>
    <content type="html"><![CDATA[<h3>Feature</h3>
<p><strong>Metadata search for RAG Engine</strong></p>
<p>Use schema-based metadata search in Vertex AI RAG Engine.
You can define a metadata schema for a corpus, attach metadata to files within
that corpus, and use this metadata to filter contexts during retrieval.
For more information, see
<a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/rag-engine/use-metadata-search">Filter with metadata search</a>.</p>
]]>
    </content>
  </entry>

  <entry>
    <title>April 03, 2026</title>
    <id>tag:google.com,2016:generative-ai-on-vertex-ai-release-notes#April_03_2026</id>
    <updated>2026-04-03T00:00:00-07:00</updated>
    <link rel="alternate" href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/release-notes#April_03_2026"/>
    <content type="html"><![CDATA[<h3>Feature</h3>
<p><strong>Gemma 4 26B A4B IT</strong> is available as an experimental launch in Model Garden. This is an open model built by Google DeepMind. Gemma 4 models are multimodal, handling text and image input (with audio supported on small models) and generating text output.
Gemma 4 26B A4B IT is available as a managed API in Model Garden. To learn more, see
<a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/maas/google/gemma-4-26b-a4b-it">Gemma 4 26B A4B IT</a>.</p>
<h3>Feature</h3>
<p><strong>Vertex AI RAG Engine Serverless mode</strong></p>
<p>Vertex AI RAG Engine Serverless mode is now available in <a href="https://cloud.google.com/products#product-launch-stages">public
preview</a>. Serverless
mode provides a fully managed database for storing RAG resources that abstracts
away database provisioning and scaling. You can seamlessly switch between
Serverless mode and Spanner mode, which provides dedicated, isolated database
instances.</p>
<p>For more information, see the following:</p>
<ul>
<li><a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/rag-engine/deployment-modes">Deployment modes in Vertex AI RAG Engine</a></li>
<li><a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/rag-engine/serverless-mode">Serverless mode</a></li>
<li><a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/rag-engine/spanner-mode">Managing Spanner mode</a></li>
<li><a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/rag-engine/switching-modes">Switching between modes</a></li>
</ul>
]]>
    </content>
  </entry>

  <entry>
    <title>April 02, 2026</title>
    <id>tag:google.com,2016:generative-ai-on-vertex-ai-release-notes#April_02_2026</id>
    <updated>2026-04-02T00:00:00-07:00</updated>
    <link rel="alternate" href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/release-notes#April_02_2026"/>
    <content type="html"><![CDATA[<h3>Feature</h3>
<p><strong>Veo 3.1 Lite</strong></p>
<p>Veo 3.1 Lite is available in <a href="https://cloud.google.com/products#product-launch-stages">public
preview</a>. Veo 3.1 Lite
is our most cost-efficient Veo model on Vertex AI.</p>
<p>For more information, see <a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/models/veo/3-1-generate#3.1-lite-generate-001">3.1 Lite
Generate</a>.</p>
<h3>Announcement</h3>
<p><strong>Gemini 2.5 model retirement dates updated</strong></p>
<p>The retirement dates for Gemini 2.5 Pro, Gemini 2.5 Flash-Lite,
and Gemini 2.5 Flash have been updated to October 16, 2026. For more information,
see <a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/learn/model-versions">Model versions and lifecycle</a>.</p>
]]>
    </content>
  </entry>

  <entry>
    <title>March 25, 2026</title>
    <id>tag:google.com,2016:generative-ai-on-vertex-ai-release-notes#March_25_2026</id>
    <updated>2026-03-25T00:00:00-07:00</updated>
    <link rel="alternate" href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/release-notes#March_25_2026"/>
    <content type="html"><![CDATA[<h3>Feature</h3>
<p><strong>Lyria 3</strong></p>
<p>Lyria 3 is available in <a href="https://cloud.google.com/products#product-launch-stages">public
preview</a>. You can use
<code>lyria-3-pro-preview</code> to generate 184 seconds of audio, or
<code>lyria-3-clip-preview</code> to generate 30 seconds of audio.</p>
<p>For more information, see the following:</p>
<ul>
<li><p><a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/models/lyria/lyria-3#lyria-3-pro-preview">Lyria 3 Pro
Preview</a></p></li>
<li><p><a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/models/lyria/lyria-3#lyria-3-clip-preview">Lyria 3 Clip Preview</a></p></li>
</ul>
]]>
    </content>
  </entry>

  <entry>
    <title>March 24, 2026</title>
    <id>tag:google.com,2016:generative-ai-on-vertex-ai-release-notes#March_24_2026</id>
    <updated>2026-03-24T00:00:00-07:00</updated>
    <link rel="alternate" href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/release-notes#March_24_2026"/>
    <content type="html"><![CDATA[<h3>Deprecated</h3>
<p><strong>Imagen generation GA endpoints deprecation</strong></p>
<p>The following table describes image generation endpoints that are deprecated and
their replacements. We recommend updating your model endpoints before June 30,
2026, to avoid service disruption.</p>
<table>
<thead>
<tr>
<th>Discontinued endpoints</th>
<th>Recommended endpoint migration</th>
</tr>
</thead>
<tbody>
<tr>
<td><code>imagegeneration@002</code></td>
<td><code>gemini-2.5-flash-image</code></td>
</tr>
<tr>
<td><code>imagegeneration@003</code></td>
<td><code>gemini-2.5-flash-image</code></td>
</tr>
<tr>
<td><code>imagegeneration@004</code></td>
<td><code>gemini-2.5-flash-image</code></td>
</tr>
<tr>
<td><code>imagegeneration@005</code></td>
<td><code>gemini-2.5-flash-image</code></td>
</tr>
<tr>
<td><code>imagegeneration@006</code></td>
<td><code>gemini-2.5-flash-image</code></td>
</tr>
<tr>
<td><code>imagetext@001</code></td>
<td><code>gemini-2.5-flash-image</code></td>
</tr>
<tr>
<td><code>imagen-3.0-capability-001</code></td>
<td><code>gemini-2.5-flash-image</code></td>
</tr>
<tr>
<td><code>imagen-3.0-capability-002</code></td>
<td><code>gemini-2.5-flash-image</code></td>
</tr>
<tr>
<td><code>imagen-3.0-fast-generate-001</code></td>
<td><code>gemini-2.5-flash-image</code></td>
</tr>
<tr>
<td><code>imagen-3.0-generate-001</code></td>
<td><code>gemini-2.5-flash-image</code></td>
</tr>
<tr>
<td><code>imagen-3.0-generate-002</code></td>
<td><code>gemini-2.5-flash-image</code></td>
</tr>
<tr>
<td><code>imagen-4.0-fast-generate-001</code></td>
<td><code>gemini-2.5-flash-image</code></td>
</tr>
<tr>
<td><code>imagen-4.0-generate-001</code></td>
<td><code>gemini-2.5-flash-image</code></td>
</tr>
<tr>
<td><code>imagen-4.0-ultra-generate-001</code></td>
<td><code>gemini-2.5-flash-image</code></td>
</tr>
</tbody>
</table>
<h3>Deprecated</h3>
<p><strong>Video generation GA endpoints deprecation</strong></p>
<p>The following table describes video generation endpoints that are deprecated and
their replacements. We recommend updating your model endpoints before June 30,
2026, to avoid service disruption.</p>
<table>
<thead>
<tr>
<th>Discontinued endpoints</th>
<th>Recommended endpoint migration</th>
</tr>
</thead>
<tbody>
<tr>
<td><code>veo-3.0-generate-001</code></td>
<td><code>veo-3.1-generate-001</code></td>
</tr>
<tr>
<td><code>veo-3.0-fast-generate-001</code></td>
<td><code>veo-3.1-fast-generate-001</code></td>
</tr>
<tr>
<td><code>veo-2.0-generate-001</code></td>
<td><code>veo-3.1-generate-001</code></td>
</tr>
</tbody>
</table>
]]>
    </content>
  </entry>

  <entry>
    <title>March 12, 2026</title>
    <id>tag:google.com,2016:generative-ai-on-vertex-ai-release-notes#March_12_2026</id>
    <updated>2026-03-12T00:00:00-07:00</updated>
    <link rel="alternate" href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/release-notes#March_12_2026"/>
    <content type="html"><![CDATA[<h3>Feature</h3>
<p><strong>Partner model evaluations</strong></p>
<p>The Gen AI evaluation service supports evaluating partner models, such as Anthropic
and Llama models. For more information, see
<a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/models/evaluation-genai-console#evaluate_partner_models">Perform evaluation using the console</a>.</p>
]]>
    </content>
  </entry>

  <entry>
    <title>March 03, 2026</title>
    <id>tag:google.com,2016:generative-ai-on-vertex-ai-release-notes#March_03_2026</id>
    <updated>2026-03-03T00:00:00-08:00</updated>
    <link rel="alternate" href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/release-notes#March_03_2026"/>
    <content type="html"><![CDATA[<h3>Deprecated</h3>
<p><strong>Video generation preview endpoints deprecation</strong></p>
<p>The following table describes video generation endpoints that are deprecated and
their replacements. We recommend updating your model endpoints before April 2,
2026, to avoid service disruption.</p>
<table>
<thead>
<tr>
<th>Discontinued endpoints</th>
<th>Recommended endpoint migration</th>
</tr>
</thead>
<tbody>
<tr>
<td><code>veo-3.0-generate-preview</code></td>
<td><code>veo-3.0-generate-001</code></td>
</tr>
<tr>
<td><code>veo-3.0-fast-generate-preview</code></td>
<td><code>veo-3.0-fast-generate-001</code></td>
</tr>
<tr>
<td><code>veo-2.0-generate-preview</code></td>
<td><code>veo-2.0-generate-001</code></td>
</tr>
<tr>
<td><code>veo-2.0-generate-exp</code></td>
<td><code>veo-2.0-generate-001</code></td>
</tr>
<tr>
<td><code>veo-001-preview-0815</code></td>
<td><code>veo-2.0-generate-001</code></td>
</tr>
<tr>
<td><code>veo-001-preview</code></td>
<td><code>veo-2.0-generate-001</code></td>
</tr>
<tr>
<td><code>veo-3.1-generate-preview</code></td>
<td><code>veo-3.1-generate-001</code></td>
</tr>
<tr>
<td><code>veo-3.1-fast-generate-preview</code></td>
<td><code>veo-3.1-fast-generate-001</code></td>
</tr>
</tbody>
</table>
<h3>Feature</h3>
<p><strong>Gemini 3.1 Flash-Lite</strong></p>
<p>Gemini 3.1 Flash-Lite (<code>gemini-3.1-flash-lite-preview</code>) is
available in <a href="https://cloud.google.com/products#product-launch-stages">public preview</a>.
This release is our most cost-efficient Gemini model and is
optimized for low-latency use cases with high-volume, cost-sensitive LLM traffic.</p>
<p>For more information, see
<a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/models/gemini/3-1-flash-lite">Gemini 3.1 Flash-Lite</a>.</p>
]]>
    </content>
  </entry>

  <entry>
    <title>February 26, 2026</title>
    <id>tag:google.com,2016:generative-ai-on-vertex-ai-release-notes#February_26_2026</id>
    <updated>2026-02-26T00:00:00-08:00</updated>
    <link rel="alternate" href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/release-notes#February_26_2026"/>
    <content type="html"><![CDATA[<h3>Feature</h3>
<p><strong>Gemini 3.1 Flash Image</strong></p>
<p>Gemini 3.1 Flash Image (<code>gemini-3.1-flash-image</code>) is available
in <a href="https://cloud.google.com/products#product-launch-stages">public preview</a>.
This release enables high-quality image generation with improved pricing and
latency. We recommend using Gemini 3.1 Flash Image when generating
images.</p>
<p>For more information, see
<a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/models/gemini/3-1-flash-image">Gemini 3.1 Flash Image</a>.</p>
]]>
    </content>
  </entry>

  <entry>
    <title>February 23, 2026</title>
    <id>tag:google.com,2016:generative-ai-on-vertex-ai-release-notes#February_23_2026</id>
    <updated>2026-02-23T00:00:00-08:00</updated>
    <link rel="alternate" href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/release-notes#February_23_2026"/>
    <content type="html"><![CDATA[<h3>Deprecated</h3>
<p><strong>Anthropic's Claude 3 Haiku</strong></p>
<p>Anthropic's Claude 3 Haiku is deprecated as of February 23, 2026, and will be
shut down on August 23, 2026. For more information, see
<a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/deprecations/partner-models#haiku-3">Partner model deprecations</a>.</p>
]]>
    </content>
  </entry>

  <entry>
    <title>February 19, 2026</title>
    <id>tag:google.com,2016:generative-ai-on-vertex-ai-release-notes#February_19_2026</id>
    <updated>2026-02-19T00:00:00-08:00</updated>
    <link rel="alternate" href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/release-notes#February_19_2026"/>
    <content type="html"><![CDATA[<h3>Feature</h3>
<p><strong>Gemini 3.1 Pro Preview</strong></p>
<p><a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/models/gemini/3-1-pro">Gemini 3.1 Pro</a>
is available in preview in Model Garden. Gemini 3.1 Pro is
our most advanced reasoning Gemini model, capable of solving complex
problems from different information sources, including text, audio, images,
video, PDFs, and even entire code repositories with its 1M token context
window.</p>
]]>
    </content>
  </entry>

  <entry>
    <title>February 17, 2026</title>
    <id>tag:google.com,2016:generative-ai-on-vertex-ai-release-notes#February_17_2026</id>
    <updated>2026-02-17T00:00:00-08:00</updated>
    <link rel="alternate" href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/release-notes#February_17_2026"/>
    <content type="html"><![CDATA[<h3>Feature</h3>
<p><strong>Anthropic's Claude Sonnet 4.6</strong></p>
<p><a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/partner-models/claude/sonnet-4-6">Claude Sonnet 4.6</a>
is available in Model Garden.</p>
<h3>Deprecated</h3>
<p><strong>Image generation preview endpoints deprecation</strong></p>
<p>The following table describes image generation endpoints that are deprecated and
their replacements. We recommend updating your model endpoints before March 19,
2026, to avoid service disruption.</p>
<table>
<thead>
<tr>
<th>Discontinued endpoints</th>
<th>Recommended endpoint migration</th>
</tr>
</thead>
<tbody>
<tr>
<td><code>gemini-2.0-flash-image-generation-preview</code></td>
<td><code>gemini-2.5-flash-image</code></td>
</tr>
<tr>
<td><code>gemini-2.5-flash-image-generation-preview</code></td>
<td><code>imagen-4.0-generate-001</code> or <code>gemini-2.5-flash-image</code></td>
</tr>
<tr>
<td><code>imagen-4.0-generate-preview-05-20</code></td>
<td><code>imagen-4.0-generate-001</code> or <code>gemini-2.5-flash-image</code></td>
</tr>
<tr>
<td><code>imagen-4.0-generate-preview-06-06</code></td>
<td><code>imagen-4.0-generate-001</code> or <code>gemini-2.5-flash-image</code></td>
</tr>
<tr>
<td><code>imagen-4.0-ultra-generate-preview-06-06</code></td>
<td><code>imagen-4.0-generate-001</code> or <code>gemini-2.5-flash-image</code></td>
</tr>
<tr>
<td><code>imagen-4.0-fast-generate-preview-05-20</code></td>
<td><code>imagen-4.0-generate-001</code> or <code>gemini-2.5-flash-image</code></td>
</tr>
<tr>
<td><code>imagen-product-recontext-preview-06-30</code></td>
<td><code>gemini-2.5-flash-image</code></td>
</tr>
<tr>
<td><code>imagen-2.0-edit-preview-0627</code></td>
<td><code>gemini-2.5-flash-image</code></td>
</tr>
<tr>
<td><code>virtual-try-on-preview-08-04</code></td>
<td><code>virtual-try-on-001</code></td>
</tr>
<tr>
<td><code>imagen-4.0-ingredients-preview</code></td>
<td><code>gemini-2.5-flash-image</code></td>
</tr>
</tbody>
</table>
]]>
    </content>
  </entry>

  <entry>
    <title>February 10, 2026</title>
    <id>tag:google.com,2016:generative-ai-on-vertex-ai-release-notes#February_10_2026</id>
    <updated>2026-02-10T00:00:00-08:00</updated>
    <link rel="alternate" href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/release-notes#February_10_2026"/>
    <content type="html"><![CDATA[<h3>Feature</h3>
<p><strong>GLM 5</strong> is available as an experimental launch in Model Garden. This model
targets complex systems engineering and long-horizon agentic tasks.
GLM 5 is available as a managed API in Model Garden. To learn more, see
<a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/maas/zai-org/glm-5">GLM 5</a>.</p>
]]>
    </content>
  </entry>

  <entry>
    <title>February 04, 2026</title>
    <id>tag:google.com,2016:generative-ai-on-vertex-ai-release-notes#February_04_2026</id>
    <updated>2026-02-04T00:00:00-08:00</updated>
    <link rel="alternate" href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/release-notes#February_04_2026"/>
    <content type="html"><![CDATA[<h3>Feature</h3>
<p><strong>Anthropic's Claude Opus 4.6</strong></p>
<p><a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/partner-models/claude/opus-4-6">Claude Opus 4.6</a>
is available in Model Garden.</p>
]]>
    </content>
  </entry>

  <entry>
    <title>January 23, 2026</title>
    <id>tag:google.com,2016:generative-ai-on-vertex-ai-release-notes#January_23_2026</id>
    <updated>2026-01-23T00:00:00-08:00</updated>
    <link rel="alternate" href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/release-notes#January_23_2026"/>
    <content type="html"><![CDATA[<h3>Feature</h3>
<p><strong>Virtual Try-On</strong></p>
<p>Virtual Try-On is now <a href="https://cloud.google.com/products/#product-launch-stages">generally available
(GA)</a>.
The new endpoint, <code>virtual-try-on-001</code>,
replaces the previous endpoint, <code>virtual-try-on-preview-08-04</code>. We
recommend changing to the new endpoint as soon as possible.</p>
<p>For more information, see
<a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/image/generate-virtual-try-on-images">Generate Virtual Try-On Images</a>.</p>
]]>
    </content>
  </entry>

  <entry>
    <title>January 22, 2026</title>
    <id>tag:google.com,2016:generative-ai-on-vertex-ai-release-notes#January_22_2026</id>
    <updated>2026-01-22T00:00:00-08:00</updated>
    <link rel="alternate" href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/release-notes#January_22_2026"/>
    <content type="html"><![CDATA[<h3>Announcement</h3>
<p>Codestral (25.01) and Mistral Large (24.11) are retired as of January 23, 2026.</p>
]]>
    </content>
  </entry>

  <entry>
    <title>January 20, 2026</title>
    <id>tag:google.com,2016:generative-ai-on-vertex-ai-release-notes#January_20_2026</id>
    <updated>2026-01-20T00:00:00-08:00</updated>
    <link rel="alternate" href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/release-notes#January_20_2026"/>
    <content type="html"><![CDATA[<h3>Feature</h3>
<p><strong>GLM 4.7</strong> is now generally available (GA) in Model Garden. This model
is designed for core or vibe coding, tool use, and complex reasoning.
GLM 4.7 is available as a managed API in Model Garden. To learn more, see
<a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/maas/zai-org/glm-47">GLM 4.7</a>.</p>
]]>
    </content>
  </entry>

  <entry>
    <title>January 13, 2026</title>
    <id>tag:google.com,2016:generative-ai-on-vertex-ai-release-notes#January_13_2026</id>
    <updated>2026-01-13T00:00:00-08:00</updated>
    <link rel="alternate" href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/release-notes#January_13_2026"/>
    <content type="html"><![CDATA[<h3>Feature</h3>
<p><strong>Veo 3.1 reference-to-video update</strong></p>
<p>Veo 3.1 Preview models now support the following features:</p>
<ul>
<li>9:16 aspect ratio for reference-to-video.</li>
<li>Upsampling for videos generated at 1080p and 4K resolutions.</li>
</ul>
<p>For more information, see the following:</p>
<ul>
<li><a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/video/use-reference-images-to-guide-video-generation">Generate Veo videos from reference images</a></li>
<li><a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/model-reference/veo-video-generation">Veo on Vertex AI video generation API</a></li>
</ul>
]]>
    </content>
  </entry>

  <entry>
    <title>January 06, 2026</title>
    <id>tag:google.com,2016:generative-ai-on-vertex-ai-release-notes#January_06_2026</id>
    <updated>2026-01-06T00:00:00-08:00</updated>
    <link rel="alternate" href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/release-notes#January_06_2026"/>
    <content type="html"><![CDATA[<h3>Feature</h3>
<p><strong>GLM 4.7</strong> is available as an experimental launch in Model Garden. This model
is designed for core or vibe coding, tool use, and complex reasoning.
GLM 4.7 is available as a managed API in Model Garden. To learn more, see
<a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/maas/zai-org/glm-47">GLM 4.7</a>.</p>
]]>
    </content>
  </entry>

  <entry>
    <title>January 05, 2026</title>
    <id>tag:google.com,2016:generative-ai-on-vertex-ai-release-notes#January_05_2026</id>
    <updated>2026-01-05T00:00:00-08:00</updated>
    <link rel="alternate" href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/release-notes#January_05_2026"/>
    <content type="html"><![CDATA[<h3>Deprecated</h3>
<p><strong>Anthropic's Claude 3.5 Haiku</strong></p>
<p>Anthropic's Claude 3.5 Haiku is deprecated as of January 5, 2026, and will be
shut down on July 5, 2026. For more information, see
<a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/deprecations/partner-models#haiku-3-5">Partner model deprecations</a>.</p>
]]>
    </content>
  </entry>

  <entry>
    <title>December 18, 2025</title>
    <id>tag:google.com,2016:generative-ai-on-vertex-ai-release-notes#December_18_2025</id>
    <updated>2025-12-18T00:00:00-08:00</updated>
    <link rel="alternate" href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/release-notes#December_18_2025"/>
    <content type="html"><![CDATA[<h3>Feature</h3>
<p><strong>Save and share prompts in Vertex AI Studio</strong>: The prompt sharing feature no longer needs to be enabled. You can share prompts without asking your administrator to first enable the prompt sharing feature. For more information, see <a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/prompt-sharing">Save and share prompts</a>.</p>
<h3>Feature</h3>
<p>The following models are available through Model Garden:</p>
<ul>
<li><a href="https://console.cloud.google.com/vertex-ai/publishers/google/model-garden/functiongemma">FunctionGemma</a></li>
<li><a href="https://console.cloud.google.com/vertex-ai/publishers/google/model-garden/t5gemma">T5Gemma 2</a></li></ul>
]]>
    </content>
  </entry>

  <entry>
    <title>December 17, 2025</title>
    <id>tag:google.com,2016:generative-ai-on-vertex-ai-release-notes#December_17_2025</id>
    <updated>2025-12-17T00:00:00-08:00</updated>
    <link rel="alternate" href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/release-notes#December_17_2025"/>
    <content type="html"><![CDATA[<h3>Feature</h3>
<p><a href="https://docs.cloud.google.com/api-registry/docs/overview">Cloud API Registry</a> is available
in the Google Cloud console in
<a href="https://cloud.google.com/products#product-launch-stages">Preview</a>. Use
Cloud API Registry in the Google Cloud console to view and manage the MCP
servers and tools your agent has access to.</p>
<h3>Feature</h3>
<p><strong>Gemini 3 Flash</strong></p>
<p>Gemini 3 Flash is now available in public preview. This model is designed to
tackle the most challenging agentic problems with strong coding and
state-of-the-art reasoning capabilities, and is our best model for complex
multimodal understanding.</p>
<p>For more information, see <a href="https://docs.cloud.google.com/vertex-ai/generative-ai/models/gemini/3-flash">Gemini 3
Flash</a>.</p>
]]>
    </content>
  </entry>

  <entry>
    <title>December 16, 2025</title>
    <id>tag:google.com,2016:generative-ai-on-vertex-ai-release-notes#December_16_2025</id>
    <updated>2025-12-16T00:00:00-08:00</updated>
    <link rel="alternate" href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/release-notes#December_16_2025"/>
    <content type="html"><![CDATA[<h3>Change</h3>
<p>Updated pricing for <strong>Vertex AI Agent Engine</strong>:</p>
<ul>
<li><p>Pricing for Vertex AI Agent Engine Runtime was lowered.</p></li>
<li><p>Starting <strong>January 28, 2026</strong>, usage of Sessions, Memory Bank, and Code Execution
will be billed.</p></li>
</ul>
<p>For more information, see <a href="https://cloud.google.com/vertex-ai/pricing#vertex-ai-agent-engine">Pricing</a>.</p>
<h3>Announcement</h3>
<p><strong>Vertex AI Agent Engine</strong></p>
<p>Vertex AI Agent Engine <a href="https://docs.cloud.google.com/agent-builder/agent-engine/sessions/overview">Sessions</a>
and <a href="https://docs.cloud.google.com/agent-builder/agent-engine/memory-bank/overview">Memory Bank</a> are now
<a href="https://cloud.google.com/products#product-launch-stages">Generally Available</a>.</p>
<h3>Change</h3>
<p><strong>Vertex AI Agent Engine</strong></p>
<p>Vertex AI Agent Engine is now available in the following regions:</p>
<ul>
<li><code>europe-west6</code> (Zurich)</li>
<li><code>europe-west8</code> (Milan)</li>
<li><code>asia-east2</code> (Hong Kong)</li>
<li><code>asia-northeast3</code> (Seoul)</li>
<li><code>asia-southeast2</code> (Jakarta)</li>
<li><code>northamerica-northeast2</code> (Toronto)</li>
<li><code>southamerica-east1</code> (São Paulo)</li>
</ul>
<p>For more information, see <a href="https://docs.cloud.google.com/agent-builder/locations">Vertex AI Agent Builder locations</a>.</p>
<h3>Change</h3>
<p><strong>Virtual Try-On</strong></p>
<p>Our Virtual Try-On model, <code>virtual-try-on-preview-08-04</code>, is
improved. Latency is significantly reduced and quality is improved for shoes,
body shape preservation, and product fidelity.</p>
]]>
    </content>
  </entry>

  <entry>
    <title>December 12, 2025</title>
    <id>tag:google.com,2016:generative-ai-on-vertex-ai-release-notes#December_12_2025</id>
    <updated>2025-12-12T00:00:00-08:00</updated>
    <link rel="alternate" href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/release-notes#December_12_2025"/>
    <content type="html"><![CDATA[<h3>Feature</h3>
<p><strong>Gemini 2.5 Flash with Gemini Live API Native Audio</strong></p>
<p><a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/models/gemini/2-5-flash-live-api">Gemini 2.5 Flash with Gemini Live API Native Audio</a> (<code>gemini-live-2.5-flash-native-audio</code>) is Generally Available (GA).
This model features cutting-edge native audio functionality for
<a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/live-api">Gemini Live API</a>, including enhanced voice quality and adaptability, Proactive Audio, and Affective Dialog.</p>
]]>
    </content>
  </entry>

  <entry>
    <title>December 10, 2025</title>
    <id>tag:google.com,2016:generative-ai-on-vertex-ai-release-notes#December_10_2025</id>
    <updated>2025-12-10T00:00:00-08:00</updated>
    <link rel="alternate" href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/release-notes#December_10_2025"/>
    <content type="html"><![CDATA[<h3>Feature</h3>
<p><strong>DeepSeek-V3.2</strong> is available in Model Garden.
DeepSeek-V3.2 is a state-of-the-art large language model from
DeepSeek.
DeepSeek-V3.2 is available as a managed API in Model Garden. To learn more, see
<a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/maas/deepseek/deepseek-v32">DeepSeek-V3.2</a>.</p>
]]>
    </content>
  </entry>

  <entry>
    <title>December 09, 2025</title>
    <id>tag:google.com,2016:generative-ai-on-vertex-ai-release-notes#December_09_2025</id>
    <updated>2025-12-09T00:00:00-08:00</updated>
    <link rel="alternate" href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/release-notes#December_09_2025"/>
    <content type="html"><![CDATA[<h3>Feature</h3>
<p>The following models are available through Model Garden:</p>
<ul>
<li><a href="https://console.cloud.google.com/vertex-ai/publishers/mistralai/model-garden/ministral-3">Ministral 3</a></li>
<li><a href="https://console.cloud.google.com/vertex-ai/publishers/mistralai/model-garden/mistral-large-3">Mistral Large 3</a></li></ul>
]]>
    </content>
  </entry>

  <entry>
    <title>December 08, 2025</title>
    <id>tag:google.com,2016:generative-ai-on-vertex-ai-release-notes#December_08_2025</id>
    <updated>2025-12-08T00:00:00-08:00</updated>
    <link rel="alternate" href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/release-notes#December_08_2025"/>
    <content type="html"><![CDATA[<h3>Feature</h3>
<p><strong>Veo 3.1 video extension</strong></p>
<p>Veo 3.1 supports video extension in Preview.</p>
<p>For more information, see the following:</p>
<ul>
<li><a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/video/extend-a-veo-video">Extend Veo
videos</a></li>
<li><a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/model-reference/veo-video-generation">Veo video generation API</a></li></ul>
]]>
    </content>
  </entry>

  <entry>
    <title>December 02, 2025</title>
    <id>tag:google.com,2016:generative-ai-on-vertex-ai-release-notes#December_02_2025</id>
    <updated>2025-12-02T00:00:00-08:00</updated>
    <link rel="alternate" href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/release-notes#December_02_2025"/>
    <content type="html"><![CDATA[<h3>Feature</h3>
<p>The Vertex AI Model Garden model co-hosting vLLM container is available to use with <a href="https://github.com/GoogleCloudPlatform/vertex-ai-samples/blob/main/notebooks/community/model_garden/model_garden_model_cohost.ipynb">this sample notebook</a>. You can use this container to serve multiple replicas of a model and to serve multiple models with dynamic loading and unloading. This lets you maximize resource utilization and serving efficiency, and flexibly adjust which models are served.</p>
<h3>Feature</h3>
<p>The following models are available through Model Garden:</p>
<ul>
<li><a href="https://console.cloud.google.com/vertex-ai/publishers/deepseek-ai/model-garden/deepseek-v3-2;publisherModelVersion=deepseek-v3-2">DeepSeek-V3.2</a></li>
<li><a href="https://console.cloud.google.com/vertex-ai/publishers/deepseek-ai/model-garden/deepseek-v3-2;publisherModelVersion=deepseek-v3-2-speciale">DeepSeek-V3.2-Speciale</a></li></ul>
]]>
    </content>
  </entry>

  <entry>
    <title>November 24, 2025</title>
    <id>tag:google.com,2016:generative-ai-on-vertex-ai-release-notes#November_24_2025</id>
    <updated>2025-11-24T00:00:00-08:00</updated>
    <link rel="alternate" href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/release-notes#November_24_2025"/>
    <content type="html"><![CDATA[<h3>Feature</h3>
<p><strong>Anthropic's Claude Opus 4.5</strong></p>
<p><a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/partner-models/claude/opus-4-5">Claude Opus 4.5</a>
is available in Model Garden.</p>
]]>
    </content>
  </entry>

  <entry>
    <title>November 17, 2025</title>
    <id>tag:google.com,2016:generative-ai-on-vertex-ai-release-notes#November_17_2025</id>
    <updated>2025-11-17T00:00:00-08:00</updated>
    <link rel="alternate" href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/release-notes#November_17_2025"/>
    <content type="html"><![CDATA[<h3>Feature</h3>
<p><strong>Veo video generation</strong></p>
<p>Veo 3.1 is Generally Available, and introduces the following models:</p>
<ul>
<li><a href="https://cloud.google.com/vertex-ai/generative-ai/docs/models/veo/3-1-generate#3.1-generate-001">Veo
3.1</a></li>
<li><a href="https://cloud.google.com/vertex-ai/generative-ai/docs/models/veo/3-1-generate#3.1-fast-generate-001">Veo 3.1
Fast</a></li>
</ul>
<p>For more information, see the following:</p>
<ul>
<li><a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/video/overview">Generate videos with Veo on Vertex AI</a></li>
<li><a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/video/generate-videos-from-text">Generate Veo videos from text prompts</a></li>
<li><a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/video/generate-videos-from-an-image">Generate Veo videos from an image</a></li>
<li><a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/video/generate-videos-from-first-and-last-frames">Generate Veo videos using first and last frames</a></li>
<li><a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/model-reference/veo-video-generation">Veo video generation API</a></li></ul>
<h3>Announcement</h3>
<p><strong>LearnLM in Gemini</strong></p>
<p>The LearnLM model is no longer a separate offering or listing on AI Studio as
<a href="https://blog.google/outreach-initiatives/education/google-gemini-learnlm-update/">LearnLM capabilities have been integrated into the latest Gemini models (starting with Gemini 2.5)</a>.</p>
<p>Built in collaboration with experts in education,
<a href="https://cloud.google.com/solutions/learnlm">LearnLM</a> represents our
capabilities fine-tuned for learning and informed by rigorous research. These
advancements and improvements are available directly in Gemini, enhancing
educational experiences and applications.</p>
<p>Existing <code>learnlm-2.0-flash-experimental</code> projects will stop working
after December 3, 2025, unless you manually select an alternative model. We
encourage developers to switch to the latest Gemini models and optimize their
prompts by reviewing our
<a href="https://services.google.com/fh/files/misc/learnlm_prompt_guide.pdf">LearnLM Partner Prompt Guide</a>.</p>
]]>
    </content>
  </entry>

  <entry>
    <title>November 13, 2025</title>
    <id>tag:google.com,2016:generative-ai-on-vertex-ai-release-notes#November_13_2025</id>
    <updated>2025-11-13T00:00:00-08:00</updated>
    <link rel="alternate" href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/release-notes#November_13_2025"/>
    <content type="html"><![CDATA[<h3>Feature</h3>
<p><strong>Kimi K2 Thinking</strong> is available in Model Garden. This model is
a thinking model that excels at complex problem-solving and deep reasoning.
Kimi K2 Thinking is available as a managed API in Model Garden. To learn more, see
<a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/maas/kimi/kimi-k2-thinking">Kimi K2 Thinking</a>.</p>
<h3>Feature</h3>
<p><strong>Updated Prompt Caching for Anthropic Claude Models</strong></p>
<p>Prompt caching for Anthropic Claude models now supports a one-hour Time To Live (TTL).</p>
<p>For more information, see <a href="https://docs.cloud.google.com/vertex-ai/generative-ai/docs/partner-models/claude/prompt-caching">Prompt caching</a>.</p>
]]>
    </content>
  </entry>

</feed>
