Gemini 3.1 Flash-Lite is a Google Cloud Gemini model announced as generally available on the Gemini Enterprise Agent Platform. The product is positioned as the fastest and most cost-efficient Gemini 3 series model in the announcement, with a focus on ultra-low latency, high-volume tasks, and production use cases that need predictable speed and cost control.
The blog says Flash-Lite is intended for agentic workflows such as tool calling and orchestration, as well as automated pipelines that need to run at scale. It is shown in practical use across software development, customer support, creative media workflows, gaming, financial services, and data operations, while the pricing page confirms per-token pricing for input and output across multiple serving modes.