Tekmono
  • News
  • Guides
  • Lists
  • Reviews
  • Deals
No Result
View All Result
Tekmono
No Result
View All Result
Home News
Google Unveils Gemini 3.1 Flash-Lite AI Model

Google Unveils Gemini 3.1 Flash-Lite AI Model

by Tekmono Editorial Team
04/03/2026
in News
Share on FacebookShare on Twitter

Google has unveiled Gemini 3.1 Flash-Lite, its latest and most affordable Gemini 3 model, designed to handle high-volume developer workloads and data processing tasks at a competitive price.

The new model is priced at $0.25 per million input tokens and $1.50 per million output tokens, making it an attractive option for developers who require high-performance capabilities without incurring significant costs. Gemini 3.1 Flash-Lite is available in preview mode via the Gemini API in Google AI Studio and Vertex AI, although it is not included in the Gemini consumer app.

Compared to its predecessor, Gemini 2.5 Flash-Lite, the new version is more expensive but offers significantly enhanced capabilities. Despite the higher cost, Gemini 3.1 Flash-Lite generally outperforms Gemini 2.5 Flash at a lower price point, making it a more cost-effective solution for certain applications.

Related Reads

Microsoft enhances Copilot with multimodal features, introduces new $99 tier

Apple celebrates 50th anniversary amid scrutiny over privacy practices

Huawei launches Converged Development Engine for HarmonyOS PCs

Salesforce unveils updated Slack with 30 new AI features

In terms of performance, Gemini 3.1 Flash-Lite outshines competitors such as GPT-5 mini and Claude 4.5 Haiku. Although Grok 4.1 Fast is more affordable, Gemini 3.1 Flash-Lite boasts faster processing speeds, promising up to 363 tokens per second. On multimodal benchmarks, the model achieved an impressive 1432 Elo points on the Arena.ai Leaderboard, placing it among open-weight models and last-generation commercial offerings.

Google has emphasized that Gemini 3.1 Flash-Lite is designed for high-volume tasks and data processing, rather than managing fleets of agents, and as such, the company did not publish agent benchmarks for the release. However, developers can utilize the API to adjust the model’s reasoning time to control costs, with lower reasoning settings producing fewer tokens, which is particularly relevant for high-volume workloads.

This marks the first Flash-Lite version for Gemini 3.1, deviating from Google’s traditional approach of launching more capable Flash versions first or skipping Flash-Lite entirely, as seen with Gemini 3. The launch of Gemini 3.1 Flash-Lite follows the release of Gemini 3.1 Pro two weeks prior, with Google describing Flash-Lite as tailored for high-volume developer workloads at scale.

ShareTweet

You Might Be Interested

Microsoft enhances Copilot with multimodal features, introduces new  tier
News

Microsoft enhances Copilot with multimodal features, introduces new $99 tier

02/04/2026
News

Apple celebrates 50th anniversary amid scrutiny over privacy practices

02/04/2026
News

Huawei launches Converged Development Engine for HarmonyOS PCs

02/04/2026
Salesforce unveils updated Slack with 30 new AI features
News

Salesforce unveils updated Slack with 30 new AI features

02/04/2026
Please login to join discussion

Recent Posts

  • Microsoft enhances Copilot with multimodal features, introduces new $99 tier
  • Apple celebrates 50th anniversary amid scrutiny over privacy practices
  • Huawei launches Converged Development Engine for HarmonyOS PCs
  • Salesforce unveils updated Slack with 30 new AI features
  • Meta announces release of second generation smart glasses starting April 14

Recent Comments

No comments to show.
  • News
  • Guides
  • Lists
  • Reviews
  • Deals
Tekmono is a Linkmedya brand. © 2015.

No Result
View All Result
  • News
  • Guides
  • Lists
  • Reviews
  • Deals