What you need to knowGoogle revealed Gemini 3.1 Flash-Lite, it’s newest model that’s capable of assisting developers with complex, “high” workloads.Seeing as it aims to handle higher data workloads, Google touts this AI model as its cheapest and speediest AI yet, nestled in its Gemini 3 series.Google is seemingly positioning 3.1 Flash-Lite as the next best thing, as the AI seeks to overtake what was started by 2.5 Flash last year.

Google’s not slowing down its development process for next-gen AI; however, what it’s rolling out this week is yet another lightweight, speedy model.

In a Keyword post, Google shared details about its newest lightweight model: Gemini 3.1 Flash-Lite for developers. Out of the gate, the company touts 3.1 Flash-Lite as the premier AI model for developers with “high-volume workloads.” Similar to previous highly efficient, low-cost AI models from Google, Gemini 3.1 Flash-Lite offers its services at $0.25/1M input tokens and $1.50/1M output tokens.

called back to its 2.5 Flash model quite often in its announcement. It’s a model that debuted last spring with “hybrid reasoning” and high speeds, while maintaining its accuracy. This model holds a few similarities to what Google’s come with today. Low-latency performance, alongside cheaper costs, and speed for the developer’s needs. However, 3.1 Flash-Lite takes that and raises it severely by taking over that complex, high-workload space for users.

In short, this is likely the model Google is hoping developers will reach for next time they need to work with a lot of data. Gemini 3 Flash arrived in December, but this model was positioned as more of the “for everybody” lightweight model. The company brought this to developers through the usual channels and to every user through AI Mode in Search.