Gemma 3 4B IT

Small 4B Gemma 3 model for efficient inference.

gemma-3-4b-it
STABLEModel DeactivatedGet StartedView uptime
1,000,000 context
Starting at $0.07/M input tokens
Starting at $0.30/M output tokens
Streaming

Select Provider

All Providers for Gemma 3 4B IT

LLM Gateway routes requests to the best providers that are able to handle your prompt size and parameters.

Google AI Studio
Context: 1M
Deactivated since Apr 30, 2026
Input
$0.075
/M tokens
Cached
/M tokens
Output
$0.3
/M tokens
Get Started