BREAKING
Cerebras Runs Gemma 4 31B at 1,851 t/s
0t/s
output speed
0x
vs GPU
0s
first token
Tokens per Second vs Typical GPU
Cerebras1851
Typical GPU50
Vision Agent Loop Now Practical
1Image input
2Reasoning
3Tool calls
4Verify & retry
Gemma 4 vs Claude Haiku
Gemma 4 31BCerebras
Intelligence index 29
18x faster than Haiku
Apache 2.0 license
Claude Haiku
Intelligence index 30
Comparable quality
First Multimodal Model on Cerebras
AI NEWS BLITZ
Cerebras just launched Google's Gemma 4 31B at over eighteen hundred tokens per second.