
artificial-intelligence
developer-tools
Google Just Made Gemma 4 Up to 3x Faster — Without Touching the Model
Google's new Multi-Token Prediction (MTP) drafters pair a lightweight draft model with Gemma 4. The drafter predicts several tokens ahead and the big model verifies them all in a single pass, delivering up to a 3x speedup with no loss in output quality.
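To make the draft-then-verify idea concrete, here is a minimal sketch of greedy speculative decoding using toy stand-in models. It is not Google's implementation: `target_model`, `draft_model`, and `speculative_decode` are hypothetical names, the "models" are simple arithmetic functions over integer tokens, and the exact-match acceptance rule is a simplification of what production systems do. The point it illustrates is why quality is preserved: every emitted token is one the big model itself would have chosen.

```python
from typing import Callable, List, Tuple

Token = int
Model = Callable[[List[Token]], Token]  # maps a context to its greedy next token


def target_model(ctx: List[Token]) -> Token:
    """Toy stand-in for the big model (assumption: not a real LLM)."""
    if len(ctx) >= 2:
        return (ctx[-1] * 3 + ctx[-2] + 1) % 11
    return (ctx[-1] + 1) % 11


def draft_model(ctx: List[Token]) -> Token:
    """Toy stand-in for the drafter: usually agrees with the target, sometimes not."""
    guess = target_model(ctx)
    return (guess + 1) % 11 if ctx[-1] % 4 == 0 else guess


def greedy_decode(model: Model, prompt: List[Token], n: int) -> List[Token]:
    """Baseline: one model call per generated token."""
    out = list(prompt)
    for _ in range(n):
        out.append(model(out))
    return out


def speculative_decode(
    target: Model, draft: Model, prompt: List[Token], n: int, k: int = 4
) -> Tuple[List[Token], int]:
    """Generate n tokens; return (tokens, number of target verification passes)."""
    out = list(prompt)
    goal = len(prompt) + n
    passes = 0
    while len(out) < goal:
        # 1. The cheap drafter proposes k tokens, one after another.
        proposal: List[Token] = []
        for _ in range(k):
            proposal.append(draft(out + proposal))

        # 2. The target scores every drafted position. A real model would do
        #    this in one batched forward pass; here it is simulated with a loop.
        passes += 1
        verified = [target(out + proposal[:i]) for i in range(k + 1)]

        # 3. Accept the longest prefix on which drafter and target agree.
        accepted = 0
        while accepted < k and proposal[accepted] == verified[accepted]:
            accepted += 1

        # 4. Keep the accepted tokens plus the target's own token at the first
        #    mismatch (or a bonus token if everything was accepted).
        out.extend(proposal[:accepted])
        out.append(verified[accepted])
    return out[:goal], passes


if __name__ == "__main__":
    prompt = [1, 2]
    baseline = greedy_decode(target_model, prompt, 20)
    tokens, passes = speculative_decode(target_model, draft_model, prompt, 20)
    print("identical output:", tokens == baseline)
    print("target passes:", passes, "vs 20 for plain decoding")
```

In this sketch the speedup comes entirely from how often the drafter agrees with the target: a perfect drafter yields `k + 1` tokens per verification pass, while a drafter that is always wrong yields just one, which is no worse than ordinary decoding in terms of target passes.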