Google just made their Gemma 4 AI models three times faster by teaching them to guess what they’ll say next. The trick works without making the AI any less accurate.
Most AI systems work like careful typists, thinking about one word at a time before moving on. Google’s upgrade lets their AI peek into the future and predict several words it will probably use, then write them all at once. It’s like knowing the end of your sentence before you finish saying it.
The Crystal Ball Method
This speed boost comes from a technique called “speculative decoding.” Typically, a small, fast “draft” model guesses several words ahead, and the main model then checks all of those guesses in a single pass. Correct guesses are kept for free, so everything moves much faster. A wrong guess is simply replaced with the word the main model would have written anyway, which is why the output quality doesn’t change.
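To make the draft-then-verify idea concrete, here’s a toy sketch in Python. This is not Google’s implementation: the two dictionary “models” are stand-ins for a small draft model and a large target model, and the function name `speculative_step` is made up for illustration. In a real system the verification of all the drafted words happens in one parallel pass through the big model, which is where the speedup comes from.

```python
# Toy stand-ins for the two models. Each maps the current word to the
# next word it would generate. In real speculative decoding, DRAFT is a
# small fast model and TARGET is the large accurate one.
DRAFT = {"the": "quick", "quick": "brown", "brown": "fox", "fox": "jumps"}
TARGET = {"the": "quick", "quick": "brown", "brown": "dog", "dog": "runs"}

def speculative_step(context, k=3):
    """Draft k words cheaply, then verify them against the target model.

    Accepted words are kept; at the first mismatch we fall back to the
    target model's own word, so the final output always matches what the
    target model alone would have produced.
    """
    # 1) Draft phase: the small model guesses up to k words ahead.
    drafts, cur = [], context
    for _ in range(k):
        nxt = DRAFT.get(cur)
        if nxt is None:
            break
        drafts.append(nxt)
        cur = nxt

    # 2) Verify phase: the big model checks each guess. (In practice,
    #    all k checks happen in a single parallel forward pass.)
    accepted, cur = [], context
    for guess in drafts:
        truth = TARGET.get(cur)
        if truth == guess:
            accepted.append(guess)  # right guess: kept essentially for free
            cur = guess
        else:
            if truth is not None:
                accepted.append(truth)  # wrong guess: use the target's word
            break  # stop accepting after the first mismatch
    return accepted

print(speculative_step("the"))  # → ['quick', 'brown', 'dog']
```

Here the draft model correctly guesses “quick” and “brown,” so those come out in one step instead of two, but its third guess (“fox”) is rejected and replaced with the target model’s “dog.” That fallback is the whole trick: guesses only ever add speed, never errors.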
Google says this works especially well for longer responses where the AI has more patterns to predict from. The company tested this on their Gemma 4 models, which compete with ChatGPT and other popular AI assistants.
The breakthrough matters because speed is one of the biggest complaints people have about AI chatbots. Nobody wants to wait 30 seconds for an AI to write a simple email or answer a basic question.
What’s Next
Google plans to roll this faster processing out to more of its AI products soon. Other AI companies are probably scrambling to build similar features, since users always prefer the fastest option. Expect AI responses to get much snappier across the board this year.