Google DeepMind releases the Gemma 4 12B model and quantization-aware training weights for laptops

08.06.2026, 11:16, by News-Redaktion

Google DeepMind has expanded the Gemma 4 family with a 12-billion-parameter model and a set of quantization-aware training weights intended to make local AI deployment easier. The 12B model is designed to run on standard consumer hardware and needs only 16 GB of memory to run local agents on a laptop.
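To put the 16 GB figure in context, a rough back-of-the-envelope estimate of weight memory at different precisions can help. The sketch below assumes a nominal 12 billion parameters and counts the weights only; it ignores the KV cache, activations and runtime overhead. The bit widths are illustrative assumptions, since the announcement does not say which precision the 16 GB figure refers to.

```python
def weight_memory_gib(num_params: float, bits_per_weight: float) -> float:
    """Approximate memory needed for the weights alone, in GiB."""
    bytes_total = num_params * bits_per_weight / 8
    return bytes_total / 2**30


# Nominal parameter count (assumption); the exact count may differ.
PARAMS_12B = 12e9

# Illustrative precisions: 16-bit floats, 8-bit and 4-bit integers.
for bits in (16, 8, 4):
    gib = weight_memory_gib(PARAMS_12B, bits)
    print(f"{bits:>2}-bit weights: {gib:.1f} GiB")
```

Under these assumptions, 16-bit weights alone would exceed 16 GB, while 8-bit and 4-bit weights would fit, which is one reason quantized weights matter for laptop deployment.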

Google DeepMind Gemma4 (Image © Google)

The 12B model is markedly more efficient: its benchmark performance nearly matches that of the larger 26B model. This makes it possible to run complex multi-step reasoning and agentic workflows locally, without extensive cloud computing resources.

Google DeepMind Gemma4 Benchmark (Image © Google)

To further improve accessibility and speed, Google DeepMind has released quantization-aware training (QAT) weights for the entire Gemma 4 family. Whereas conventional quantization applied after training often costs model accuracy, QAT incorporates the quantization process directly into the training phase. This approach reduces memory requirements and speeds up token generation while keeping output quality close to that of the original weights.
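The core idea of building quantization into training can be illustrated with a "fake quantization" step, in which weights are rounded to a small integer grid and mapped back to floats so that the model already sees the rounding error during training. The following is a generic sketch of that step, not Google DeepMind's actual recipe, which the announcement does not describe; the function name `fake_quantize`, the symmetric per-tensor scaling and the 4-bit default are all assumptions made for illustration.

```python
def fake_quantize(weights, bits=4):
    """Round weights to a signed integer grid and map them back to floats."""
    qmax = 2 ** (bits - 1) - 1                 # largest positive level, e.g. 7 for 4-bit
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / qmax if max_abs > 0 else 1.0
    result = []
    for w in weights:
        q = round(w / scale)                   # quantize to the nearest integer level
        q = max(-qmax - 1, min(qmax, q))       # clamp to the representable range
        result.append(q * scale)               # dequantize back to a float
    return result


original = [0.7, -0.33, 0.12, 0.0]
quantized = fake_quantize(original, bits=4)
for w, q in zip(original, quantized):
    print(f"{w:+.2f} -> {q:+.2f}")
```

In a typical QAT setup, the forward pass uses weights rounded like this, while the gradient updates are applied to a full-precision copy, so the network learns weights that still work well after rounding.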

Google DeepMind Models (Image © Google)

These optimizations provide broader hardware compatibility, with performance improvements seen on chips from NVIDIA, AMD, Intel, Qualcomm and Apple. The QAT weights are currently available for a wide range of model sizes, including E2B, E4B, 12B, 26B and 31B versions.

Integration of the new model and weights has been optimized for use with [Ollama][1]. The 12B model can be used in developer tools and applications such as Claude Code, Codex App, Hermes Agent and OpenClaw, as well as for general chat.

[1]: https://www.pcmasters.de/server/133714724-ai-chatbot-hosten-auf-eigenem-server-auf-ubuntu-debian-mit-ollama-und-open-webui.html
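For readers who want to try a locally served model from their own scripts, the sketch below sends a prompt to Ollama's local HTTP API using only the Python standard library. It assumes Ollama is running on its default port 11434. The model tag `gemma4:12b` is a hypothetical placeholder, since the announcement does not give the actual tag; `ollama list` shows the names installed on a given machine.

```python
import json
import urllib.request

OLLAMA_URL = "http://localhost:11434/api/generate"  # Ollama's default local endpoint
MODEL_TAG = "gemma4:12b"  # hypothetical tag (assumption); replace with the real one


def build_request(prompt, model=MODEL_TAG, url=OLLAMA_URL):
    """Build a non-streaming generate request for a local Ollama server."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(prompt, model=MODEL_TAG):
    """Send the prompt and return the generated text (requires a running server)."""
    with urllib.request.urlopen(build_request(prompt, model)) as resp:
        return json.loads(resp.read())["response"]
```

With the server running and the model pulled, a call such as `generate("Explain quantization in one sentence.")` returns the model's answer as a string.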

