
{"id":9001,"date":"2026-06-10T06:13:12","date_gmt":"2026-06-09T22:13:12","guid":{"rendered":"https:\/\/infernews.com\/blog\/?p=9001"},"modified":"2026-06-10T06:13:14","modified_gmt":"2026-06-09T22:13:14","slug":"gemma-4-12b-qat-%e9%87%8f%e5%8c%96%e6%84%9f%e7%9f%a5%e8%a8%93%e7%b7%b4%ef%bc%89","status":"publish","type":"post","link":"https:\/\/infernews.com\/blog\/gemma-4-12b-qat-%e9%87%8f%e5%8c%96%e6%84%9f%e7%9f%a5%e8%a8%93%e7%b7%b4%ef%bc%89\/","title":{"rendered":"Gemma 4 12B (QAT \u91cf\u5316\u611f\u77e5\u8a13\u7df4\uff09"},"content":{"rendered":"\n<p class=\"wp-block-paragraph\">Gemma 4 12B \u9019\u6ce2\u67d0\u7a0b\u5ea6\u4e0a\u7b97\u662f\u5728\u56de\u61c9\u90a3\u500b\u300c\u624b\u6a5f\u7d1a\u6a21\u578b\u548c\u5927\u6a21\u578b\u4e4b\u9593\uff0c\u6703\u88dc\u4e00\u500b\u4e2d\u968e\u6a21\u578b\u300d\u7684\u50b3\u805e\u3002\u4e0d\u904e\u771f\u6b63\u8b93\u4eba\u773c\u775b\u4e00\u4eae\u7684\uff0c\u9084\u662f QAT (Quantization Aware Training\uff0c\u91cf\u5316\u611f\u77e5\u8a13\u7df4\uff09\u771f\u7684\u505a\u4e0a\u4f86\u4e86\u3002\u518d\u52a0\u4e0a\u73fe\u5728\u4e5f\u652f\u63f4 MTP\uff0cGemma 4 \u9019\u4ee3\u5728\u672c\u5730\u6a21\u578b\u7684\u80fd\u529b\u548c\u6548\u80fd\u4e0a\uff0c\u6574\u9ad4\u90fd\u5f80\u524d\u63a8\u4e86\u4e0d\u5c11\u3002\u7e3d\u7b97\u770b\u5230\u9664\u4e86 Qwen \u4e4b\u5916\uff0c\u5176\u4ed6\u5be6\u9a57\u5ba4\u958b\u59cb\u6253\u51fa\u50cf\u6a23\u7684\u7af6\u722d\u4e86\uff1b\u6700\u8fd1\u7684 local AI\uff0c\u771f\u7684\u5f88\u50cf\u4e00\u76f4\u90fd\u662f Qwen \u5728 carry\u3002<\/p>\n\n\n<figure class=\"wp-block-embed-youtube wp-block-embed is-type-video is-provider-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio\"><div class=\"lyte-wrapper\" title=\"Google Just Found a Loophole in AI Hardware Limitations\" style=\"width:853px;max-width:100%;margin:5px auto;\"><div class=\"lyMe\" id=\"WYL_DTUNF9weRls\" itemprop=\"video\" itemscope itemtype=\"https:\/\/schema.org\/VideoObject\"><div><meta itemprop=\"thumbnailUrl\" content=\"https:\/\/infernews.com\/blog\/wp-content\/plugins\/wp-youtube-lyte\/lyteCache.php?origThumbUrl=https%3A%2F%2Fi.ytimg.com%2Fvi%2FDTUNF9weRls%2Fhqdefault.jpg\" \/><meta itemprop=\"embedURL\" content=\"https:\/\/www.youtube.com\/embed\/DTUNF9weRls\" \/><meta itemprop=\"duration\" content=\"PT19M30S\" \/><meta itemprop=\"uploadDate\" content=\"2026-06-09T18:00:09Z\" \/><\/div><div id=\"lyte_DTUNF9weRls\" data-src=\"https:\/\/infernews.com\/blog\/wp-content\/plugins\/wp-youtube-lyte\/lyteCache.php?origThumbUrl=https%3A%2F%2Fi.ytimg.com%2Fvi%2FDTUNF9weRls%2Fhqdefault.jpg\" class=\"pL\"><div class=\"tC\"><div class=\"tT\" itemprop=\"name\">Google Just Found a Loophole in AI Hardware Limitations<\/div><\/div><div class=\"play\"><\/div><div class=\"ctrl\"><div class=\"Lctrl\"><\/div><div class=\"Rctrl\"><\/div><\/div><\/div><noscript><a href=\"https:\/\/youtu.be\/DTUNF9weRls\" rel=\"nofollow noopener\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/infernews.com\/blog\/wp-content\/plugins\/wp-youtube-lyte\/lyteCache.php?origThumbUrl=https%3A%2F%2Fi.ytimg.com%2Fvi%2FDTUNF9weRls%2F0.jpg\" alt=\"Google Just Found a Loophole in AI Hardware Limitations\" width=\"853\" height=\"460\" \/><br \/>Watch this video on YouTube<\/a><\/noscript><meta itemprop=\"description\" content=\"Gemma 4 12B answers the rumor about a new intermediate model between their mobile (E2B, E4B) and more hardware heavy models (26B MoE, 31B) but really stepped up the game with QAT (Quantization Aware Training). This is *on top* of the MTP (Multi-Token Processing) support for these models! Gemma 4 is a serious step in capability and performance for local models across the board. Nice to see at least some level of competition from other labs since Qwen has been backpacking the entire industry for local Ai recently! *Links* : AnythingLLM: https:\/\/anythingllm.com\/ AnythingLLM GitHub: https:\/\/github.com\/Mintplex-Labs\/anything-llm Gemma 12B: https:\/\/huggingface.co\/google\/gemma-4-12B Gemma 12B QAT GGUF: https:\/\/huggingface.co\/unsloth\/gemma-4-12B-it-qat-GGUF *Chapters* : 0:00 Let&#039;s Talk About Gemma 4 12B 0:34 Brief History of Gemma 4 3:06 Gemma 12B is a welcome addition 6:59 Qwen3.5 or Gemma 12B 8:18 What is QAT (Quantization Aware Training) 10:24 QAT is NOT exactly Bitnet, but it is close 11:35 Testing Gemma 12B in AnythingLLM 17:05 Final Thoughts: Gemma 12B is 100% worth a look\"><\/div><\/div><div class=\"lL\" style=\"max-width:100%;width:853px;margin:5px auto;\"><\/div><figcaption><\/figcaption><\/figure>","protected":false},"excerpt":{"rendered":"<p>Gemma 4 12B \u9019\u6ce2\u67d0\u7a0b\u5ea6\u4e0a\u7b97\u662f\u5728\u56de\u61c9\u90a3\u500b\u300c\u624b\u6a5f\u7d1a\u6a21\u578b\u548c\u5927\u6a21\u578b\u4e4b\u9593\uff0c\u6703\u88dc\u4e00\u500b\u4e2d\u968e\u6a21\u578b\u300d\u7684\u50b3\u805e\u3002\u4e0d\u904e\u771f\u6b63\u8b93\u4eba\u773c\u775b\u4e00\u4eae\u7684\uff0c\u9084\u662f QAT (Quantization Aware Training\uff0c\u91cf\u5316\u611f\u77e5\u8a13\u7df4\uff09\u771f\u7684\u505a\u4e0a\u4f86\u4e86\u3002\u518d\u52a0\u4e0a\u73fe\u5728\u4e5f\u652f\u63f4 MTP\uff0cGemma 4 \u9019\u4ee3\u5728\u672c\u5730\u6a21\u578b\u7684\u80fd\u529b\u548c\u6548\u80fd\u4e0a\uff0c\u6574\u9ad4\u90fd\u5f80\u524d\u63a8\u4e86\u4e0d\u5c11\u3002\u7e3d\u7b97\u770b\u5230\u9664\u4e86 Qwen \u4e4b\u5916\uff0c\u5176\u4ed6\u5be6\u9a57\u5ba4\u958b\u59cb\u6253\u51fa\u50cf\u6a23\u7684\u7af6\u722d\u4e86\uff1b\u6700\u8fd1\u7684 local AI\uff0c\u771f\u7684\u5f88\u50cf\u4e00\u76f4\u90fd\u662f Qwen \u5728 carry\u3002<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"ai_generated_summary":"","footnotes":""},"categories":[76],"tags":[],"class_list":["post-9001","post","type-post","status-publish","format-standard","hentry","category-76"],"_links":{"self":[{"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/posts\/9001","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/comments?post=9001"}],"version-history":[{"count":1,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/posts\/9001\/revisions"}],"predecessor-version":[{"id":9002,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/posts\/9001\/revisions\/9002"}],"wp:attachment":[{"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/media?parent=9001"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/categories?post=9001"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/tags?post=9001"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}