
{"id":4159,"date":"2024-12-31T07:20:37","date_gmt":"2024-12-30T23:20:37","guid":{"rendered":"https:\/\/infernews.com\/?p=4159"},"modified":"2025-02-11T00:06:08","modified_gmt":"2025-02-10T16:06:08","slug":"cag-rag-%e7%9a%84%e6%9b%bf%e4%bb%a3%e6%96%b9%e6%a1%88","status":"publish","type":"post","link":"https:\/\/infernews.com\/blog\/cag-rag-%e7%9a%84%e6%9b%bf%e4%bb%a3%e6%96%b9%e6%a1%88\/","title":{"rendered":"CAG &#8211; An Alternative to RAG"},"content":{"rendered":"\n<p>CAG (Cache-Augmented Generation) challenges the widely used Retrieval-Augmented Generation (RAG) approach. RAG augments a large language model (LLM) by retrieving from an external knowledge base at query time, but it suffers from retrieval latency, retrieval errors, and high system complexity. CAG instead relies on an LLM with a long context window: all relevant resources are preloaded into the model's context and the key-value cache (KV cache) is precomputed, so questions can be answered directly at inference time with no real-time retrieval. Experiments comparing CAG and RAG on two question-answering datasets, SQuAD and HotPotQA, show that when the knowledge base is of limited size, CAG outperforms RAG in both efficiency and accuracy, and is substantially faster on long texts in particular. For specific applications, particularly when the knowledge base is of a manageable size, CAG 
offers a simpler, more efficient, and more accurate alternative.<\/p>\n\n\n<figure class=\"wp-block-embed-youtube wp-block-embed is-type-video is-provider-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio\"><div class=\"lyte-wrapper\" title=\"Goodbye RAG - Smarter CAG w\/ KV Cache Optimization\" style=\"width:853px;max-width:100%;margin:5px auto;\"><div class=\"lyMe\" id=\"WYL_NaEf_uiFX6o\" itemprop=\"video\" itemscope itemtype=\"https:\/\/schema.org\/VideoObject\"><div><meta itemprop=\"thumbnailUrl\" content=\"https:\/\/infernews.com\/blog\/wp-content\/plugins\/wp-youtube-lyte\/lyteCache.php?origThumbUrl=https%3A%2F%2Fi.ytimg.com%2Fvi%2FNaEf_uiFX6o%2Fhqdefault.jpg\" \/><meta itemprop=\"embedURL\" content=\"https:\/\/www.youtube.com\/embed\/NaEf_uiFX6o\" \/><meta itemprop=\"duration\" content=\"PT26M19S\" \/><meta itemprop=\"uploadDate\" content=\"2024-12-30T15:00:32Z\" \/><\/div><div id=\"lyte_NaEf_uiFX6o\" data-src=\"https:\/\/infernews.com\/blog\/wp-content\/plugins\/wp-youtube-lyte\/lyteCache.php?origThumbUrl=https%3A%2F%2Fi.ytimg.com%2Fvi%2FNaEf_uiFX6o%2Fhqdefault.jpg\" class=\"pL\"><div class=\"tC\"><div class=\"tT\" itemprop=\"name\">Goodbye RAG - Smarter CAG w\/ KV Cache Optimization<\/div><\/div><div class=\"play\"><\/div><div class=\"ctrl\"><div class=\"Lctrl\"><\/div><div class=\"Rctrl\"><\/div><\/div><\/div><noscript><a href=\"https:\/\/youtu.be\/NaEf_uiFX6o\" rel=\"nofollow\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/infernews.com\/blog\/wp-content\/plugins\/wp-youtube-lyte\/lyteCache.php?origThumbUrl=https%3A%2F%2Fi.ytimg.com%2Fvi%2FNaEf_uiFX6o%2F0.jpg\" alt=\"Goodbye RAG - Smarter CAG w\/ KV Cache Optimization\" width=\"853\" height=\"460\" \/><br \/>Watch this video on YouTube<\/a><\/noscript><meta itemprop=\"description\" content=\"Unleash the future of AI with Cache-Augmented Generation (CAG)! 
Say goodbye to RAG retrieval delays and RAG errors - CAG preloads knowledge directly into large language models, delivering lightning-fast, accurate responses. CAG is the better RAG. Experience a streamlined architecture that outperforms traditional RAG methods. All rights w\/ authors: Don\u2019t Do RAG: When Cache-Augmented Generation is All You Need for Knowledge Tasks Brian J Chan, Chao-Ting Chen, Jui-Hung Cheng and Hen-Hsen Huang National Chengchi University Taipei, Taiwan and Academia Sinica Taipei, Taiwan #education #informationtechnology #newtechnology #ai #aiagents\"><\/div><\/div><div class=\"lL\" style=\"max-width:100%;width:853px;margin:5px auto;\"><\/div><figcaption><\/figcaption><\/figure>","protected":false},"excerpt":{"rendered":"<p>CAG (Cache-Augmented Generation) challenges the widely used Retrieval-Augmented Generation (RAG) approach. RAG augments a large language model (LLM) by retrieving from an external knowledge base at query time, but it suffers from retrieval latency, retrieval errors, and high system complexity. CAG instead relies on an LLM with a long context window: all relevant resources are preloaded into the model's context and the key-value cache (KV cache) is precomputed, so questions can be answered directly at inference time with no real-time retrieval. Experiments comparing CAG and RAG on two question-answering datasets, SQuAD and HotPotQA, show that when the knowledge base is of limited size, CAG outperforms 
RAG in both efficiency and accuracy, and is substantially faster on long texts in particular. For specific applications, particularly when the knowledge base is of a manageable size, CAG offers a simpler, more efficient, and more accurate alternative.<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"googlesitekit_rrm_CAowvqSiDA:productID":"","footnotes":""},"categories":[109],"tags":[],"class_list":["post-4159","post","type-post","status-publish","format-standard","hentry","category-rag"],"_links":{"self":[{"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/posts\/4159","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/comments?post=4159"}],"version-history":[{"count":0,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/posts\/4159\/revisions"}],"wp:attachment":[{"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/media?parent=4159"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/categories?post=4159"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/tags?post=4159"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}