
{"id":4474,"date":"2025-02-06T18:53:56","date_gmt":"2025-02-06T10:53:56","guid":{"rendered":"https:\/\/infernews.com\/?p=4474"},"modified":"2025-02-07T02:02:58","modified_gmt":"2025-02-06T18:02:58","slug":"deepseek-r1-%e7%9a%84%e5%86%b7%e5%95%9f%e5%8b%95%e5%be%ae%e8%aa%bf","status":"publish","type":"post","link":"https:\/\/infernews.com\/blog\/deepseek-r1-%e7%9a%84%e5%86%b7%e5%95%9f%e5%8b%95%e5%be%ae%e8%aa%bf\/","title":{"rendered":"DeepSeek R1 \u7684\u51b7\u555f\u52d5 1.5b \u5fae\u8abf"},"content":{"rendered":"\n<p>\u5f71\u7247\u4e3b\u8981\u8b1b\u89e3\u4e86\u5982\u4f55\u4f7f\u7528\u51b7\u555f\u52d5\u6280\u8853\u4f86\u63d0\u5347\u5c0f\u578b\u8a9e\u8a00\u6a21\u578b\uff08LLM\uff09\u7684\u63a8\u7406\u80fd\u529b\uff0c\u7279\u5225\u662f\u5728\u6578\u5b78\u554f\u984c\u4e0a\u7684\u8868\u73fe\u3002\u5f71\u7247\u7684\u6838\u5fc3\u5728\u65bc\u91cd\u73fe DeepSeek R1 \u6a21\u578b\u8ad6\u6587\u4e2d\u63d0\u5230\u7684\u51b7\u555f\u52d5\u65b9\u6cd5\uff0c\u5373\u900f\u904e\u5c11\u91cf\u9ad8\u54c1\u8cea\u7684\u5408\u6210\u6578\u64da\u96c6\uff0c\u8b93\u6a21\u578b\u5728\u5f37\u5316\u5b78\u7fd2\u524d\u5c31\u80fd\u5920\u751f\u6210\u6e05\u6670\u4e14\u9023\u8cab\u7684\u601d\u8003\u93c8\u3002\u9019\u4e9b\u6578\u64da\u96c6\u5229\u7528\u6578\u5b78\u7de8\u8b6f\u5668\u4f86\u7522\u751f\u7cbe\u78ba\u7684\u6b65\u9a5f\u5f0f\u89e3\u984c\u904e\u7a0b\uff0c\u4e26\u4f7f\u7528\u5927\u578b\u8a9e\u8a00\u6a21\u578b\u751f\u6210\u81ea\u7136\u8a9e\u8a00\u89e3\u91cb\uff0c\u9032\u800c\u5fae\u8abf\u4e00\u500b\u53ea\u6709 15 \u5104\uff081.5b)\u53c3\u6578\u7684\u5c0f\u578b\u6a21\u578b\uff0c\u4f7f\u5176\u80fd\u5920\u9032\u884c\u8907\u96dc\u7684\u6578\u5b78\u63a8\u7406\uff0c\u4e26\u5728\u601d\u8003\uff08think\uff09\u548c\u56de\u7b54\uff08answer\uff09\u6a19\u7c64\u4e2d\u5448\u73fe\u5176\u63a8\u7406\u904e\u7a0b\uff0c\u800c\u6700\u7d42\u7d50\u679c\u986f\u793a\u5373\u4f7f\u662f\u5c0f\u578b\u6a21\u578b\uff0c\u4e5f\u80fd\u900f\u904e\u51b7\u555f\u52d5\u6280\u8853\u9054\u5230\u4ee4\u4eba\u5370\u8c61\u6df1\u523b\u7684\u63a8\u7406\u80fd\u529b\u3002\u5f71\u7247\u4e5f\u5f37\u8abf\u4e86\u51b7\u555f\u52d5\u6578\u64da\u96c6\u7684\u591a\u6a23\u6027\uff0c\u5305\u62ec\u6578\u5b78\u3001\u7a0b\u5f0f\u78bc\u548c\u5176\u4ed6\u9818\u57df\uff0c\u624d\u80fd\u4f7f\u6a21\u578b\u5177\u6709\u5f37\u5927\u7684\u901a\u7528\u80fd\u529b\u3002<\/p>\n\n\n\n<figure class=\"wp-block-audio\"><audio controls src=\"\/blog\/wp-content\/uploads\/2025\/02\/1738864788036333012-234749516542203.mp3\"><\/audio><\/figure>\n\n\n<figure class=\"wp-block-embed-youtube wp-block-embed is-type-video is-provider-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio\"><div class=\"lyte-wrapper\" title=\"DeepSeek R1 Coldstart: How to TRAIN a 1.5B Model to REASON\" style=\"width:853px;max-width:100%;margin:5px auto;\"><div class=\"lyMe\" id=\"WYL_Pabqg33sUrg\" itemprop=\"video\" itemscope itemtype=\"https:\/\/schema.org\/VideoObject\"><div><meta itemprop=\"thumbnailUrl\" content=\"https:\/\/infernews.com\/blog\/wp-content\/plugins\/wp-youtube-lyte\/lyteCache.php?origThumbUrl=https%3A%2F%2Fi.ytimg.com%2Fvi%2FPabqg33sUrg%2Fhqdefault.jpg\" \/><meta itemprop=\"embedURL\" content=\"https:\/\/www.youtube.com\/embed\/Pabqg33sUrg\" \/><meta itemprop=\"duration\" content=\"PT42M\" \/><meta itemprop=\"uploadDate\" content=\"2025-01-27T01:11:42Z\" \/><\/div><div id=\"lyte_Pabqg33sUrg\" data-src=\"https:\/\/infernews.com\/blog\/wp-content\/plugins\/wp-youtube-lyte\/lyteCache.php?origThumbUrl=https%3A%2F%2Fi.ytimg.com%2Fvi%2FPabqg33sUrg%2Fhqdefault.jpg\" class=\"pL\"><div class=\"tC\"><div class=\"tT\" itemprop=\"name\">DeepSeek R1 Coldstart: How to TRAIN a 1.5B Model to REASON<\/div><\/div><div class=\"play\"><\/div><div class=\"ctrl\"><div class=\"Lctrl\"><\/div><div class=\"Rctrl\"><\/div><\/div><\/div><noscript><a href=\"https:\/\/youtu.be\/Pabqg33sUrg\" rel=\"nofollow\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/infernews.com\/blog\/wp-content\/plugins\/wp-youtube-lyte\/lyteCache.php?origThumbUrl=https%3A%2F%2Fi.ytimg.com%2Fvi%2FPabqg33sUrg%2F0.jpg\" alt=\"DeepSeek R1 Coldstart: How to TRAIN a 1.5B Model to REASON\" width=\"853\" height=\"460\" \/><br \/>Watch this video on YouTube<\/a><\/noscript><meta itemprop=\"description\" content=\"Curious how a 1.5B parameter model can solve maths problems better than far larger models? In this video, I demonstrate how DeepSeek R1 leverages lengthy chains of thought to enhance its mathematical reasoning. We take a close look at how DeepSeek R1 prompts are structured and generated according to the R1 paper\u2014then reproduce these chain of thought prompts via the DeepSeek R1 coldstart method and my own maths compiler to create synthetic training data. I then walk through the entire fine-tuning process, step by step, showing how even a relatively modest model can outperform bulkier rivals using DeepSeek R1\u2019s coldstart technique. If you\u2019re fascinated by AI breakthroughs or simply enjoy seeing a thorough training pipeline, this detailed behind-the-scenes session is for you. github repo for math compiler: https:\/\/github.com\/chrishayuk\/chuk-math github repo for verifiers: https:\/\/github.com\/chrishayuk\/verifiers 00:00 - intro 01:10 - DeepSeek R1 Chat 03:35 - DeepSeek R1 Ollama 04:44 - Think Tags 05:04 - Deep Seek R1 paper 13:45 - Generating synthetic long chains of thought 15:25 - Translating the CoT to natural language 18:40 - Self Reflection and Self Correction 22:50 - Generating sample data 30:06 - Testing the Qwen2.5-1.5B 30:52 - Fine Tuning Qwen2.5-1.5B with our Coldstart data 34:52 - Chatting with our Fine Tuned Model 39:55 - Conclusion\"><\/div><\/div><div class=\"lL\" style=\"max-width:100%;width:853px;margin:5px auto;\"><\/div><figcaption><\/figcaption><\/figure>","protected":false},"excerpt":{"rendered":"<p>\u5f71\u7247\u4e3b\u8981\u8b1b\u89e3\u4e86\u5982\u4f55\u4f7f\u7528\u51b7\u555f\u52d5\u6280\u8853\u4f86\u63d0\u5347\u5c0f\u578b\u8a9e\u8a00\u6a21\u578b\uff08LLM\uff09\u7684\u63a8\u7406\u80fd\u529b\uff0c\u7279\u5225\u662f\u5728\u6578\u5b78\u554f\u984c\u4e0a\u7684\u8868\u73fe\u3002\u5f71\u7247\u7684\u6838\u5fc3\u5728\u65bc\u91cd\u73fe DeepSeek R1 \u6a21\u578b\u8ad6\u6587\u4e2d\u63d0\u5230\u7684\u51b7\u555f\u52d5\u65b9\u6cd5\uff0c\u5373\u900f\u904e\u5c11\u91cf\u9ad8\u54c1\u8cea\u7684\u5408\u6210\u6578\u64da\u96c6\uff0c\u8b93\u6a21\u578b\u5728\u5f37\u5316\u5b78\u7fd2\u524d\u5c31\u80fd\u5920\u751f\u6210\u6e05\u6670\u4e14\u9023\u8cab\u7684\u601d\u8003\u93c8\u3002\u9019\u4e9b\u6578\u64da\u96c6\u5229\u7528\u6578\u5b78\u7de8\u8b6f\u5668\u4f86\u7522\u751f\u7cbe\u78ba\u7684\u6b65\u9a5f\u5f0f\u89e3\u984c\u904e\u7a0b\uff0c\u4e26\u4f7f\u7528\u5927\u578b\u8a9e\u8a00\u6a21\u578b\u751f\u6210\u81ea\u7136\u8a9e\u8a00\u89e3\u91cb\uff0c\u9032\u800c\u5fae\u8abf\u4e00\u500b\u53ea\u6709 15 \u5104\uff081.5b)\u53c3\u6578\u7684\u5c0f\u578b\u6a21\u578b\uff0c\u4f7f\u5176\u80fd\u5920\u9032\u884c\u8907\u96dc\u7684\u6578\u5b78\u63a8\u7406\uff0c\u4e26\u5728\u601d\u8003\uff08think\uff09\u548c\u56de\u7b54\uff08answer\uff09\u6a19\u7c64\u4e2d\u5448\u73fe\u5176\u63a8\u7406\u904e\u7a0b\uff0c\u800c\u6700\u7d42\u7d50\u679c\u986f\u793a\u5373\u4f7f\u662f\u5c0f\u578b\u6a21\u578b\uff0c\u4e5f\u80fd\u900f\u904e\u51b7\u555f\u52d5\u6280\u8853\u9054\u5230\u4ee4\u4eba\u5370\u8c61\u6df1\u523b\u7684\u63a8\u7406\u80fd\u529b\u3002\u5f71\u7247\u4e5f\u5f37\u8abf\u4e86\u51b7\u555f\u52d5\u6578\u64da\u96c6\u7684\u591a\u6a23\u6027\uff0c\u5305\u62ec\u6578\u5b78\u3001\u7a0b\u5f0f\u78bc\u548c\u5176\u4ed6\u9818\u57df\uff0c\u624d\u80fd\u4f7f\u6a21\u578b\u5177\u6709\u5f37\u5927\u7684\u901a\u7528\u80fd\u529b\u3002<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"googlesitekit_rrm_CAowvqSiDA:productID":"","footnotes":""},"categories":[76,127],"tags":[],"class_list":["post-4474","post","type-post","status-publish","format-standard","hentry","category-76","category-127"],"_links":{"self":[{"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/posts\/4474","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/comments?post=4474"}],"version-history":[{"count":0,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/posts\/4474\/revisions"}],"wp:attachment":[{"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/media?parent=4474"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/categories?post=4474"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/tags?post=4474"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}