
{"id":5953,"date":"2025-07-19T19:02:51","date_gmt":"2025-07-19T11:02:51","guid":{"rendered":"https:\/\/infernews.com\/?p=5953"},"modified":"2025-07-19T19:02:52","modified_gmt":"2025-07-19T11:02:52","slug":"art%ef%bc%9a%e7%89%b9%e5%b7%a5%e5%bc%b7%e5%8c%96%e8%a8%93%e7%b7%b4%e5%b8%ab","status":"publish","type":"post","link":"https:\/\/infernews.com\/blog\/art%ef%bc%9a%e7%89%b9%e5%b7%a5%e5%bc%b7%e5%8c%96%e8%a8%93%e7%b7%b4%e5%b8%ab\/","title":{"rendered":"ART: Agent Reinforcement Trainer"},"content":{"rendered":"\n<p><a href=\"https:\/\/github.com\/OpenPipe\/ART\">ART<\/a> is an open-source reinforcement learning framework that improves agent reliability by letting LLMs <strong>learn from experience<\/strong>. ART provides an ergonomic framework for integrating GRPO into any Python application.<\/p>\n\n\n\n<p><strong>RULER<\/strong> (Relative Universal LLM-Elicited Rewards) eliminates the need to hand-design reward functions by using an LLM-as-judge to score agent trajectories automatically. Define your task in the system prompt and RULER handles the rest: <strong>no labeled data, expert feedback, or reward engineering required<\/strong>.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img data-dominant-color=\"f7f6f5\" data-has-transparency=\"false\" style=\"--dominant-color: #f7f6f5;\" loading=\"lazy\" decoding=\"async\" width=\"781\" height=\"514\" src=\"\/blog\/wp-content\/uploads\/2025\/07\/ART_E_graphs.png\" alt=\"\" class=\"wp-image-5954 not-transparent\" srcset=\"https:\/\/infernews.com\/blog\/wp-content\/uploads\/2025\/07\/ART_E_graphs.png 781w, 
https:\/\/infernews.com\/blog\/wp-content\/uploads\/2025\/07\/ART_E_graphs-300x197.png 300w, https:\/\/infernews.com\/blog\/wp-content\/uploads\/2025\/07\/ART_E_graphs-768x505.png 768w\" sizes=\"auto, (max-width: 781px) 100vw, 781px\" \/><\/figure>\n","protected":false},"excerpt":{"rendered":"<p>ART is an open-source reinforcement learning framework that improves agent reliability by letting LLMs learn from experience. ART provides an ergonomic framework for integrating GRPO into any Python application. RULER (Relative Universal LLM-Elicited Rewards) eliminates the need to hand-design reward functions by using an LLM-as-judge to score agent trajectories automatically. Define your task in the system prompt and RULER handles the rest: 
no labeled data, expert feedback, or reward engineering required.<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"googlesitekit_rrm_CAowvqSiDA:productID":"","footnotes":""},"categories":[127,160,146,133],"tags":[],"class_list":["post-5953","post","type-post","status-publish","format-standard","hentry","category-127","category-160","category-146","category-133"],"_links":{"self":[{"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/posts\/5953","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/comments?post=5953"}],"version-history":[{"count":0,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/posts\/5953\/revisions"}],"wp:attachment":[{"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/media?parent=5953"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/categories?post=5953"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/tags?post=5953"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}