
{"id":9842,"date":"2026-07-03T22:00:36","date_gmt":"2026-07-03T14:00:36","guid":{"rendered":"https:\/\/infernews.com\/blog\/discrete-diffusion-rrg\/"},"modified":"2026-07-03T22:00:36","modified_gmt":"2026-07-03T14:00:36","slug":"discrete-diffusion-rrg","status":"publish","type":"post","link":"https:\/\/infernews.com\/blog\/discrete-diffusion-rrg\/","title":{"rendered":"discrete_diffusion_RRG\uff1a\u96e2\u6563\u64f4\u6563\u6a21\u578b\u9ede\u6a23\u5beb\u80f8\u80ba X \u5149\u5831\u544a"},"content":{"rendered":"\n<figure class=\"wp-block-image size-large\"><img decoding=\"async\" src=\"https:\/\/infernews.com\/blog\/wp-content\/uploads\/2026\/07\/pasted-c8352d9afde5.jpg\" alt=\"Repository image for mxvp\/discrete_diffusion_RRG\"><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">\u9019\u662f\u4e00\u500b\u91ab\u5b78\u5f71\u50cf\u8a9e\u8a00\u6a21\u578b\u5fae\u8abf\u8207\u8a55\u6e2c\u9805\u76ee\uff0c\u6838\u5fc3\u662f\u628a image-conditioned discrete-diffusion language model \u8207 autoregressive baseline \u653e\u5728\u540c\u4e00\u5bb6\u65cf\u9aa8\u5e79\u4e0b\u76f4\u63a5\u6bd4\u8f03\u3002\u5b83\u4e3b\u8981\u8655\u7406 chest X-ray VQA \u8207\u653e\u5c04\u5831\u544a\u88dc\u5168\uff0c\u76ee\u6a19\u4e0d\u662f\u55ae\u7d14\u751f\u6210\u6587\u5b57\uff0c\u800c\u662f\u8b93\u6a21\u578b\u6839\u64da X \u5149\u5f71\u50cf\u56de\u7b54\u554f\u984c\uff0c\u6216\u5728\u5df2\u77e5\u90e8\u5206\u53e5\u5b50\u7684\u60c5\u6cc1\u4e0b\u88dc\u5beb\u5176\u9918\u5167\u5bb9\u3002<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">\u9805\u76ee\u7684\u8a2d\u8a08\u91cd\u9ede\u5728\u65bc\u63a7\u5236\u8b8a\u56e0\uff1aDiffusionGemma \u8207 Gemma-4-26B \u4f7f\u7528\u76f8\u8fd1\u7684 backbone family\u3001vision tower\u3001\u8cc7\u6599\u8207 LoRA \u914d\u65b9\uff0c\u4ee4\u6bd4\u8f03\u66f4\u96c6\u4e2d\u65bc\u751f\u6210\u65b9\u5f0f\u672c\u8eab\u3002diffusion \u8def\u7dda\u628a\u5831\u544a\u7576\u6210\u53ef\u9010\u6b65\u53bb\u566a\u7684 decoder canvas\uff0cautoregressive \u5247\u6cbf\u7528 next-token \u9806\u5e8f\u751f\u6210\uff1b\u524d\u8005\u7684\u512a\u52e2\u662f\u53ef\u4ee5\u505a any-order infill\uff0c\u7528\u96d9\u5411\u8108\u7d61\u88dc\u7a7a\u4f4d\uff0c\u5f8c\u8005\u5247\u8f03\u63a5\u8fd1\u73fe\u6642\u591a\u6578 VLM \u7684\u5e38\u898b\u505a\u6cd5\u3002<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">\u90e8\u7f72\u8207\u6e2c\u8a66\u9580\u6abb\u4e0d\u7b97\u4f4e\u3002\u6a21\u578b\u6b0a\u91cd\u900f\u904e Hugging Face IDs \u8f09\u5165\uff0c\u8a2d\u5b9a\u6a94\u8981\u63a5\u99c1\u672c\u5730 JSON \u8cc7\u6599\u7d22\u5f15\uff1b\u5009\u5eab\u4e5f\u63d0\u4f9b synthetic: {n: 16} \u9019\u7a2e\u5c0f\u578b smoke test\uff0c\u9069\u5408\u5148\u78ba\u8a8d\u6d41\u7a0b\u6709\u6c92\u6709\u8dd1\u901a\u3002\u786c\u4ef6\u8981\u6c42\u6bd4\u8f03\u660e\u78ba\uff0cdiffusion backbone \u9700\u8981\u652f\u63f4 bf16 \u7684 GPU\uff0c\u800c\u4e14\u8a18\u61b6\u9ad4\u5927\u7d04\u8981 80 GB\uff0c\u9019\u5df2\u7d93\u628a\u5b83\u5b9a\u4f4d\u6210\u7814\u7a76\u5718\u968a\u6216\u5177\u5099\u9ad8\u968e GPU \u74b0\u5883\u7684\u91ab\u7642 AI \u9805\u76ee\u3002<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">\u6548\u80fd\u8868\u73fe\u6709\u5e7e\u500b\u503c\u5f97\u7559\u610f\u7684\u9ede\u3002\u652f\u63f4\u5167\u5bb9\u63d0\u5230 Discrete Diffusion Language Models \u5728\u91ab\u7642 VQA \u4e0a\u53ef\u8ffd\u5e73\uff0c\u751a\u81f3\u7565\u52dd\u540c\u7cfb autoregression\uff0c\u89e3\u78bc\u901f\u5ea6\u4ea6\u53ef\u9054 3.5 \u81f3 4.4 \u500d\uff1b\u4e0d\u904e\u76ee\u524d\u8f03\u5b8c\u6574\u7684\u6e96\u78ba\u5ea6\u91cd\u5fc3\u4ecd\u653e\u5728 VQA\uff0c\u800c\u5831\u544a\u751f\u6210\u90e8\u5206\u4e3b\u8981\u5c55\u793a\u4e92\u52d5\u5f0f infill \u80fd\u529b\uff0c\u672a\u7b97\u662f\u5b8c\u6574\u81e8\u5e8a\u5831\u544a\u751f\u6210\u7cfb\u7d71\u3002\u8a9e\u7fa9\u8a55\u5206\u9084\u53ef\u63a5 LLM judge\uff0c\u4f46\u9019\u90e8\u5206\u9700\u8981\u984d\u5916 API \u91d1\u9470\uff0c\u4e5f\u8868\u793a\u7d50\u679c\u89e3\u8b80\u4ecd\u6709\u4e00\u5b9a\u7814\u7a76\u6027\u8cea\u3002<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>\u985e\u578b\u4e0a\uff0c\u5b83\u8f03\u63a5\u8fd1<strong>\u7814\u7a76\u539f\u578b\u52a0\u8a55\u6e2c\u7a0b\u5f0f\u78bc<\/strong>\uff0c\u4e0d\u662f\u5373\u88dd\u5373\u7528\u7684\u81e8\u5e8a\u8edf\u4ef6\u3002<\/li>\n<li>\u4e3b\u8981\u8cc7\u6599\u4f86\u6e90\u5305\u62ec VQA-RAD\u3001SLAKE\u3001VQA-Med \u8207 MIMIC-CXR\u3002<\/li>\n<li>\u76f8\u95dc\u6a21\u578b\u5305\u62ec <strong>DiffusionGemma-26B<\/strong>\u3001<strong>Gemma-4-26B<\/strong>\uff0c\u4e26\u4ee5 <strong>LoRA<\/strong> \u65b9\u5f0f\u5fae\u8abf\u3002<\/li>\n<li>any-order infill \u662f\u6700\u6709\u8fa8\u8b58\u5ea6\u7684\u80fd\u529b\uff0c\u9069\u5408\u5148\u56fa\u5b9a\u90e8\u5206\u5831\u544a\u5167\u5bb9\uff0c\u518d\u7531\u6a21\u578b\u88dc\u5168\u5176\u9918\u4f4d\u7f6e\u3002<\/li>\n<li>\u9069\u5408\u9700\u8981\u6bd4\u8f03\u751f\u6210\u7bc4\u5f0f\u3001\u7814\u7a76 radiology report drafting\uff0c\u6216\u60f3\u9a57\u8b49 discrete diffusion \u5728\u91ab\u7642\u5834\u666f\u8868\u73fe\u7684\u5718\u968a\u3002<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\"><a href=\"https:\/\/huggingface.co\/papers\/2607.01436\" rel=\"noopener noreferrer\" target=\"_blank\"><strong>\u9805\u76ee\u4e3b\u9801<\/strong><\/a> \u00b7 <a href=\"https:\/\/github.com\/mxvp\/discrete_diffusion_RRG\" rel=\"noopener noreferrer\" target=\"_blank\"><strong>GitHub<\/strong><\/a> \u00b7 <a href=\"https:\/\/huggingface.co\/gevaertlab\/diffusiongemma-radiology-vqa\" rel=\"noopener noreferrer\" target=\"_blank\"><strong>\u6a21\u578b<\/strong><\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>\u9019\u662f\u4e00\u500b\u91ab\u5b78\u5f71\u50cf\u8a9e\u8a00\u6a21\u578b\u5fae\u8abf\u8207\u8a55\u6e2c\u9805\u76ee\uff0c\u91cd\u9ede\u662f\u6bd4\u8f03 DiffusionGemma \u8207\u81ea\u56de\u6b78\u57fa\u7dda\u3002\u5b83\u4e5f\u793a\u7bc4\u4efb\u4f55\u9806\u5e8f\u88dc\u5168\u6587\u672c\u5728\u653e\u5c04\u5831\u544a\u8349\u7a3f\u4e2d\u7684\u50f9\u503c\u3002<\/p>\n","protected":false},"author":8,"featured_media":9841,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"ai_generated_summary":"","footnotes":""},"categories":[133,178,140,137,30,144,149,199],"tags":[],"class_list":["post-9842","post","type-post","status-publish","format-standard","hentry","category-133","category-google","category-gemini","category-api","category-image","category-medical","category-149","category-dataset-"],"_links":{"self":[{"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/posts\/9842","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/users\/8"}],"replies":[{"embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/comments?post=9842"}],"version-history":[{"count":0,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/posts\/9842\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/media\/9841"}],"wp:attachment":[{"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/media?parent=9842"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/categories?post=9842"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/tags?post=9842"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}