
{"id":9646,"date":"2026-06-26T04:43:30","date_gmt":"2026-06-25T20:43:30","guid":{"rendered":"https:\/\/infernews.com\/blog\/towards-holistic-evaluation-of-generative-diffusion-transformers\/"},"modified":"2026-06-26T04:46:38","modified_gmt":"2026-06-25T20:46:38","slug":"towards-holistic-evaluation-of-generative-diffusion-transformers","status":"publish","type":"post","link":"https:\/\/infernews.com\/blog\/towards-holistic-evaluation-of-generative-diffusion-transformers\/","title":{"rendered":"DiffusionBench\uff1a\u64f4\u6563\u6a21\u578b\u8a55\u6e2c\u6846\u67b6"},"content":{"rendered":"\n<figure class=\"wp-block-image size-large\"><img decoding=\"async\" src=\"https:\/\/infernews.com\/blog\/wp-content\/uploads\/2026\/06\/qualitative.jpg\" alt=\"DiffusionBench logo\"\/><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">\u9019\u662f\u4e00\u500b\u91dd\u5c0d\u64f4\u6563 Transformer\uff08Diffusion Transformers, DiT\uff09\u7814\u7a76\u7684\u57fa\u6e96\u6e2c\u8a66\u9805\u76ee\uff08benchmark\uff09\uff0c\u6838\u5fc3\u76ee\u7684\u662f\u5728 ImageNet \u8207\u6587\u5b57\u751f\u6210\u5716\u50cf\uff08T2I\uff09\u5169\u7a2e\u5834\u666f\u4e0b\uff0c\u5c0d\u64f4\u6563\u6a21\u578b\u9032\u884c\u7d71\u4e00\u7684\u8a13\u7df4\u8207\u6a6b\u5411\u8a55\u6e2c\u3002\u820a\u6709\u505a\u6cd5\u666e\u904d\u4ee5 ImageNet \u7684\u985e\u5225\u689d\u4ef6\u751f\u6210\uff08class-conditional generation\uff09\u70ba\u55ae\u4e00\u8a55\u6e2c\u6a19\u6e96\uff0c\u4f5c\u8005\u6279\u8a55\u9019\u7a2e\u7bc4\u5f0f\u5df2\u7d93\u96e3\u4ee5\u53cd\u6620\u751f\u6210\u6a21\u578b\u7684\u771f\u5be6\u9032\u5c55\uff0c\u56e0\u70ba T2I \u96d6\u7136\u66f4\u8cbc\u8fd1\u5be6\u7528\uff0c\u537b\u5e38\u88ab\u8996\u70ba\u904e\u65bc\u6602\u8cb4\u6216\u4e0d\u4fbf\u800c\u8df3\u904e\u3002\u70ba\u6b64\uff0c\u9805\u76ee\u63a8\u51fa NanoGen \u7d71\u4e00\u8a13\u7df4\u6846\u67b6\uff0c\u4e26\u4ee5 DiffusionBench \u91cd\u65b0\u7d44\u7e54\u8a55\u6e2c\u7d50\u69cb\uff0c\u628a ImageNet \u8207 T2I \u7d0d\u5165\u540c\u4e00\u6bd4\u8f03\u57fa\u6e96\u3002<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>\u7d71\u4e00\u8a13\u7df4\u4ecb\u9762<\/strong>\uff1aNanoGen \u53ea\u9700\u7d04 12 \u884c\u7684\u914d\u7f6e\u66f4\u6539\uff0c\u5c31\u80fd\u5728 ImageNet \u8207 T2I \u4e4b\u9593\u5207\u63db\u3002<\/li>\n\n\n\n<li><strong>\u8de8\u4efb\u52d9\u65b9\u6cd5\u6bd4\u8f03<\/strong>\uff1a\u7cfb\u7d71\u6027\u6536\u9304\u4e26\u6bd4\u8f03 25 \u7a2e DiT \u65b9\u6cd5\u3002<\/li>\n\n\n\n<li><strong>\u591a\u7dad\u5ea6\u8a55\u6e2c\u6307\u6a19<\/strong>\uff1a\u6db5\u84cb FID \u7b49\u591a\u9805 ImageNet \u8207 T2I \u6307\u6a19\u3002<\/li>\n\n\n\n<li><strong>\u7814\u7a76\u6210\u679c\u5df2\u6536\u9304 arXiv \u8ad6\u6587\uff082606.24888\uff09<\/strong>\uff0c\u5c0d\u61c9\u7684\u6a21\u578b\u6b0a\u91cd\u540c\u6b65\u4e0a\u8f09\u81f3 HuggingFace\u3002<\/li>\n\n\n\n<li><strong>\u76ee\u524d\u7248\u672c\u70ba v0.1<\/strong>\uff0c\u4f5c\u8005\u660e\u78ba\u6a19\u793a\u4ecd\u8655\u65bc\u521d\u6b65\u968e\u6bb5\uff0c\u4e26\u7a4d\u6975\u62db\u52df\u793e\u7fa4\u8ca2\u737b\u8005\u3002<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">\u8207\u55ae\u7368\u7684 ImageNet \u8a55\u6e2c\u76f8\u6bd4\uff0cDiffusionBench \u7684\u95dc\u9375\u5dee\u7570\u5728\u65bc\u540c\u6642\u7d0d\u5165 T2I \u4efb\u52d9\uff0c\u85c9\u6b64\u63ed\u793a\u65b9\u6cd5\u6392\u540d\u5728\u5169\u985e\u4efb\u52d9\u4e4b\u9593\u4e26\u7121\u5f37\u76f8\u95dc\uff08no strong correlation\uff09\uff0c\u9019\u610f\u5473\u8457 ImageNet \u4e0a\u7684 FID \u63d0\u5347\u672a\u5fc5\u4ee3\u8868 T2I \u751f\u6210\u54c1\u8cea\u540c\u6b65\u6539\u5584\u3002\u6846\u67b6\u652f\u63f4 VAE\u3001RAE \u8207 Pixel space \u7b49\u4e0d\u540c\u6f5b\u5728\u7a7a\u9593\uff08latent space\uff09\u7684\u8a13\u7df4\uff0c\u6280\u8853\u4e0a\u6574\u5408\u4e86 REPA-E \u8207 iREPA \u7b49\u5c0d\u6bd4\u65b9\u6cd5\uff0c\u9069\u5408 DiT \u7814\u7a76\u5718\u968a\u3001\u751f\u6210\u5f0f\u6a21\u578b\u5de5\u7a0b\u5e2b\uff0c\u4ee5\u53ca\u95dc\u5fc3\u57fa\u6e96\u516c\u6b63\u6027\u7684\u5b78\u8853\u5de5\u4f5c\u8005\u4f7f\u7528\u3002\u53d7\u60e0\u6700\u5927\u7684\uff0c\u662f\u9700\u8981\u8a55\u4f30\u81ea\u5bb6\u65b9\u6cd5\u5728\u591a\u4efb\u52d9\u6cdb\u5316\u80fd\u529b\u7684\u5718\u968a\uff0c\u4ee5\u53ca\u5e0c\u671b\u907f\u514d\u55ae\u4e00\u6307\u6a19\u8aa4\u5c0e\u7684\u5be9\u7a3f\u4eba\u8207\u7814\u7a76\u8005\u3002<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>GitHub\uff1a<\/strong> <a href=\"https:\/\/github.com\/End2End-Diffusion\/diffusion-bench\" rel=\"noopener noreferrer\">https:\/\/github.com\/End2End-Diffusion\/diffusion-bench<\/a><\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>\u9805\u76ee\u4e3b\u9801\uff1a<\/strong> <a href=\"https:\/\/end2end-diffusion.github.io\/diffusion-bench\/\" rel=\"noopener noreferrer\">https:\/\/end2end-diffusion.github.io\/diffusion-bench\/<\/a><\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Model\uff1a<\/strong> <a href=\"https:\/\/huggingface.co\/diffusion-bench\/diffusion-bench\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/huggingface.co\/diffusion-bench\/diffusion-bench<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>DiffusionBench \u900f\u904e\u7d71\u4e00\u7684 NanoGen \u6846\u67b6\uff0c\u6a6b\u5411\u6bd4\u8f03 25 \u7a2e DiT \u65b9\u6cd5\u5728 ImageNet \u8207\u6587\u5b57\u751f\u6210\u5716\u50cf\u7684\u8868\u73fe\uff0c\u6311\u6230\u50c5\u9760 FID \u8a55\u4f30\u64f4\u6563\u6a21\u578b\u7684\u820a\u7bc4\u5f0f\u3002<\/p>\n","protected":false},"author":8,"featured_media":9645,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"ai_generated_summary":"","footnotes":""},"categories":[133,30,129,157,76,127,160,197],"tags":[],"class_list":["post-9646","post","type-post","status-publish","format-standard","hentry","category-133","category-image","category-txt2img","category-157","category-76","category-127","category-160","category-framework"],"_links":{"self":[{"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/posts\/9646","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/users\/8"}],"replies":[{"embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/comments?post=9646"}],"version-history":[{"count":2,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/posts\/9646\/revisions"}],"predecessor-version":[{"id":9649,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/posts\/9646\/revisions\/9649"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/media\/9645"}],"wp:attachment":[{"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/media?parent=9646"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/categories?post=9646"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/tags?post=9646"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}