
{"id":9018,"date":"2026-06-10T22:13:32","date_gmt":"2026-06-10T14:13:32","guid":{"rendered":"https:\/\/infernews.com\/blog\/lip-forcing\/"},"modified":"2026-06-11T01:36:37","modified_gmt":"2026-06-10T17:36:37","slug":"lip-forcing","status":"publish","type":"post","link":"https:\/\/infernews.com\/blog\/lip-forcing\/","title":{"rendered":"Lip Forcing\uff1a\u628a\u5507\u5f62\u540c\u6b65\u63a8\u9032\u5373\u6642\u4e32\u6d41"},"content":{"rendered":"\n<figure class=\"wp-block-image size-large\"><img decoding=\"async\" src=\"https:\/\/infernews.com\/blog\/wp-content\/uploads\/2026\/06\/pasted-67685791aeb2.jpg\" alt=\"Hero image preview\"\/><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">Lip Forcing \u662f\u4e00\u500b\u91dd\u5c0d video-to-video\uff08V2V\uff09lip synchronization \u7684\u7814\u7a76\u9805\u76ee\uff0c\u91cd\u9ede\u662f\u628a diffusion \u6a21\u578b\u539f\u672c\u6602\u8cb4\u7684\u63a8\u7406\u6d41\u7a0b\uff0c\u5927\u5e45\u58d3\u7e2e\u5230\u9069\u5408\u5373\u6642\u4e32\u6d41\u4f7f\u7528\u3002\u5b83\u5e0c\u671b\u5728\u4fdd\u7559\u4eba\u7269\u8eab\u4efd\u3001\u982d\u90e8\u59ff\u52e2\u8207\u80cc\u666f\u4e00\u81f4\u6027\u7684\u540c\u6642\uff0c\u4ee4\u53e3\u578b\u66f4\u6e96\u78ba\u8cbc\u5408\u76ee\u6a19\u97f3\u8a0a\u3002<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">\u73fe\u6709 diffusion-based \u5507\u5f62\u540c\u6b65\u65b9\u6cd5\u756b\u8cea\u548c\u8072\u756b\u5c0d\u9f4a\u8868\u73fe\u4e0d\u932f\uff0c\u4f46\u901a\u5e38\u8981\u770b\u5b8c\u6574\u6bb5\u5f71\u7247\u3001\u518d\u7d93\u904e\u5f88\u591a\u6b21 denoising steps\uff0c\u901f\u5ea6\u548c\u5ef6\u9072\u90fd\u96e3\u4ee5\u914d\u5408\u76f4\u64ad\u7ffb\u8b6f\u3001virtual avatars\u3001interactive agents \u9019\u985e\u5834\u666f\u3002Lip Forcing \u6539\u7528 autoregressive diffusion\uff0c\u628a\u5f71\u7247\u5206\u6bb5\u9010\u584a\u751f\u6210\uff0c\u4e26\u628a 50-step teacher \u58d3\u7e2e\u6210 two-step streaming student\uff0c\u6e1b\u5c11\u8a08\u7b97\u8ca0\u64d4\u3002<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">\u5c0d lip-sync \u4efb\u52d9\uff0c\u672c\u8eab\u4e0d\u662f\u55ae\u7d14\u5957\u7528\u901a\u7528\u52a0\u901f\u6280\u5de7\u3002\u4f5c\u8005\u6307\u51fa CFG \u6703\u5728 reference fidelity \u8207 synchronization \u4e4b\u9593\u51fa\u73fe\u53d6\u6368\uff0c\u4e26\u64da\u6b64\u8a2d\u8a08\u51fa Sync-Window DMD\u3001two-step inference schedule\uff0c\u4ee5\u53ca\u4ee5 SyncNet \u70ba\u57fa\u790e\u7684 reward\uff0c\u76ee\u6a19\u662f\u5728\u5c11\u6b65\u6578\u4e0b\u4ecd\u7dad\u6301\u53ef\u7528\u7684\u5507\u5f62\u540c\u6b65\u6548\u679c\u3002<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">\u5169\u500b student \u6a21\u578b\u90fd\u7531 14B teacher \u84b8\u993e\u800c\u4f86\u30021.3B student \u53ef\u9054 31 FPS\uff0c\u901f\u5ea6\u6bd4\u540c\u898f\u6a21 bidirectional model \u5feb 17.6 \u500d\uff1b14B student \u5247\u6bd4 teacher \u5feb 39.8 \u500d\uff0c\u4e26\u7dad\u6301\u76f8\u8fd1\u7684 reference fidelity\u3002\u5169\u500b\u7248\u672c\u7684 time-to-first-frame \u90fd\u4f4e\u65bc 1 \u6beb\u79d2\uff0c\u986f\u793a\u5b83\u7279\u5225\u9069\u5408\u4f4e\u5ef6\u9072\u4e32\u6d41\u60c5\u5883\u3002<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>\u652f\u63f4\u5373\u6642\u4e32\u6d41\uff0c\u6700\u9ad8\u53ef\u9054 31 FPS<\/li>\n\n\n\n<li>\u6bcf\u500b chunk \u53ea\u9700 two denoising steps\uff0c\u6bcb\u9808 inference-time CFG<\/li>\n\n\n\n<li>\u63a1\u7528 autoregressive diffusion\uff0c\u964d\u4f4e\u5168\u5e8f\u5217\u6ce8\u610f\u529b\u5e36\u4f86\u7684\u6210\u672c<\/li>\n\n\n\n<li>\u91dd\u5c0d lip synchronization \u8a2d\u8a08\u84b8\u993e\u65b9\u6cd5\uff0c\u4e0d\u662f\u4e00\u822c\u52a0\u901f\u6539\u88dd<\/li>\n\n\n\n<li>\u9069\u5408 live translation\u3001virtual avatars\u3001interactive agents \u7b49\u5834\u666f<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">\u5982\u679c\u4f60\u95dc\u6ce8\u7684\u662f\u5373\u6642\u5634\u578b\u540c\u6b65\u3001\u4f4e\u5ef6\u9072\u5f71\u7247\u751f\u6210\uff0c\u6216\u60f3\u4e86\u89e3 few-step autoregressive diffusion \u5982\u4f55\u843d\u5730\u5230\u5f71\u97f3\u4efb\u52d9\uff0c\u9019\u500b\u9805\u76ee\u76f8\u7576\u6709\u53c3\u8003\u50f9\u503c\u3002\u6587\u4e2d\u53ef\u78ba\u8a8d\u5f15\u7528\u8207\u6bd4\u8f03\u7684\u6280\u8853\u8108\u7d61\u5305\u62ec Computer-use agents\u3001CUAs\u3001LoRA\u3001OSWorld \u4ee5\u5916\u7684\u5f71\u97f3\u751f\u6210\u65b9\u5411\uff1b\u5c31\u672c\u9801\u5167\u5bb9\u53ef\u660e\u78ba\u5217\u51fa\u7684\u6a21\u578b\uff0c\u4e3b\u8981\u662f 14B audio-conditioned bidirectional video diffusion teacher\u30011.3B student\u300114B student\uff0c\u4ee5\u53ca SyncNet\u3002<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Paper\uff1a<\/strong> <a href=\"https:\/\/arxiv.org\/pdf\/2606.11180\" rel=\"noopener noreferrer\">https:\/\/arxiv.org\/pdf\/2606.11180<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>\u53ea\u9700\u5169\u6b65\u751f\u6210\u5f71\u7247\u7247\u6bb5\uff0c\u517c\u9867\u5507\u5f62\u6e96\u78ba\u5ea6\u8207\u901f\u5ea6\u3002\u76ee\u6a19\u662f\u5728\u4f4e\u5ef6\u9072\u60c5\u6cc1\u4e0b\u505a\u5230\u5373\u6642 lip synchronization\u3002<\/p>\n","protected":false},"author":8,"featured_media":9017,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"ai_generated_summary":"","footnotes":""},"categories":[133,76,128],"tags":[],"class_list":["post-9018","post","type-post","status-publish","format-standard","hentry","category-133","category-76","category-128"],"_links":{"self":[{"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/posts\/9018","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/users\/8"}],"replies":[{"embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/comments?post=9018"}],"version-history":[{"count":1,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/posts\/9018\/revisions"}],"predecessor-version":[{"id":9033,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/posts\/9018\/revisions\/9033"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/media\/9017"}],"wp:attachment":[{"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/media?parent=9018"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/categories?post=9018"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/tags?post=9018"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}