
{"id":8459,"date":"2026-05-22T14:03:36","date_gmt":"2026-05-22T06:03:36","guid":{"rendered":"https:\/\/infernews.com\/blog\/first-foundation-asr-built-for-the-real-world-7-atomic-acoustic-conditions-54-co\/"},"modified":"2026-05-22T14:03:36","modified_gmt":"2026-05-22T06:03:36","slug":"first-foundation-asr-built-for-the-real-world-7-atomic-acoustic-conditions-54-co","status":"publish","type":"post","link":"https:\/\/infernews.com\/blog\/first-foundation-asr-built-for-the-real-world-7-atomic-acoustic-conditions-54-co\/","title":{"rendered":"Mega-ASR: More Stable Speech Recognition in Noisy Environments"},"content":{"rendered":"<figure class=\"wp-block-image size-large\"><img decoding=\"async\" src=\"https:\/\/infernews.com\/blog\/wp-content\/uploads\/2026\/05\/mega_asr_logo.png\" alt=\"Mega-ASR Logo\"><\/figure>\n<p>Mega-ASR is a speech recognition project built for in-the-wild conditions, with a focus on keeping results usable even as the acoustic environment gets worse. Under noise, reverberation, far-field pickup, or even transmission dropouts, typical models often drop sentences, make up content, or return no output at all; this project was built to address exactly these problems.<\/p>\n<p>Rather than relying on a single kind of noise augmentation, it breaks the acoustic interference commonly found in the real world into 7 atomic conditions, combines them into 54 compound scenarios, and trains the model on roughly 2.6 million samples. The paper also describes two key methods: A2S-SFT and
reinforcement learning based on DG-WGPO. Both aim to make the model's mapping from the acoustic signal to meaning more stable, with particular emphasis on recovering semantics under severe distortion and reconstructing local keywords.<\/p>\n<p>To try the project, the most direct route is to look at its Hugging Face weights, the technical report, and the accompanying Voices-in-the-Wild-2M dataset and Voices-in-the-Wild-Bench benchmark. For anyone building voice input, meeting transcription, call-center recording cleanup, or outdoor audio-capture products, these resources are more useful than demos alone, because the same benchmark can be used to compare how different models perform in adverse conditions.<\/p>\n<ul>\n<li>Trained against noise, far-field pickup, obstruction, reverberation, recording artifacts, electronic distortion, and transmission packet loss<\/li>\n<li>Designed to reduce hallucination, empty output, and whole-sentence omissions<\/li>\n<li>Provides model weights, a dataset, and a benchmark to support further evaluation<\/li>\n<li>Related models worth noting include Qwen3-ASR-1.7B, along with the comparisons against other strong open-source and closed-source models mentioned in the README<\/li>\n<\/ul>\n<p>On performance, the published material indicates that it outperforms earlier strong models on several adverse-condition benchmarks, for example with clear error-rate reductions on both VOiCES R4-B-F and NOIZEUS Sta-0;
in compound acoustic scenarios it also records a relative error-rate improvement of more than 30%. These results come mainly from evaluations provided by the paper and the project itself, however, so in practice you still need to check whether the supported languages, audio lengths, and deployment resources fit your use case.<\/p>\n<p>Overall, what makes Mega-ASR most worth watching is not how far it pushes clean-speech scores, but that it brings speech recognition back to a question closer to field conditions: when pickup is poor, the environment is chaotic, and the signal is incomplete, can the system still deliver trustworthy text? For projects where stability matters more than perfection, this direction is quite appealing.<\/p>\n<p><strong>GitHub:<\/strong> <a href=\"https:\/\/github.com\/xzf-thu\/Mega-ASR\" rel=\"noopener noreferrer\">https:\/\/github.com\/xzf-thu\/Mega-ASR<\/a><\/p>\n<p><strong>Paper:<\/strong> <a href=\"https:\/\/arxiv.org\/pdf\/2605.19833\" rel=\"noopener 
noreferrer\">https:\/\/arxiv.org\/pdf\/2605.19833<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Mega-ASR focuses on stable speech recognition in messy real-world acoustic environments. The goal is not to chase clean-speech scores but to reduce dropped words, wrong guesses, and empty output.<\/p>\n","protected":false},"author":8,"featured_media":8458,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[133,76,128,200],"tags":[],"class_list":["post-8459","post","type-post","status-publish","format-standard","hentry","category-133","category-76","category-128","category-200"],"_links":{"self":[{"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/posts\/8459","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/users\/8"}],"replies":[{"embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/comments?post=8459"}],"version-history":[{"count":0,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/posts\/8459\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/media\/8458"}],"wp:attachment":[{"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/media?parent=8459"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/categories?post=8459"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/tags?post=8459"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}