
{"id":8810,"date":"2026-06-04T18:13:28","date_gmt":"2026-06-04T10:13:28","guid":{"rendered":"https:\/\/infernews.com\/blog\/audio-interaction-ai\/"},"modified":"2026-06-04T18:13:28","modified_gmt":"2026-06-04T10:13:28","slug":"audio-interaction-ai","status":"publish","type":"post","link":"https:\/\/infernews.com\/blog\/audio-interaction-ai\/","title":{"rendered":"Audio-Interaction\uff1a\u8b93 AI \u50cf\u771f\u4eba\u4e00\u6a23\u5373\u6642\u807d\u8207\u56de\u61c9"},"content":{"rendered":"<figure class=\"wp-block-image size-large\"><img decoding=\"async\" src=\"https:\/\/infernews.com\/blog\/wp-content\/uploads\/2026\/06\/top-df567dc6324e.jpg\" alt=\"Audio-Interaction teaser\"><\/figure>\n<p><strong>Audio-Interaction<\/strong> \u662f\u4e00\u6b3e\u7531\u5357\u6d0b\u7406\u5de5\u5927\u5b78\uff08NTU\uff09\u3001\u65b0\u52a0\u5761\u570b\u7acb\u5927\u5b78\uff08NUS\uff09\u53ca\u9999\u6e2f\u4e2d\u6587\u5927\u5b78\uff08CUHK\uff09\u5171\u540c\u7814\u767c\u7684\u5168\u958b\u6e90\u97f3\u8a0a\u8a9e\u8a00\u6a21\u578b\uff0c\u5c6c\u65bc\u65b0\u4e00\u4ee3\u7684 Audio Interaction Model\uff08\u97f3\u8a0a\u4e92\u52d5\u6a21\u578b\uff09\u3002\u5b83\u4ee5\u4e00\u500b\u59cb\u7d42\u904b\u884c\u7684\u611f\u77e5\u2014\u6c7a\u7b56\u2014\u56de\u61c9\u5faa\u74b0\uff08perceive-decide-respond loop\uff09\u70ba\u6838\u5fc3\uff0c\u80fd\u5373\u6642\u8046\u807d\u74b0\u5883\u8072\u97f3\u8207\u6307\u4ee4\uff0c\u4e26\u81ea\u884c\u5224\u65b7\u4f55\u6642\u61c9\u8a72\u958b\u53e3\u56de\u61c9\u3002<\/p>\n<p>\u50b3\u7d71\u7684\u5927\u578b\u97f3\u8a0a\u8a9e\u8a00\u6a21\u578b\u5927\u591a\u53ea\u652f\u63f4\u96e2\u7dda\u8655\u7406\uff0c\u800c\u73fe\u6709\u7684\u4e32\u6d41\u6a21\u578b\u4e00\u822c\u53ea\u80fd\u505a\u55ae\u4e00\u4efb\u52d9\uff0c\u4f8b\u5982\u5373\u6642\u8a9e\u97f3\u8fa8\u8b58\uff08streaming ASR\uff09\u6216\u8a9e\u97f3\u804a\u5929\u3002Audio-Interaction \u4ee5\u55ae\u4e00\u67b6\u69cb\u540c\u6642\u8986\u84cb\u96e2\u7dda\u8207\u5373\u6642\u4efb\u52d9\uff0c\u628a\u8fa8\u8b58\u3001\u7ffb\u8b6f\u3001\u5c0d\u8a71\u7b49\u4e0d\u540c\u529f\u80fd\u7d71\u4e00\u5728\u540c\u4e00\u689d\u4e32\u6d41\u4e2d\u3002\u9019\u610f\u5473\u8457\u958b\u767c\u8005\u53ea\u9700\u8981\u4e00\u5957\u6a21\u578b\uff0c\u5c31\u80fd\u61c9\u4ed8\u591a\u7a2e\u97f3\u8a0a\u4e92\u52d5\u5834\u666f\u3002<\/p>\n<p>\u9019\u500b\u9805\u76ee\u7684\u6838\u5fc3\u5275\u65b0\u5728\u65bc\u5176\u8a13\u7df4\u6d41\u7a0b <strong>SoundFlow<\/strong>\u3002\u5b83\u80fd\u628a\u77ed\u97f3\u8a0a\u7247\u6bb5\u62fc\u63a5\u6210\u9577\u4e92\u52d5\u8cc7\u6599\uff0c\u4e26\u4ee5\u300c\u584a\u7d1a\u6c7a\u7b56\u8a13\u7df4\u300d\uff08chunk-level decision training\uff09\u914d\u5408\u6b77\u53f2\u56de\u9867\u8207\u8a9e\u610f\u611f\u77e5\u7684\u975c\u97f3\u8655\u7406\uff0c\u8b93\u6a21\u578b\u5b78\u6703\u300c\u8a72\u4e0d\u8a72\u8aaa\u8a71\u300d\u3002\u5728\u63a8\u8ad6\u968e\u6bb5\uff0cSoundFlow \u63a1\u7528\u7570\u6b65 FIFO \u63a8\u8ad6\uff08asynchronous FIFO inference\uff09\uff0c\u4f7f\u9996\u5e40\u5ef6\u9072\u964d\u4f4e\u7d04 4.5 \u500d\uff0c\u5e36\u4f86\u66f4\u6d41\u66a2\u7684\u5373\u6642\u9ad4\u9a57\u3002<\/p>\n<p>\u4f7f\u7528\u6642\uff0c\u958b\u767c\u8005\u53ef\u4ee5\u76f4\u63a5\u5f9e\u5b98\u65b9\u9801\u9762\u53d6\u5f97\u6280\u8853\u5831\u544a\u8207\u7a0b\u5f0f\u78bc\uff0c\u4e26\u900f\u904e\u5fae\u4fe1\u7fa4\u7d44\u52a0\u5165\u793e\u7fa4\u8a0e\u8ad6\u3002\u8a72\u9805\u76ee\u4ea6\u63d0\u4f9b\u4e86\u5373\u6642\u8a66\u807d Demo\uff0c\u53ef\u8207 OpenAI \u7684 gpt-realtime \u53ca\u5b57\u7bc0\u8df3\u52d5\u7684 Seeduplex \u9032\u884c\u540c\u689d\u4ef6\u6bd4\u8f03\uff0c\u5728\u91cd\u8907\u8072\u97ff\u8a08\u6578\u3001\u54b3\u55fd\u8fa8\u8b58\u53ca\u97f3\u6a02\u98a8\u683c\u5224\u65b7\u7b49\u5834\u666f\u4e2d\uff0cAudio-Interaction \u80fd\u9010\u8f2a\u8f38\u51fa\u6709\u610f\u7fa9\u7684\u56de\u61c9\u3002<\/p>\n<p><strong>Audio-Interaction \u91cd\u9ede\u6458\u8981\uff1a<\/strong><\/p>\n<ul>\n<li><strong>\u7d71\u4e00\u67b6\u69cb<\/strong>\uff1a\u4ee5\u55ae\u4e00\u6a21\u578b\u540c\u6642\u652f\u63f4\u96e2\u7dda\u8207\u5373\u6642\u97f3\u8a0a\u4efb\u52d9\uff0c\u6db5\u84cb\u8fa8\u8b58\u3001\u7ffb\u8b6f\u53ca\u5c0d\u8a71\u3002<\/li>\n<li><strong>\u611f\u77e5\u2014\u6c7a\u7b56\u2014\u56de\u61c9\u5faa\u74b0<\/strong>\uff1a\u6a21\u578b\u81ea\u884c\u5224\u65b7\u56de\u61c9\u6642\u6a5f\uff0c\u8cbc\u8fd1\u771f\u5be6\u4eba\u6a5f\u4e92\u52d5\u7bc0\u594f\u3002<\/li>\n<li><strong>SoundFlow \u8a13\u7df4\u6d41\u7a0b<\/strong>\uff1a\u7d50\u5408\u8cc7\u6599\u62fc\u63a5\u3001\u584a\u7d1a\u6c7a\u7b56\u8a13\u7df4\u8207\u975c\u97f3\u611f\u77e5\uff0c\u63d0\u5347\u5373\u6642\u5224\u65b7\u80fd\u529b\u3002<\/li>\n<li><strong>\u4f4e\u5ef6\u9072\u63a8\u8ad6<\/strong>\uff1a\u7570\u6b65 FIFO \u63a8\u8ad6\u4f7f\u9996\u5e40\u5ef6\u9072\u964d\u4f4e\u7d04 4.5 \u500d\u3002<\/li>\n<li><strong>\u5b8c\u5168\u958b\u6e90<\/strong>\uff1a\u63d0\u4f9b\u6280\u8853\u5831\u544a\u3001\u7a0b\u5f0f\u78bc\u53ca\u5373\u6642\u8a66\u807d Demo\uff0c\u65b9\u4fbf\u7814\u7a76\u8207\u61c9\u7528\u3002<\/li>\n<\/ul>\n<p>\u9019\u500b\u9805\u76ee\u7279\u5225\u9069\u5408\u5f9e\u4e8b\u8a9e\u97f3 AI\u3001\u5c0d\u8a71\u7cfb\u7d71\u53ca\u591a\u6a21\u614b\u4e92\u52d5\u7814\u7a76\u7684\u958b\u767c\u8005\u8207\u5718\u968a\uff0c\u80fd\u70ba\u9700\u8981\u5373\u6642\u97f3\u8a0a\u7406\u89e3\u7684\u7522\u54c1\uff0c\u4f8b\u5982\u667a\u80fd\u52a9\u624b\u3001\u6703\u8b70\u8a18\u9304\u3001\u807d\u969c\u8f14\u52a9\u7b49\uff0c\u63d0\u4f9b\u4e00\u500b\u7d71\u4e00\u4e14\u9748\u6d3b\u7684\u57fa\u790e\u6a21\u578b\u3002<\/p>\n<p><strong>\u9805\u76ee\uff1a<\/strong> <a href=\"https:\/\/xzf-thu.github.io\/Audio-Interaction\/\" rel=\"noopener noreferrer\">https:\/\/xzf-thu.github.io\/Audio-Interaction\/<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Audio-Interaction \u662f\u4e00\u6b3e\u5168\u65b0\u7684\u958b\u6e90\u97f3\u8a0a\u8a9e\u8a00\u6a21\u578b\uff0c\u63a1\u7528\u611f\u77e5-\u6c7a\u7b56-\u56de\u61c9\u7684\u5faa\u74b0\u6a5f\u5236\uff0c\u7d71\u4e00\u8655\u7406\u5373\u6642\u8207\u96e2\u7dda\u97f3\u8a0a\u4efb\u52d9\u3002<\/p>\n","protected":false},"author":8,"featured_media":8809,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"ai_generated_summary":"","wpai_meta_description":"","footnotes":""},"categories":[133,164,76,127,128],"tags":[],"class_list":["post-8810","post","type-post","status-publish","format-standard","hentry","category-133","category-164","category-76","category-127","category-128"],"_links":{"self":[{"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/posts\/8810","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/users\/8"}],"replies":[{"embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/comments?post=8810"}],"version-history":[{"count":0,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/posts\/8810\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/media\/8809"}],"wp:attachment":[{"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/media?parent=8810"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/categories?post=8810"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/tags?post=8810"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}