
{"id":8816,"date":"2026-06-04T19:11:58","date_gmt":"2026-06-04T11:11:58","guid":{"rendered":"https:\/\/infernews.com\/blog\/ovo-s-bench\/"},"modified":"2026-06-04T19:11:58","modified_gmt":"2026-06-04T11:11:58","slug":"ovo-s-bench","status":"publish","type":"post","link":"https:\/\/infernews.com\/blog\/ovo-s-bench\/","title":{"rendered":"OVO-S-Bench: Testing the Streaming Spatial Intelligence of Multimodal Models"},"content":{"rendered":"<figure class=\"wp-block-image size-large\"><img decoding=\"async\" src=\"https:\/\/infernews.com\/blog\/wp-content\/uploads\/2026\/06\/pasted-fd3da6fe0595.jpg\" alt=\"OVO-S-Bench overview\"><\/figure>\n<p>OVO-S-Bench, released jointly by Tsinghua University, Shanghai AI Laboratory, and Beihang University, is a benchmark that tests how well Multimodal Large Language Models (MLLMs) understand space in continuous video. It targets real-world settings such as robots, AR glasses, and autonomous driving, where a system must \u201cthink while watching\u201d: the model has to reason about changes in location and layout from the footage seen before the question timestamp, rather than from the whole video.<\/p>\n<p>The questions draw on varied sources, covering indoor tours, first-person activities, outdoor scenes, driving footage, and environments with 3D annotations, for a total of 348 videos. Twelve annotators with 3D vision backgrounds spent about 804 hours writing and repeatedly cross-checking every question. A \u201ctext probe\u201d and a blind review step then removed any question that could be answered from the question text or common sense alone, so that the difficulty genuinely comes from spatial understanding.<\/p>\n<p>The questions fall into four difficulty levels: instantaneous perception of the current frame (Instantaneous Egocentric Perception), tracking spatial context that has left the field of view (Spatiotemporal Context Tracking), generative reasoning that infers spatial changes (Generative Spatial Reasoning), and building a global topological map (Global Topological Mapping). Across 38 open-source and commercial models, even the best performer, Gemini-3.1-Pro, scores about 27 points below human experts (59.2 vs. 86.6), and the global topological level is the biggest bottleneck.<\/p>\n<p>More notably, some models that claim to be fine-tuned for streaming or spatial tasks perform worse than their underlying base models, and ungrounded chain-of-thought reasoning tends to amplify spatial errors. The benchmark gives the next generation of streaming spatial models a clear and rigorous touchstone.<\/p>\n<p>Key takeaways:<\/p>\n<ul>\n<li>1,680 human-written questions across 348 videos, with about 804 hours of total annotation effort<\/li>\n<li>Each question has a query timestamp and an evidence interval; during evaluation the model sees only the video before the query<\/li>\n<li>Four progressively harder levels, from instantaneous perception to global topological mapping<\/li>\n<li>Among 38 MLLMs, Gemini-3.1-Pro scores 59.2 against 86.6 for human experts<\/li>\n<li>Models fine-tuned for streaming or spatial tasks may underperform their original base models<\/li>\n<\/ul>\n<p><strong>GitHub:<\/strong> <a href=\"https:\/\/github.com\/InternLM\/OVO-S-Bench\" rel=\"noopener noreferrer\">https:\/\/github.com\/InternLM\/OVO-S-Bench<\/a><\/p>\n<p><strong>Project:<\/strong> <a href=\"https:\/\/internlm.github.io\/OVO-S-Bench\/\" rel=\"noopener noreferrer\">https:\/\/internlm.github.io\/OVO-S-Bench\/<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>OVO-S-Bench uses 1,680 questions to evaluate the spatial cognition of multimodal models in continuous video across four difficulty levels, revealing a clear gap between top models and humans.<\/p>\n","protected":false},"author":8,"featured_media":8815,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"ai_generated_summary":"","wpai_meta_description":"","footnotes":""},"categories":[133,196,197,200],"tags":[],"class_list":["post-8816","post","type-post","status-publish","format-standard","hentry","category-133","category-196","category-framework","category-200"],"_links":{"self":[{"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/posts\/8816","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/users\/8"}],"replies":[{"embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/comments?post=8816"}],"version-history":[{"count":0,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/posts\/8816\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/media\/8815"}],"wp:attachment":[{"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/media?parent=8816"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/categories?post=8816"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/tags?post=8816"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}