
{"id":9735,"date":"2026-06-28T16:56:28","date_gmt":"2026-06-28T08:56:28","guid":{"rendered":"https:\/\/infernews.com\/blog\/healthcare-research\/"},"modified":"2026-06-28T16:56:28","modified_gmt":"2026-06-28T08:56:28","slug":"healthcare-research","status":"publish","type":"post","link":"https:\/\/infernews.com\/blog\/healthcare-research\/","title":{"rendered":"OpenBioRQ \u7528\u672a\u89e3\u91ab\u5b78\u554f\u984c\u6e2c\u8a66 AI \u4ee3\u7406"},"content":{"rendered":"\n<figure class=\"wp-block-image size-large\"><img decoding=\"async\" src=\"https:\/\/infernews.com\/blog\/wp-content\/uploads\/2026\/06\/pasted-ee5bcc0d8a0b.jpg\" alt=\"Repository image for minstar\/healthcare-research\"><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">OpenBioRQ \u662f\u4e00\u500b\u751f\u7269\u91ab\u5b78\u57fa\u6e96\u8cc7\u6599\u96c6\u517c\u8a55\u6e2c\u6d41\u7a0b\uff0c\u805a\u7126\u65bc<strong>\u76ee\u524d\u4ecd\u672a\u89e3\u6c7a<\/strong>\u7684 biomedical \/ clinical research questions\u3002\u5b83\u8981\u89e3\u6c7a\u7684\u4e0d\u662f\u80cc\u7b54\u6848\u80fd\u529b\uff0c\u800c\u662f\u6e2c\u8a66 LLMs \u5728 agentic tool use \u60c5\u5883\u4e0b\uff0c\u80fd\u5426\u81ea\u5df1\u627e\u8b49\u64da\u3001\u6b63\u78ba\u5f15\u7528\u6587\u737b\uff0c\u4e26\u5728\u6c92\u6709\u5b9a\u8ad6\u6642\u4fdd\u6301 abstention\u3002<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">\u73fe\u6709 benchmark \u591a\u6578\u63a1\u7528\u56fa\u5b9a\u7b54\u6848 key \u7684\u554f\u7b54\u7bc4\u5f0f\uff0c\u6a21\u578b\u6709\u6a5f\u6703\u9760\u8a18\u61b6\u6216\u7dda\u7d22\u53cd\u63a8\u6a19\u6e96\u7b54\u6848\uff0c\u672a\u5fc5\u771f\u7684\u9a57\u8b49\u904e\u4f86\u6e90\u3002OpenBioRQ \u76f4\u63a5\u6539\u7528 retrieval-grounded openness\uff1a\u6bcf\u689d\u554f\u984c\u7684 open_status \u6703\u7528\u5f8c\u7e8c\u8ad6\u6587\u8207 trial records \u91cd\u65b0\u6838\u5c0d\uff1b\u96e3\u5ea6\u4e5f\u4e0d\u662f\u4f5c\u8005\u4e3b\u89c0\u6a19\u793a\uff0c\u800c\u662f\u5148\u8b93\u5f37\u6a21\u578b\u9023\u5de5\u5177\u4e00\u8d77\u8dd1\uff0c\u518d\u7528 pass\/fail \u7d50\u679c\u754c\u5b9a\u54ea\u4e9b\u984c\u76ee\u771f\u7684\u96e3\u3002<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">\u9805\u76ee\u7684\u8cc7\u6599\u6d41\u7a0b\u76f8\u7576\u5b8c\u6574\uff0c\u5f9e crawl\u3001extract\u3001refine\u3001dedup\uff0c\u5230 status verification\u3001contamination audit\u3001agentic-eval \u90fd\u6709\u6e05\u695a\u5206\u5de5\u3002README \u986f\u793a\u5b83\u4ee5 v3 \u7684 12,553 \u984c\u70ba\u57fa\u790e\uff0c\u53e6\u6709 frozen core \u4f5c\u4e3b\u8981\u8a55\u6e2c\u96c6\uff1brefine \u6b65\u9a5f\u4ea6\u628a\u554f\u984c\u6574\u7406\u6210\u8f03\u81ea\u8db3\u7684\u8868\u8ff0\uff0c\u81ea\u542b\u6027\u7531 51.6% \u63d0\u5347\u5230 85.4%\uff0c\u9019\u5c0d\u6a21\u578b\u548c\u4eba\u5de5\u8a55\u5be9\u90fd\u91cd\u8981\u3002<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">\u5b83\u548c\u540c\u985e\u505a\u6cd5\u6700\u5927\u7684\u5206\u5225\uff0c\u662f\u628a\u300c\u5f15\u7528\u53ef\u6253\u958b\u300d\u8207\u300c\u5f15\u7528\u771f\u7684\u652f\u6301\u7b54\u6848\u300d\u5206\u958b\u770b\u3002\u9805\u76ee\u6307\u51fa agent citations \u8d85\u904e 99% \u53ef\u4ee5\u89e3\u6790\uff0c\u4f46\u7d04 15.9% \u5176\u5be6\u9023\u5230\u932f\u8aa4\u8ad6\u6587\uff1b\u540c\u6642\u6700\u96e3\u984c\u7d44\u51fa\u73fe agentic collapse\uff0c\u90e8\u5206\u6a21\u578b\u5c31\u7b97\u95dc\u6389\u5de5\u5177\uff0c\u5206\u6578\u8b8a\u5316\u4e5f\u4e0d\u5927\uff0c\u53cd\u6620\u5de5\u5177\u8abf\u7528\u672a\u5fc5\u81ea\u7136\u8f49\u5316\u6210\u66f4\u597d\u63a8\u7406\u3002<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>\u985e\u578b\u5b9a\u4f4d<\/strong>\uff1a\u5c6c\u65bc\u57fa\u6e96\u8cc7\u6599\u96c6\u52a0\u8a55\u6e2c pipeline\uff0c\u4e0d\u662f\u81e8\u5e8a\u6c7a\u7b56\u7cfb\u7d71<\/li>\n<li><strong>\u4e3b\u8981\u50f9\u503c<\/strong>\uff1a\u6aa2\u67e5 evidence retrieval\u3001faithful citation \u8207 abstention\uff0c\u800c\u975e\u8003\u6a21\u578b\u80cc\u8aa6<\/li>\n<li><strong>\u8a55\u6e2c\u8a2d\u8a08<\/strong>\uff1a\u7528 per-question checklist rubrics \u56fa\u5b9a\u8a55\u5206\uff0cinter-judge agreement \u7531 Spearman 0.35 \u5347\u5230 0.82<\/li>\n<li><strong>\u8cc7\u6599\u53ef\u9760\u6027<\/strong>\uff1acore 657 \u8207 expand 483 \u5747\u5831\u544a contamination hard 0%<\/li>\n<li><strong>\u76f8\u95dc\u6a21\u578b<\/strong>\uff1aGoogle\u3001Anthropic\u3001OpenAI \u4e09\u689d\u7368\u7acb lineage\uff0c\u4ee5\u53ca README \u63d0\u5230\u7684 GLM-5.1\u3001MiniLM-L6<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">\u53d7\u60e0\u6700\u5927\u7684\u6703\u662f\u505a\u91ab\u7642\u7814\u7a76\u52a9\u7406\u3001\u6587\u737b\u6aa2\u7d22\u4ee3\u7406\u3001\u91ab\u5b78 AI \u8a55\u6e2c\u7684\u5718\u968a\uff0c\u800c\u4e0d\u662f\u60f3\u76f4\u63a5\u62ff\u53bb\u505a\u8a3a\u65b7\u7684\u6a5f\u69cb\u3002\u5b83\u76ee\u524d\u66f4\u50cf\u4e00\u500b\u7814\u7a76\u57fa\u5efa\u9805\u76ee\uff1a\u5e6b\u4eba\u770b\u6e05\u695a\u6a21\u578b\u5728\u9ad8\u4e0d\u78ba\u5b9a\u3001\u7121\u6a19\u6e96\u7b54\u6848\u5834\u666f\u4e0b\uff0c\u7a76\u7adf\u662f\u6709\u80fd\u529b\u627e\u8b49\u64da\uff0c\u9084\u662f\u53ea\u662f\u5728\u751f\u6210\u770b\u4f3c\u5408\u7406\u7684\u56de\u7b54\u3002<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><a href=\"https:\/\/minstar.github.io\/OpenBioRQ\/\" rel=\"noopener noreferrer\" target=\"_blank\"><strong>\u9805\u76ee\u4e3b\u9801<\/strong><\/a> \u00b7 <a href=\"https:\/\/github.com\/minstar\/healthcare-research\" rel=\"noopener noreferrer\" target=\"_blank\"><strong>GitHub<\/strong><\/a> \u00b7 <a href=\"https:\/\/arxiv.org\/pdf\/2606.21959\" rel=\"noopener noreferrer\" target=\"_blank\"><strong>Paper<\/strong><\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>\u9019\u4e0d\u662f\u4e00\u822c\u91ab\u5b78\u554f\u7b54\u96c6\uff0c\u800c\u662f\u7528\u672a\u89e3\u7814\u7a76\u554f\u984c\u6aa2\u9a57 AI \u4ee3\u7406\u627e\u8b49\u64da\u8207\u5f15\u7528\u662f\u5426\u53ef\u4fe1\u3002\u5b83\u66f4\u91cd\u8996\u80fd\u5426\u8aa0\u5be6\u4fdd\u7559\u4e0d\u78ba\u5b9a\u6027\u3002<\/p>\n","protected":false},"author":8,"featured_media":9734,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"ai_generated_summary":"","footnotes":""},"categories":[133,185,178,140,147,153,116,151,144,191,199],"tags":[],"class_list":["post-9735","post","type-post","status-publish","format-standard","hentry","category-133","category-qwen","category-google","category-gemini","category-deepseek","category-openai","category-agentic","category-mcp","category-medical","category-anthropic","category-dataset-"],"_links":{"self":[{"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/posts\/9735","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/users\/8"}],"replies":[{"embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/comments?post=9735"}],"version-history":[{"count":0,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/posts\/9735\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/media\/9734"}],"wp:attachment":[{"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/media?parent=9735"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/categories?post=9735"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/tags?post=9735"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}