红杉中国于今天正式推出一款全新的AI基准测试工具xbench( xbench.org),并发布论文《xbench: Tracking Agents Productivity, Scaling with Profession-Aligned Real-world Evaluations》。首期发布包含