BriefGPT - AI 论文速递 ·

TimeBench：大型语言模型中的时间推理能力全面评估

💡 原文中文，约400字，阅读约需1分钟。

📝

内容提要

通过创建TimeBench，研究人员发现最先进的LLMs在时间推理方面与人类存在显著差距，希望TimeBench能促进LLMs在时间推理方面的研究。

🎯

🏷️

华为云高校公开课走进中山大学，聚焦智能体时代企业级开发能力建设
7月13日，华为云开发者发展与运营部部长林华鼎受邀走进中山大学深圳校区电子与通信工程学院，为30名学生带来《AI编程实战：重构学习生活，洞见企业级开发》专...
RoboTTT——面向机器人策略的上下文扩展：将TTT集成至VLA中以推理时建立记忆信息，从而将视觉-运动上下文扩展到 8K 个时间步
摘要：本文提出RoboTTT方法，通过将测试时训练（TTT）机制整合到机器人基础模型中，实现了8K时间步的长视觉-运动上下文建模。该方法采用快速权重机制，...
Presentation: From Copy-Paste to Composition: Building Agents Like Real Software
Jake Mannix discusses moving AI agents past chaotic "1970s BASIC" arc...
I made a policy engine think it was in production
Kyverno is a Kubernetes-native policy engine that validates, mutates, and gen...
Meta made its own AI detection system. It should have just used Google’s
IIn March, Meta's Oversight Board called on the company to "meet its ...
The 2026 Honda Prelude is a marvel of hybrid technology
When it comes to enthusiast-geared Honda hardware, the Civic Si, Civic Type R...