BriefGPT - AI 论文速递 ·

PerceptionLM: Open-Access Data and Models for Detailed Visual Understanding

💡 原文英文，约100词，阅读约需1分钟。

📝

内容提要

本研究提出了PerceptionLM框架，解决了视觉语言模型的闭源问题，并发布了280万个人工标注的视频问答对，以促进详细视频理解。同时推出的PLM-VideoBench评估套件推动了透明研究的进展。

🎯

🏷️

Why R&D Data Belongs in the Lakehouse - and Why Agents Need It There
The setupAt cellcentric, a joint venture of Daimler Truck and Volvo Group, we...
What’s new: Air gets more agents, local models, and Java/Kotlin code intelligence
The new release of JetBrains Air brings support for GitHub Copilot, OpenCode,...
Google ships 3 new Gemini models. Just not the one everyone’s waiting for.
Google on Tuesday launched three new Gemini models: Gemini 3.6 Flash, a cheap...
Google launches a cheaper alternative to large AI security models like Mythos
Google is launching Gemini 3.6 Flash alongside a new security model dedicated...
Inside Roblox’s Bet on World Models
We sat down with Anupam Singh, senior vice president of engineering at Roblox...
“Second only to Fable 5:” Alibaba talks the talk with Qwen3.8 without providing any real data
Alibaba has revealed Qwen 3.8, its latest, greatest large language model (LLM...