BriefGPT - AI 论文速递 ·

AlignMMBench：对大规模视觉 - 语言模型中的中文多模态对齐进行评估

📝

内容提要

本研究通过引入 AlignMMBench，一个专门为新兴的中文视觉 - 语言模型设计的综合对齐基准，从真实场景和中国互联网来源精心策划，并包括三个类别中的十三个具体任务，以及单轮和多轮对话场景。通过结合一个提示重写策略，AlignMMBench 包括 1054 个图像和 4978 个问答对。为了促进评估流程，我们提出了 CritiqueVLM，一个超越 GPT-4...

🏷️

继续阅读

RoboTTT——面向机器人策略的上下文扩展：将TTT集成至VLA中以推理时建立记忆信息，从而将视觉-运动上下文扩展到 8K 个时间步
摘要：本文提出RoboTTT方法，通过将测试时训练（TTT）机制整合到机器人基础模型中，实现了8K时间步的长视觉-运动上下文建模。该方法采用快速权重机制，...
GitHub Increased Instant Navigation from 4% to 22% by Rethinking Client Side Architecture
GitHub redesigned GitHub Issues navigation using a client-side architecture t...
Architecting offline-first generative AI applications for edge deployments using AWS services
According to Siemens’ 2024 report The True Cost of Downtime, Fortune 500 comp...
Automate custom PII detection at scale with Amazon Macie and Step Functions
Organizations in regulated industries like financial services, insurance, hea...
Samsung’s newest foldable finally feels Ultra
While we wait for Apple's rumored foldable iPhone, Samsung is polishing a...
Samsung’s wider Z Fold 8 feels just right
A year after overhauling its Z Fold phone with a radically thinner design, Sa...

内容提要

标签

继续阅读