BriefGPT - AI 论文速递 ·

Taste More, Taste Better: Enhancing Semi-Supervised Crowd Counting with Diverse Data and Strong Models

💡 原文英文，约100词，阅读约需1分钟。

📝

内容提要

本研究提出了一种名为“多尝试，更美味”（TMTB）的框架，旨在降低密集场景中的标注成本。通过图像修复技术和视觉状态空间模型，增强数据多样性，显著提高了在极端拥挤和低光环境下的人群计数准确性。实验结果表明，该方法在多个基准数据集上超越了现有最优方案。

🎯

关键要点

本研究提出了一种名为“多尝试，更美味”（TMTB）的框架，旨在降低密集场景中的标注成本。
该框架通过图像修复技术增强数据多样性，提升了人群计数的准确性。
引入视觉状态空间模型以捕捉人群场景的全局上下文信息。
实验结果表明，该方法在极端拥挤和低光环境下的计数准确性显著提高。
在多个基准数据集上，该方法超越了现有最优方案。

🏷️

标签

models 人群计数低光环境图像修复数据增强视觉状态空间模型

➡️

继续阅读

Why R&D Data Belongs in the Lakehouse - and Why Agents Need It There
The setupAt cellcentric, a joint venture of Daimler Truck and Volvo Group, we...
What’s new: Air gets more agents, local models, and Java/Kotlin code intelligence
The new release of JetBrains Air brings support for GitHub Copilot, OpenCode,...
Experience Better Browsing: Introducing Native Containers in Firefox 153
Today, we’re excited to announce the Preview of Containers in Firefox version...
Google ships 3 new Gemini models. Just not the one everyone’s waiting for.
Google on Tuesday launched three new Gemini models: Gemini 3.6 Flash, a cheap...
Google launches a cheaper alternative to large AI security models like Mythos
Google is launching Gemini 3.6 Flash alongside a new security model dedicated...
Inside Roblox’s Bet on World Models
We sat down with Anupam Singh, senior vice president of engineering at Roblox...