结构之法算法之道 ·

图像分割的大变革：从SAM(分割一切)到FastSAM、MobileSAM

💡 原文中文，约7900字，阅读约需19分钟。

📝

内容提要

本文介绍了通用的图像分割模型SAM（Segment Anything Model），包括其结构和训练过程。SAM模型通过交互式标注和半自动标注的方法生成大量的掩码数据，并使用多次迭代和多种损失函数优化模型。最终，SAM模型在1100万张图像上生成了11亿个高质量的掩码。

🎯

关键要点

SAM是一种通用的图像分割模型，可以处理所有类型的图像。
与专有模型不同，SAM在多个领域的图像分割中表现良好。
SAM模型通过提示灵活分割，支持零样本和少样本任务。
模型结构包括prompt encoder、image encoder和mask decoder。
数据引擎分为人工辅助、半自动和全自动三步，生成大量掩码数据。
训练过程中使用focal loss和dice loss进行优化，训练时间为3-5天。
最终在1100万张图像上生成了11亿个高质量的掩码，99.1%为自动生成，质量高。

🏷️

标签

SAM模型交互式标注图像分割损失函数掩码数据

➡️

继续阅读

Building multi-Region resiliency for AWS CloudFormation custom resource deployment
AWS CloudFormation is the foundational tool of infrastructure-as-code for tho...
ReSharper C++ 2026.2: C++26 Reflection, ISPC Language Support, And More
ReSharper C++ 2026.2 is out, bringing initial support for C++26 reflection, t...
Rider 2026.2: IDE Intelligence for AI Agents, Faster Performance, and Spectacular Game Dev Updates
Rider 2026.2 opens up the IDE’s own intelligence to your AI coding agents, so...
ReSharper 2026.2: AI Agent Freedom in Visual Studio, .NET Debugging for VS Code, and More
ReSharper 2026.2 takes the first step toward ACP-based agent support in Visua...
GitHub Increased Instant Navigation from 4% to 22% by Rethinking Client Side Architecture
GitHub redesigned GitHub Issues navigation using a client-side architecture t...
Kaggle + Google’s Free 5-Day Agentic AI Course
Google and Kaggle's 5-Day AI agents course is now freely available to everyone.