BriefGPT - AI 论文速递 ·

Erased but Not Forgotten: How Backdoors Affect Concept Erasure

💡 原文英文，约100词，阅读约需1分钟。

📝

内容提要

本研究探讨了机器学习中概念抹除技术的安全漏洞，提出了“有害抹除”威胁模型，并通过后门攻击实例揭示现有抹除算法的脆弱性，指出当前策略存在重大安全隐患。

🎯

🏷️

America needs to stop getting shocked by Chinese AI
Last week, two Chinese AI companies unveiled models they say can credibly com...
Platform engineering for the agentic enterprise: Managing applications, resources, and AI agents
Platform engineering is evolving Platform engineering has become one of the d...
Why your agent needs access to your documentation
What 1,192 agent conversations taught us about knowledge base search A few mo...
在线教程｜一键加载ComfyUI工作流，不写一行代码也能玩转AI绘图
同时，ComfyUI 具备开放的扩展生态，支持社区自定义节点，可接入 LoRA、ControlNet、量化模型等多种能力，满足图像生成、图像编辑、视频生成...
2026年了，核弹还是fastjson，fastjson1.2.83 RCE是怎么回事？
7月19日，推上的一名安全研究员声称，他发现了一个在fastjson 1.2.83版本中无需gadget的RCE漏洞。一时间激起千帆浪。 Fastjson...
LWiAI Podcast #248 - Opus 4.8, MAI, Anthropic IPO, Minimax-M3
Exploring Claude Fable 5’s impact, Siri AI’s latest enhancements, and the com...