BriefGPT - AI 论文速递 ·

MBTSAD: Mitigating Backdoor Attacks in Language Models Based on Token Splitting and Attention Distillation

💡 原文英文，约100词，阅读约需1分钟。

📝

内容提要

本研究提出MBTSAD方法，旨在解决语言模型在后门攻击下的脆弱性。该方法通过少量干净数据重训练后门模型，并利用注意力蒸馏进行优化，实验证明其效果与依赖预训练权重的方法相当，同时在干净数据上保持了良好性能，具有更高的实用性。

🎯

🏷️

早报｜iPhone 20 Pro Max或搭载约7英寸屏幕/曝小米上调今年手机出货目标至1.1亿部/Gemini 3.6 Flash发布,输出Token减少17%
· 曝苹果正打样约 7 英寸屏幕，或用于 iPhone 20 Pro Max · 曝小米上调今年手机出货目标至 1.1 亿部 · 张一鸣向芳梅公益基金追加...
从 Token 价格战到成功任务单位经济学：AI 成本战的真正主线（上） - 张善友
AI 行业过去最喜欢讲的是"能力"，今天越来越必须讲的是"结果"。"有用智能每人民币"（Useful In...
谷歌Gemini 3.6 Flash发布：输出token暴降17%，价格战打到了七块五
谷歌AI模型更新引爆价格战，谁还敢说Flash系列只是“快枪手”？ Google一口气甩出三款新模型，直接把AI价格战打到了每百万token七块五毛钱，这...
What’s new: Air gets more agents, local models, and Java/Kotlin code intelligence
The new release of JetBrains Air brings support for GitHub Copilot, OpenCode,...
Google ships 3 new Gemini models. Just not the one everyone’s waiting for.
Google on Tuesday launched three new Gemini models: Gemini 3.6 Flash, a cheap...
Google launches a cheaper alternative to large AI security models like Mythos
Google is launching Gemini 3.6 Flash alongside a new security model dedicated...