BriefGPT - AI 论文速递 ·

伪随机纠错码

💡 原文中文，约500字，阅读约需2分钟。

📝

内容提要

本文介绍了一种在自回归语言模型中种植水印的方法，该方法对扰动具有鲁棒性。通过使用随机水印密钥计算的随机数序列映射到语言模型的样本来生成带水印的文本。实验证明该方法对各种释义攻击具有统计功率和鲁棒性。

🎯

关键要点

提出了一种在自回归语言模型中种植水印的方法，具有鲁棒性。
水印通过随机水印密钥计算的随机数序列映射到语言模型的样本生成。
任何知道密钥的一方可以将文本与随机数序列对齐以检测水印。
使用逆变换采样和指数最小采样两种方案实例化水印方法。
在OPT-1.3B和LLaMA-7B模型中，即使破坏40-50%的标记，仍能可靠检测水印。
对于Alpaca-7B模型，水印响应的检测较困难，约25%的响应可在p≤0.01条件下检测到。
水印对某些自动释义攻击的鲁棒性较差。

➡️

继续阅读

How the Galaxy Z Fold 8 and Z Flip 8 phones compare
Samsung's latest round of folding Galaxy Z phones and updated smartwatche...
Preorders for Samsung’s new Z Fold and Flip 8 come with up to $350 in gift cards
Samsung's newest foldables are here. At Galaxy Unpacked, the company anno...
Philips’ new smart toothbrush shows you where you didn’t properly brush
The latest addition to Philips' Sonicare line of smart electric toothbrus...
Microsoft is bringing original Xbox games to PC
Microsoft is expanding its Xbox backward compatibility efforts today by bring...
Not just development, distribution of software may change as well
Even if you are as averse to semver as I used to be in the course of my progr...
AMD commits up to $5 billion to Anthropic
AMD says it's going to invest up to $5 billion in Anthropic, while helpin...