钟意博客

The Non-Determinism of Large Language Models

💡 Original in Chinese, about 4,600 characters, roughly an 11-minute read.

📝

Summary

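In engineering practice, even with temperature=0 and seed=0, an LLM's output is still not guaranteed to be fully deterministic. The causes include the sampling configuration and numerical error, among others. The goal should be to keep model behavior within an acceptable range of stability rather than to chase absolute consistency. The article recommends handling the non-determinism through parameter tuning, caching, and logic in the layers above the model, and argues that LLMs are better suited to supporting decisions than to making them.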
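The caching idea mentioned above can be sketched as follows. This is a minimal illustration, not the article's implementation: `make_cached_llm`, `call_model`, and `flaky_model` are hypothetical names, and `flaky_model` is a stand-in that deliberately returns a different string on every call to imitate a backend whose output drifts even with fixed parameters.

```python
import hashlib
import itertools
import json
from typing import Callable, Dict


def make_cached_llm(call_model: Callable[[str, dict], str]) -> Callable[[str, dict], str]:
    """Wrap a model call so that identical requests return the first recorded answer."""
    cache: Dict[str, str] = {}

    def cached(prompt: str, params: dict) -> str:
        # Key on the prompt plus every sampling parameter, serialized in a
        # stable order so that equal requests always hash to the same key.
        key_src = json.dumps(
            {"prompt": prompt, "params": params},
            sort_keys=True,
            ensure_ascii=False,
        )
        key = hashlib.sha256(key_src.encode("utf-8")).hexdigest()
        if key not in cache:
            cache[key] = call_model(prompt, params)
        return cache[key]

    return cached


# Hypothetical stand-in for a real LLM client (assumption, not a real API):
# it returns a new answer on every call, regardless of the parameters.
_counter = itertools.count()


def flaky_model(prompt: str, params: dict) -> str:
    return f"{prompt} -> answer #{next(_counter)}"


if __name__ == "__main__":
    llm = make_cached_llm(flaky_model)
    params = {"temperature": 0, "seed": 0}
    first = llm("Summarize the report", params)
    second = llm("Summarize the report", params)
    print(first == second)  # the repeated request is served from the cache
```

The cache does not make the model itself deterministic. It only guarantees that a request already answered once is answered the same way again, which is often the stability a downstream system actually needs. A production version would likely also include the model name and version in the key and persist the cache outside the process.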
