小红花·文摘

基于令牌的真实检测：面向生产大型语言模型的实时幻觉检测

vLLM Blog ·

评估评估指标——幻觉检测的幻影

Apple Machine Learning Research ·

分离安全适配器实现高效的安全防护和灵活的推理时对齐

Apple Machine Learning Research ·

视频幻觉检测器：评估大型视频语言模型中的内在和外在幻觉

DEV Community ·

该研究提出RePPL方法，旨在提升大型语言模型在幻觉检测中的解释能力。通过重新校准不确定性测量，提供可解释的标记级不确定性分数。实验结果显示，该方法在问答数据集上表现优异，揭示了幻觉的混乱模式，具有广泛的应用潜力。

RePPL: Recalibrating Perplexity through Uncertainty in Semantic Propagation and Language Generation for Explainable QA Hallucination Detection

BriefGPT - AI 论文速递 ·

本研究通过引入不确定性量化模块，显著提升了大语言模型对不确定性的捕捉能力，增强了幻觉检测性能和可靠性评估。

One Head for Prediction, One Head for Scrutiny: A Pre-trained Uncertainty Quantification Head for Detecting Hallucinations in Large Language Model Outputs

BriefGPT - AI 论文速递 ·

本研究针对大语言模型在长上下文中生成虚假或矛盾信息的问题，构建了专门的数据集并提出了新架构，显著提高了幻觉检测的效果和推理速度。

Research on Long Context Hallucination Detection

BriefGPT - AI 论文速递 ·

本研究提出了一种零资源幻觉检测框架，专门针对大型语言模型在医疗和金融等高风险领域的应用问题。实验结果显示，该方法在准确性和可靠性上优于传统检测手段。

Uncertainty Quantification in Language Models: A Suite of Black-Box, White-Box, LLM Evaluators, and Ensemble Scorers

BriefGPT - AI 论文速递 ·

本文介绍了一种新系统HDM-2，用于检测企业环境中大语言模型输出的幻觉，填补了研究空白。HDM-2结合上下文和知识验证，实验结果表明其优于现有方法，具有良好的实用潜力。

HalluciNot: Detecting Hallucinations through Context and Common Sense Verification

BriefGPT - AI 论文速递 ·

本研究提出了一种基于Transformer的分类器，旨在有效检测大型语言模型的幻觉现象。实验结果表明，该方法在长输入上下文中优于强基线，具有实际应用价值。

基于多视角注意力特征的幻觉检测

BriefGPT - AI 论文速递 ·

本研究提出了一种名为DASH（系统性幻觉检测与评估）的方法，旨在识别视觉语言模型（VLMs）在开放环境中的幻觉现象。研究表明，通过DASH优化特定图像微调，可以有效减轻VLM的对象幻觉问题。

DASH: Detection and Assessment of Systematic Hallucinations in Visual Language Models

BriefGPT - AI 论文速递 ·

本研究提出了一种名为ShED-HD的轻量级幻觉检测框架，旨在解决大型语言模型在高风险领域中产生幻觉的问题。该框架利用BiLSTM架构和单头注意力机制，提高了边缘设备上的幻觉检测性能，增强了生成内容的可信度。

ShED-HD: A Shannon Entropy Distribution Framework for Lightweight Hallucination Detection on Edge Devices

BriefGPT - AI 论文速递 ·

本文提出了MedHallu基准，用于检测大语言模型在医疗问答中的幻觉问题。基准包含来自PubMedQA的10,000对问答，研究表明现有模型在幻觉检测上存在不足，引入领域知识和“无确定答案”选项可显著提高检测精度。

MedHallu: A Comprehensive Benchmark for Detecting Medical Hallucinations in Large Language Models

BriefGPT - AI 论文速递 ·

本研究提出了HuDEx模型，旨在提高大型语言模型（LLM）在高事实精度领域的可靠性。HuDEx能够同时检测幻觉并提供详细解释，研究表明其在幻觉检测准确性上超越了Llama3 70B和GPT-4，并适应多种测试环境。

HuDEx: Integrating Hallucination Detection and Explainability to Enhance the Reliability of Large Language Model Responses

BriefGPT - AI 论文速递 ·

基于令牌的真实检测：面向生产大型语言模型的实时幻觉检测

评估评估指标——幻觉检测的幻影

分离安全适配器实现高效的安全防护和灵活的推理时对齐

视频幻觉检测器：评估大型视频语言模型中的内在和外在幻觉

RePPL: Recalibrating Perplexity through Uncertainty in Semantic Propagation and Language Generation for Explainable QA Hallucination Detection

One Head for Prediction, One Head for Scrutiny: A Pre-trained Uncertainty Quantification Head for Detecting Hallucinations in Large Language Model Outputs

Research on Long Context Hallucination Detection

Uncertainty Quantification in Language Models: A Suite of Black-Box, White-Box, LLM Evaluators, and Ensemble Scorers

HalluciNot: Detecting Hallucinations through Context and Common Sense Verification

基于多视角注意力特征的幻觉检测

DASH: Detection and Assessment of Systematic Hallucinations in Visual Language Models

ShED-HD: A Shannon Entropy Distribution Framework for Lightweight Hallucination Detection on Edge Devices

MedHallu: A Comprehensive Benchmark for Detecting Medical Hallucinations in Large Language Models

HuDEx: Integrating Hallucination Detection and Explainability to Enhance the Reliability of Large Language Model Responses

RAG幻觉检测技术

基于命名实体识别的准确幻觉检测

Improvements to the Hallucination Classifier CHAIR-Classifier

VERITAS: A Unified Reliability Assessment Method

ReDeEP: Detecting Hallucinations in Retrieval-Augmented Generation via Mechanistic Interpretability

为多模态大型语言模型自动生成视觉幻觉测试用例