BriefGPT - AI Paper Digest

xCOMET: Transparent Machine Translation Evaluation through Fine-grained Error Detection

Summary

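This paper examines how reliably automatic machine translation (MT) metrics distinguish good translations from bad ones at the sentence level. It evaluates the segment-level performance of the most widely used MT metrics on three downstream cross-lingual tasks. The authors recommend that future MT metrics be designed to produce error labels rather than scores, which would make extrinsic evaluation easier.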

Key points

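The paper studies how reliably automatic MT metrics distinguish good translations from bad ones at the sentence level.
It evaluates the segment-level performance of MT metrics on three downstream cross-lingual tasks.
Experiments show that all metrics correlate only negligibly with the extrinsic evaluation of downstream outcomes.
Scores produced by neural metrics are largely uninterpretable because their value ranges are undefined.
Future MT metrics should be designed to produce error labels rather than scores, to facilitate extrinsic evaluation.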

Tags

MT metrics, downstream cross-lingual tasks, sentence level, bad translations, machine translation, automatic MT metrics
