BriefGPT - AI 论文速递 ·

MMTEB: Massive Multilingual Text Embedding Benchmark

💡 原文英文，约100词，阅读约需1分钟。

📝

内容提要

本研究提出了大规模多语言文本嵌入基准（MMTEB），涵盖500多个评估任务和250种语言。最优模型“multilingual-e5-large-instruct”以560百万参数领先，并通过新方法降低计算成本。

🎯

🏷️

基于SGLang的大模型推理实践——从benchmark方法论到部署方案选型与调优
随着大语言模型（LLM）的快速发展，模型规模不断增大，对推理部署的要求也越来越高。在实际项目中，如何高效地在GPU集群上部署和优化大模型推理，已经成为AI...
Accelerating Text-to-Video Generation with Calibrated Sparse Attention
Recent diffusion models enable high-quality video generation, but suffer from...
Wolves, sheep, and gypsies
In 2012, the first Danish wolf in nearly two hundred years was discovered in ...
13 Google tips for a fun, productive summer off from college
Illustration of a woman in front of a computer, a phone searching an image of...
Why R&D Data Belongs in the Lakehouse - and Why Agents Need It There
The setupAt cellcentric, a joint venture of Daimler Truck and Volvo Group, we...
How Dow Built a Carbon Footprint Ledger on Databricks to Accelerate Sustainability at Scale
Why we built the Carbon Footprint LedgerAt Dow, our ambition is to be the mos...