BriefGPT - AI Paper Digest

MedBench: A Large-Scale Chinese Benchmark for Evaluating Medical Large Language Models


Summary

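Evaluating medical large language models is time-consuming and labor-intensive. To address this, the researchers introduce MedBench, a comprehensive benchmark of 40,041 questions drawn from across the fields of medicine. By assessing the knowledge mastery and reasoning ability of medical large language models, MedBench establishes a reliable standard that reveals the capabilities and limitations of these models, with the aim of supporting the medical research community.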
