极道 ·

大模型Grok-1.5发布

💡 原文中文，约900字，阅读约需2分钟。

📝

内容提要

Grok-1.5是一种具有长上下文理解和高级推理能力的大型语言模型。它在数学和编码任务中表现出色，并能处理长文档和复杂提示。Grok-1.5还展示了强大的检索能力，并构建在JAX、Rust和Kubernetes的自定义分布式训练框架之上。

🎯

关键要点

Grok-1.5是一种具有长上下文理解和高级推理能力的大型语言模型。
Grok-1.5的上下文长度达到128,000个令牌，显著提高了推理能力。
在数学和编码任务中，Grok-1.5表现出色，MATH基准得分50.6%，GSM8K基准得分90%。
在HumanEval基准测试中，Grok-1.5的代码生成和问题解决能力得分为74.1%。
Grok-1.5能够处理多达128K个令牌的长上下文，内存容量增加到之前的16倍。
该模型可以处理更长、更复杂的提示，并保持指令跟踪能力。
在NIAH评估中，Grok-1.5展示了强大的检索能力，能够在长达128K token的上下文中实现完美检索。
Grok-1.5基于JAX、Rust和Kubernetes的自定义分布式训练框架构建，支持大规模GPU集群的运行。

🏷️

标签

Grok-1.5 grok 上下文理解推理能力检索能力语言模型

➡️

继续阅读

OpenAI built support agents for its own customer service line, now it hopes big enterprises will trust them too
The general consensus emerging across the AI and industrial spheres is that t...
Building a serverless AI assistant at Pelago: concept to care in two weeks
Healthcare organizations face a critical scaling challenge – how to maintain ...
Visual Studio Code 1.130（Insiders）
Visual Studio Code 1.130 Insiders版本发布，新增功能更新。用户可通过提交日志和已关闭问题列表跟踪进展，鼓励大家尽快尝试新特性。
Visual Studio Code 1.131 (Insiders)
Learn what's new in Visual Studio Code 1.131 (Insiders) Read the full article
“Every few months, a new model made part of our roadmap unnecessary”: Why Mendral’s founders gave up their startup for Anthropic
Anthropic is bringing the team behind AI startup Mendral on board to strength...
Apple is reportedly testing a MacBook Neo with more RAM
Following the MacBook Neo's huge popularity so far, Apple is reportedly d...