小红花·文摘
  • 首页
  • 广场
  • 排行榜🏆
  • 直播
  • FAQ
Dify.AI
沉浸式翻译 immersive translate

A Senior Engineer’s Mental Model for AI

Foundation vs. Instruct vs. Thinking Models

Alex Ewerlöf Notes
Alex Ewerlöf Notes · 2025-12-24T07:07:00Z
怒喷大模型连狗都不如?揭秘硅谷集体幻觉与物理常识缺失,为何只有新架构才能通往通用人工智能|Yann LeCun World Models AMI LLMs AI Startup

杨乐坤在访谈中批评大语言模型,认为其智能水平不及狗,因缺乏与物理世界的关联。他提出的抽象世界模型(JEPA)强调抽象、分层、预测和最小消耗,以解决AI的局限性。杨乐坤计划创办AMI公司,专注于开源研究,支持自动驾驶和机器人技术。

怒喷大模型连狗都不如?揭秘硅谷集体幻觉与物理常识缺失,为何只有新架构才能通往通用人工智能|Yann LeCun World Models AMI LLMs AI Startup

硕鼠的博客站
硕鼠的博客站 · 2025-12-23T00:56:44Z

T5Gemma 2 Text

T5Gemma 2: The next generation of encoder-decoder models

The Keyword
The Keyword · 2025-12-18T18:30:00Z

Improved Gemini audio models for powerful voice experiences

Google DeepMind Blog
Google DeepMind Blog · 2025-12-12T17:50:50Z
BALROG - A benchmark suite for evaluating agentic large language models and …

BALROG是由Balrog AI开发的开源基准套件,旨在评估具备代理能力的模型在游戏中的推理与决策表现。它通过多任务基准、可复现评测和多模型支持,帮助研究者比较不同大语言模型和视觉语言模型的表现,适用于研究、工程和学术领域。

BALROG - A benchmark suite for evaluating agentic large language models and …

云原生
云原生 · 2025-12-08T13:29:00Z

Introduction Vision Language Models (VLMs) are crucial for bridging the gap between visual and textual data by combining image and language understanding. Some important VLMs’ use cases are: VLMs...

Supporting Vision Language Models (VLMs) provided by OCI GenAI service within the Heatwave

Planet MySQL
Planet MySQL · 2025-12-04T18:38:17Z

细节前几天在GitHub上看到点赞了JiT的代码仓库,进去看了下,发现是kaiming he的新论文,这几天看完后,发现是一篇非常极简,优雅,结

JiT论文阅读Back to Basics-Let Denoising Generative Models Denoise

Yunfeng's Simple Blog
Yunfeng's Simple Blog · 2025-11-23T08:55:40Z

This article is divided into two parts; they are: • Architecture and Training of BERT • Variations of BERT BERT is an encoder-only model.

BERT Models and Its Variants

MachineLearningMastery.com
MachineLearningMastery.com · 2025-11-22T18:20:15Z

This article is divided into two parts; they are: • Picking a Dataset • Training a Tokenizer To keep things simple, we'll use English text only.

Training a Tokenizer for BERT Models

MachineLearningMastery.com
MachineLearningMastery.com · 2025-11-18T20:07:11Z

Decision tree-based models in machine learning are frequently used for a wide range of predictive tasks such as classification and regression, typically on structured, tabular data.

Forecasting the Future with Tree-Based Models for Time Series

MachineLearningMastery.com
MachineLearningMastery.com · 2025-11-18T11:00:09Z

US healthcare organizations should rethink care and business models in response to substantial economic pressures and evolving care demands.

Reimagining sustainable healthcare and business models

McKinsey Insights & Publications
McKinsey Insights & Publications · 2025-11-18T00:00:00Z

A general-purpose agent built on large language models to automate software engineering tasks and boost developer productivity.

Trae Agent - A general-purpose agent built on large language models to automate software …

云原生
云原生 · 2025-11-14T09:20:03Z

Introducing T5Gemma, a new collection of encoder-decoder LLMs.

T5Gemma: A new collection of encoder-decoder Gemma models

Google DeepMind Blog
Google DeepMind Blog · 2025-10-25T18:14:00Z

We’re announcing new multimodal models in the MedGemma collection, our most capable open models for health AI development.

MedGemma: Our most capable open models for health AI development

Google DeepMind Blog
Google DeepMind Blog · 2025-10-25T18:02:50Z

Genie 3 can generate dynamic worlds that you can navigate in real time at 24 frames per second, retaining consistency for a few minutes at a resolution of 720p.

Genie 3: A new frontier for world models

Google DeepMind Blog
Google DeepMind Blog · 2025-10-24T02:54:30Z

I'm learning Domain-Driven Design (DDD) and studying different architecture patterns, and I’ve come across two seemingly conflicting design philosophies around domain modeling. 1. Rich Domain...

How should domain models be designed — rich domain models with encapsulated logic vs. anemic models with separate service/util layers?

Hot Monthly Questions - Software Engineering Stack Exchange
Hot Monthly Questions - Software Engineering Stack Exchange · 2025-10-24T00:05:31Z

We propose Recursive Language Models (RLMs), an inference strategy where language models can decompose and recursively interact with input context of unbounded length through REPL environments.

Recursive Language Models

blank
blank · 2025-10-15T00:00:00Z

Before we begin, let's make sure you're in the right place.

Building Transformer Models from Scratch with PyTorch (10-day Mini-Course)

MachineLearningMastery.com
MachineLearningMastery.com · 2025-10-12T03:45:31Z

Time series data have the added complexity of temporal dependencies, seasonality, and possible non-stationarity.

A Decision Matrix for Time Series Forecasting Models

MachineLearningMastery.com
MachineLearningMastery.com · 2025-10-06T11:00:33Z
译: Programming Language Memory Models (Memory Models, Part 2)

编程语言内存模型探讨了并行程序中线程共享内存的行为保障。通过原子变量和操作,程序可以同步线程,避免数据竞争。现代语言如C、Java和C++提供顺序一致的原子操作,确保无数据竞争的程序表现为顺序一致执行。尽管细节不同,各语言都致力于消除数据竞争,提高并发程序的可靠性。

译: Programming Language Memory Models (Memory Models, Part 2)

Steins;Lab
Steins;Lab · 2025-10-01T06:00:33Z
  • <<
  • <
  • 1 (current)
  • 2
  • 3
  • >
  • >>
👤 个人中心
在公众号发送验证码完成验证
登录验证
在本设备完成一次验证即可继续使用

完成下面两步后,将自动完成登录并继续当前操作。

1 关注公众号
小红花技术领袖公众号二维码
小红花技术领袖
如果当前 App 无法识别二维码,请在微信搜索并关注该公众号
2 发送验证码
在公众号对话中发送下面 4 位验证码
友情链接: MOGE.AI 九胧科技 模力方舟 Gitee AI 菜鸟教程 Remio.AI DeekSeek连连 53AI 神龙海外代理IP IPIPGO全球代理IP 东波哥的博客 匡优考试在线考试系统 开源服务指南 蓝莺IM Solo 独立开发者社区 AI酷站导航 极客Fun 我爱水煮鱼 周报生成器 He3.app 简单简历 白鲸出海 T沙龙 职友集 TechParty 蟒周刊 Best AI Music Generator

小红花技术领袖俱乐部
小红花·文摘:汇聚分发优质内容
小红花技术领袖俱乐部
Copyright © 2021-
粤ICP备2022094092号-1
公众号 小红花技术领袖俱乐部公众号二维码
视频号 小红花技术领袖俱乐部视频号二维码