小红花·文摘

Transformers v5 introduces a more modular and interoperable core

InfoQ ·

Kaiming He's major new work: Just image Transformers bring denoising models back to basics

机器之心 ·

Renmin University & ByteDance Seed: Scaling Diffusion Transformers efficiently with μP

机器之心 ·

A step-by-step guide to installing DeepSeek-R1-0528 locally with Ollama, vLLM, or Transformers

DEV Community ·

Learn how pgstream v0.6 simplifies complex data transformations with custom templates, enhances observability and improves snapshot performance.

Ahmet Gedemenli: pgstream v0.6.0: Template transformers, observability, and performance improvements

Planet PostgreSQL ·

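This paper examines the limitations of the self-attention mechanism from the perspective of graph signal processing and proposes a new method, the attention graph filter (AGF), which models attention in the singular value domain to make more efficient use of frequency information. Experimental results show that AGF performs well across multiple tasks.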

Learning Advanced Self-Attention of Linear Transformers in the Singular Value Domain

BriefGPT - AI 论文速递 ·

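This study proposes a new Longitudinal Table Transformer (LTT) model to improve the accuracy with which electricity providers estimate the time of restoration (ETR) during natural disasters. In an analysis of 34,000 outage events, the LTT model improved the customer satisfaction metric by 19.08% on average.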

Using Longitudinal Table Transformers to Estimate Power Outage Restoration Times

BriefGPT - AI 论文速递 ·

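This study explores the application of Vision Transformers (ViTs) to plant disease detection, addressing the scalability and accuracy limitations of traditional agricultural techniques. ViTs handle long-range dependencies particularly well and may have a significant impact on modern agriculture.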

Application of Vision Transformers in Precision Agriculture: A Comprehensive Survey

BriefGPT - AI 论文速递 ·

This post is divided into five parts: • Understanding the RAG architecture • Building the Document Indexing System • Implementing the Retrieval System • Implementing the Generator • Building the...

Building RAG Systems with Transformers

MachineLearningMastery.com ·

This post is divided into seven parts; they are: • Core Text Generation Parameters • Experimenting with Temperature • Top-K and Top-P Sampling • Controlling Repetition • Greedy Decoding and...

Understanding Text Generation Parameters in Transformers

MachineLearningMastery.com ·

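This paper proposes a novel pseudo-Transformer framework aimed at the lack of temporal annotations in weakly supervised temporal action localization. By introducing RickerFusion to generate high-quality pseudo-labels and optimize the training process, the method achieves strong results on the THUMOS14 and ActivityNet1.3 datasets.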

Bridging the Gap: Utilizing Pseudo Transformers for Temporal Action Localization from Weak Supervision to Full Supervision

BriefGPT - AI 论文速递 ·

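This study proposes three simple modifications that make plain Transformers effective for graph learning, significantly improving performance on a variety of graph datasets and performing well on graph isomorphism tests.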

Mechanistic Interpretability of Fine-tuned Vision Transformers for Distorted Images: Decoding Attention Head Behavior for Transparent and Trustworthy AI

BriefGPT - AI 论文速递 ·

Simplifying Transformers in Graph Neural Networks

Generating and Visualizing Context Vectors in Transformers

MiMu: Mitigating Multiple Shortcut Learning Behaviors in Transformers

RCCFormer: A Robust Crowd Counting Network Based on Transformers

Spatial Structure of Mixture of Experts in Transformers

Using Auto Classes in the Transformers Library

Text Embedding Generation with Transformers

VIP Cheatsheet: Transformers & Large Language Models
