Planet PostgreSQL

Adam Hendel: Operationalizing Vector Databases on Postgres

💡 Original article in English, about 1,400 words, roughly a 6-minute read.

📝

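Summary

This article explains why vector databases are needed and how to generate and search embeddings with pg_vectorize. pg_vectorize provides ways to manage the embedding transformations and supports both scheduled and real-time embedding updates. Supported embedding models include those from OpenAI and Hugging Face.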

🎯

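Key Points

Vector databases are needed because embeddings must be stored, indexed, and searched efficiently.
Generating and using embeddings is an ongoing lifecycle that requires continuous maintenance.
Consistency between model training and inference is essential: the same preprocessing steps must be used in both.
pg_vectorize addresses embedding generation and search by tracking which transformer model was used to generate the embeddings.
You can start a VectorDB instance for free on Tembo Cloud, or run the docker-compose example locally.
pg_vectorize offers two ways to manage embedding transformations: time-based scheduling and real-time triggers.
pg_vectorize supports all OpenAI and Hugging Face embedding models, including private models.
Text can be converted to embeddings directly with the vectorize.transform_embeddings method.
The pg_vectorize extension is open source; issues and discussions are welcome on GitHub.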
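
The point about tracking the transformer model can be illustrated with a small sketch. It is not pg_vectorize's implementation: `toy_embed` and `EmbeddingJob` are hypothetical names invented here, and the hash-based "model" is only a stand-in for a real embedding model. The idea shown is that the model name is recorded once, when the data is indexed, and every search query is then embedded with that same recorded model.

```python
import hashlib
import math
from dataclasses import dataclass, field


def toy_embed(text: str, model_name: str, dim: int = 64) -> list[float]:
    """Stand-in for a real embedding model (assumption, for illustration only).

    Each token is hashed, salted with the model name, into one of `dim` buckets,
    so different "models" produce incompatible vectors for the same text.
    """
    vec = [0.0] * dim
    for token in text.lower().split():
        digest = hashlib.sha256(f"{model_name}:{token}".encode()).digest()
        vec[int.from_bytes(digest[:4], "big") % dim] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]


@dataclass
class EmbeddingJob:
    """Hypothetical record of which model produced the stored vectors."""
    model_name: str
    rows: dict[int, list[float]] = field(default_factory=dict)

    def index(self, row_id: int, text: str) -> None:
        # Stored vectors are always produced by the recorded model.
        self.rows[row_id] = toy_embed(text, self.model_name)

    def search(self, query: str, top_k: int = 1) -> list[int]:
        # The query is embedded with the recorded model, never a caller-supplied one,
        # so indexing and searching cannot drift apart.
        q = toy_embed(query, self.model_name)
        ranked = sorted(
            self.rows.items(),
            key=lambda item: sum(a * b for a, b in zip(q, item[1])),
            reverse=True,
        )
        return [row_id for row_id, _ in ranked[:top_k]]
```

A caller would create one `EmbeddingJob` per table, call `index` for each row, and then call `search` with plain text. Because the model name lives on the job rather than in each call, a query cannot accidentally be embedded with a different model than the stored rows.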

❓

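Further Q&A

Why are vector databases necessary?

Vector databases are needed because embeddings must be stored, indexed, and searched efficiently.

How does pg_vectorize generate and search embeddings?

pg_vectorize tracks the transformer model used to generate the embeddings, and supports both real-time and scheduled embedding updates.

How do I start a VectorDB instance on Tembo Cloud?

You can start a VectorDB instance for free on Tembo Cloud, or run the docker-compose example locally.

Which embedding models does pg_vectorize support?

pg_vectorize supports all OpenAI and Hugging Face embedding models, including private models.

How are embedding updates managed?

pg_vectorize offers two ways to manage embedding updates: time-based scheduling and real-time triggers.

How do I convert text to embeddings?

Text can be converted to embeddings directly with the vectorize.transform_embeddings method.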

🏷️

Tags

pg_vectorize, postgres, vector, vector database, embedding vector, database, model, transformation
