大型语言模型是新的数据库用户。现在我们需要一种衡量它们的方法：介绍text-to-sql-eval

Open-source text-to-SQL evaluation suite for PostgreSQL. Measure, debug, and improve LLM database accuracy with granular testing and actionable insights.

我们开源了用于评估和提升PostgreSQL文本到SQL系统的评估套件text-to-sql-eval。该工具支持多种模型，专为PostgreSQL设计，帮助识别失败原因并提供改进建议，包含多种操作模式，便于调试和结果跟踪，旨在提高文本到SQL系统的准确性和可靠性。

PostgreSQL 准确性可靠性文本到SQL 评估套件