Presentation: Building Evals for AI Adoption: From Principles to Practice

Summary

Mallika Rao discusses the hidden risk of evaluation debt in production AI systems, drawing on her experience at Twitter, Walmart, and Netflix. She explains why traditional metrics fail modern...
