Presentation: Building Evals for AI Adoption: From Principles to Practice
📝
内容提要
Mallika Rao discusses the hidden risk of evaluation debt in production AI systems, drawing on her experience at Twitter, Walmart, and Netflix. She explains why traditional metrics fail modern...
🏷️
标签
➡️