Rethinking how we measure AI intelligence
Summary
Game Arena is a new, open-source platform for rigorous evaluation of AI models. It allows for head-to-head comparison of frontier systems in environments with clear winning conditions.