A Mini Exercise on the Mismanaged Geniuses Hypothesis (RLMs on LongCoT)

📝

内容提要

We study an example of the Mismanaged Geniuses Hypothesis at play on the LongCoT benchmark

➡️

继续阅读