A Mini Exercise on the Mismanaged Geniuses Hypothesis (RLMs on LongCoT)
Summary
We study an example of the Mismanaged Geniuses Hypothesis at play on the LongCoT benchmark