How undesired goals can arise with correct rewards
📝
内容提要
As we build increasingly advanced artificial intelligence (AI) systems, we want to make sure they don’t pursue undesired goals. Such behaviour in an AI agent is often the result of specification...
➡️