A faster, better way to prevent an AI chatbot from giving toxic responses
📝
内容提要
Researchers create a curious machine-learning model that finds a wider variety of prompts for training a chatbot to avoid hateful or harmful output.
➡️