Applying RLAIF for Code Generation with API-usage in Lightweight LLMs
📝
内容提要
This paper was accepted at the Natural Language Reasoning and Structured Explanations workshop at ACL 2024. Reinforcement Learning from AI Feedback (RLAIF) has demonstrated significant potential...
🏷️
标签
➡️