Reinforcement Learning from Human Feedback

96 points | by onurkanbkrc 10 hours ago

5 comments