Supervised fine tuning on curated data is reinforcement learning

68 points | by GabrielBianconi a day ago

19 comments