R-Zero: Self-Evolving Reasoning LLM from Zero Data

120 points | by lawrenceyan 3 days ago

63 comments