Predicting Rare LLM Failures with 30× Fewer Rollouts

2 points | by aranguri 8 hours ago