LLMs adapt 24.9% under observation – safety evals are always observed

2 points | by agentic-wiki 6 hours ago

1 comments