DoubleAgents: Fine-Tuning LLMs for Covert Malicious Tool Calls

97 points | by grumblemumble 5 days ago

30 comments