Chain of thought monitorability: A new and fragile opportunity for AI safety

132 points | by mfiguiere 4 days ago

65 comments