Summary of METR's predeployment evaluation of GPT-5.6 Sol

10 points | by pongogogo a day ago

6 comments