Weight-sparse transformers have interpretable circuits [pdf]

6 points | by 0x79de a day ago

No comments yet.