Alignment pretraining: AI discourse creates self-fulfilling (mis)alignment

26 points | by anigbrowl 6 hours ago

12 comments