Theory and AI Alignment
The following is based on a talk that I gave (remotely) at the UK AI Security Institute Alignment Workshop on October 29, and which I then procrastinated for more than a month in writing up. Enjoy!

Thanks for having me! I’m a theoretical computer scientist. I’ve spent most of my