Using data attribution for AI alignment

5 min read

Quantum Computing, Photonics, and Energy Bottlenecks for AGI

10 min read

AI Insights #1: How Misalignment Could Lead to Takeover & Necessary Safety Properties

5 min read

My current research and request for collaborators

3 min read

But is it really in Rome? Limitations of the ROME model editing technique

3 min read