Using data attribution for AI alignment

4 min read

Quantum Computing, Photonics, and Energy Bottlenecks for AGI

10 min read

AI Insights #1: How Misalignment Could Lead to Takeover & Necessary Safety Properties

4 min read

My current research and request for collaborators

3 min read

But is it really in Rome? Limitations of the ROME model editing technique

2 min read