Accelerating AI Alignment Research (Talk)
I gave a keynote talk on how we should be thinking about accelerating AI alignment (safety) research. This is a
A descriptive, not prescriptive, overview of current AI Alignment Research
TL;DR: In this project, we collected and cataloged AI alignment research literature and analyzed the resulting dataset in an
A survey of tool use and workflows in alignment research
Crossposted from the AI Alignment Forum.
TL;DR: We are building language model powered tools to augment alignment researchers and