Aligned to Flourish

Oct

02

AI Alignment Project Ideas

[Nov 27, 2024] I have some new alignment project ideas I quickly wrote up. These are mostly projects I'

Oct 2, 2024

6 min read

Sep

24

How much I'm paying for AI productivity software (and the future of AI use)

This post is broken down into two parts: 1. Which AI productivity tools am I currently using? 2. Why does

Sep 24, 2024

9 min read

Aug

14

The importance of Entropy

Imagine you're building a sandcastle on the beach. As you carefully shape the sand, you're creating

Aug 14, 2024

2 min read

Jul

27

Accelerating AI Alignment Research (Talk)

I gave a keynote talk on how we should be thinking about accelerating AI alignment (safety) research. This is a

Jul 27, 2024

1 min read

Jul

09

Using data attribution for AI alignment

This is a post on a recent paper I thought was cool. I give some follow-up project ideas after. In-Run

Jul 9, 2024

5 min read

May

10

Quantum Computing, Photonics, and Energy Bottlenecks for AGI

💡Note: I wrote this post in less than a day and didn't want to spend more time on

May 10, 2024

10 min read

May

03

AI Insights #1: How Misalignment Could Lead to Takeover & Necessary Safety Properties

AI safety insights number 1: risks of misaligned AI takeover, key properties of AGI safety plans, and dangers of autonomous AI agents maximizing rewards in unintended ways as models advance.

May 3, 2024

5 min read

Jan

23

My current research and request for collaborators

I wrote this as a bio for EAG Bay Area 2024. I'm sharing this here because it gives

Jan 23, 2024

3 min read

Dec

29

But is it really in Rome? Limitations of the ROME model editing technique

I just published a new post on LessWrong. It's about the causal tracing and model editing paper (ROME)

Dec 29, 2022

3 min read

Dec

29

An incomplete list of projects I'd like to work on in 2023

Wrote up a short (incomplete) bullet-point list of the projects I'd like to work on in 2023. Here&

Dec 29, 2022

Featured articles

Accelerating AI Alignment Research (Talk)

Quantum Computing, Photonics, and Energy Bottlenecks for AGI

AI Insights #1: How Misalignment Could Lead to Takeover & Necessary Safety Properties

Foresight for AGI Safety Strategy

A descriptive, not prescriptive, overview of current AI Alignment Research

Latest

AI Alignment Project Ideas

How much I'm paying for AI productivity software (and the future of AI use)

The importance of Entropy

Accelerating AI Alignment Research (Talk)

Using data attribution for AI alignment

Quantum Computing, Photonics, and Energy Bottlenecks for AGI

AI Insights #1: How Misalignment Could Lead to Takeover & Necessary Safety Properties

My current research and request for collaborators

But is it really in Rome? Limitations of the ROME model editing technique

An incomplete list of projects I'd like to work on in 2023