Latest

Dec 19

(Linkpost) Results for a survey of tool use and workflows in alignment research

On March 22nd, 2022, we released a survey with an accompanying post to gain more insight into
1 min read
Dec 16

How learning efficiently applies to alignment research

As we are trying to optimize for actually solving the problem, we should not fall into the trap of learning
2 min read
Dec 07

Differential Training Process: Delaying capabilities until inner aligned

I've been ruminating on an idea ever since I read the section on deception in "The Core
3 min read
Dec 07

Near-Term AI capabilities probably bring low-hanging fruit for global poverty/health

I'm an alignment researcher, but I still think we should be vigilant about how models like GPT-N could
1 min read
Dec 05

Is the "Valley of Confused Abstractions" real?

Epistemic Status: Quite confused. Using this short post as a signal for discussion. Here's a link to the
3 min read
Dec 05

Foresight for AGI Safety Strategy

For discussion: Link to LessWrong post. Link to EA Forum post. This post is about why I think we should
10 min read
Nov 28

Notes on Cicero

Link to YouTube explanation by Yannic Kilcher: Link to paper (sharing on GDrive since it's behind a paywall
3 min read
Nov 26

Detail about factual knowledge in Transformers

This post is currently in the Appendix of a much longer post that I'm editing and waiting for
2 min read
Aug 13

Current Thoughts on my Learning System

TLDR of what I've been thinking about lately: Learning is a set of skills. You need to practice
4 min read
Jul 27

What does "Effective" in EA mean to you?

In the lead-up to EAG SF, I took some time to think about what EA means to me. When I
3 min read