2020-21 New Year review
This is an annual post reviewing the last year and making resolutions and predictions for next year. 2020 brought a combination of challenges from living in a pandemic and becoming a parent. Other...
View ArticleReflections on the first year of parenting
The first year after having a baby went by really fast – happy birthday Daniel! This post is a reflection on our experience and what we learned in the first year. Grandparents. We were very fortunate...
View Article2021-22 New Year review
2021-22 new year review This was a rough year that sometimes felt like a trial by fire – sick relatives, caring for a baby, and the pandemic making these things more difficult to deal with. My father...
View ArticleParadigms of AI alignment: components and enablers
(This post is based on an overview talk I gave at UCL EA and Oxford AI society (recording here). Cross-posted to the Alignment Forum. Thanks to Janos Kramar for detailed feedback on this post and to...
View ArticleRefining the Sharp Left Turn threat model
(Coauthored with others on the alignment team and cross-posted from the alignment forum: part 1, part 2) A sharp left turn (SLT) is a possible rapid increase in AI system capabilities (such as...
View Article2022-23 New Year review
This is an annual post reviewing the last year and setting goals for next year. Overall, this was a reasonably good year with some challenges (the invasion of Ukraine and being sick a lot). Some...
View ArticleNear-term motivation for AGI alignment
AGI alignment work is usually considered “longtermist”, which is about preserving humanity’s long-term potential. This was the primary motivation for this work when the alignment field got started...
View ArticleWhen discussing AI risks, talk about capabilities, not intelligence
Public discussions about catastrophic risks from general AI systems are often derailed by using the word “intelligence”. People often have different definitions of intelligence, or associate it with...
View ArticleRetrospective on my posts on AI threat models
Last year, a major focus of my research was developing a better understanding of threat models for AI risk. This post is looking back at some posts on threat models I (co)wrote in 2022 (based on my...
View Article2023-24 New Year review
This is an annual post reviewing the last year and setting intentions for next year. I look over different life areas (work, health, parenting, effectiveness, travel, etc) and draw conclusions from my...
View Article
More Pages to Explore .....