Victoria Krakovna

Image may be NSFW.
Clik here to view.

2020-21 New Year review

January 3, 2021, 7:33 am

This is an annual post reviewing the last year and making resolutions and predictions for next year. 2020 brought a combination of challenges from living in a pandemic and becoming a parent. Other...

View Article

Image may be NSFW.
Clik here to view.

Reflections on the first year of parenting

November 11, 2021, 8:02 am

The first year after having a baby went by really fast – happy birthday Daniel! This post is a reflection on our experience and what we learned in the first year. Grandparents. We were very fortunate...

View Article

Image may be NSFW.
Clik here to view.

2021-22 New Year review

January 4, 2022, 2:38 pm

2021-22 new year review This was a rough year that sometimes felt like a trial by fire – sick relatives, caring for a baby, and the pandemic making these things more difficult to deal with. My father...

View Article

Image may be NSFW.
Clik here to view.

Paradigms of AI alignment: components and enablers

June 1, 2022, 6:36 pm

(This post is based on an overview talk I gave at UCL EA and Oxford AI society (recording here). Cross-posted to the Alignment Forum. Thanks to Janos Kramar for detailed feedback on this post and to...

View Article

Image may be NSFW.
Clik here to view.

Refining the Sharp Left Turn threat model

November 25, 2022, 9:01 am

(Coauthored with others on the alignment team and cross-posted from the alignment forum: part 1, part 2) A sharp left turn (SLT) is a possible rapid increase in AI system capabilities (such as...

View Article

Image may be NSFW.
Clik here to view.

2022-23 New Year review

January 6, 2023, 9:58 am

This is an annual post reviewing the last year and setting goals for next year. Overall, this was a reasonably good year with some challenges (the invasion of Ukraine and being sick a lot). Some...

View Article

Near-term motivation for AGI alignment

March 9, 2023, 5:09 am

AGI alignment work is usually considered “longtermist”, which is about preserving humanity’s long-term potential. This was the primary motivation for this work when the alignment field got started...

View Article

Image may be NSFW.
Clik here to view.

When discussing AI risks, talk about capabilities, not intelligence

August 9, 2023, 2:27 pm

Public discussions about catastrophic risks from general AI systems are often derailed by using the word “intelligence”. People often have different definitions of intelligence, or associate it with...

View Article

Image may be NSFW.
Clik here to view.

Retrospective on my posts on AI threat models

December 20, 2023, 12:38 pm

Last year, a major focus of my research was developing a better understanding of threat models for AI risk. This post is looking back at some posts on threat models I (co)wrote in 2022 (based on my...

View Article

Image may be NSFW.
Clik here to view.

2023-24 New Year review

January 3, 2024, 8:01 am

This is an annual post reviewing the last year and setting intentions for next year. I look over different life areas (work, health, parenting, effectiveness, travel, etc) and draw conclusions from my...

View Article

More Pages to Explore .....

Latest Images