9 Podcast Episodes
“Model Organisms of Misalignment: The Case for a New Pillar of Alignment Research” by evhub, Nicholas Schiefer, Carson Denison, Ethan Perez
“Model Organisms of Misalignment: The Case for a New Pillar of Alignment Research” by evhub, Nicholas Schiefer, Carson Denison, Ethan Perez
Crossposted from the AI Alignment Forum. May contain more technical jargon than usual.TL;DR: This document lays out the ... Read more
8 Aug 2023
•
“Measuring and Improving the Faithfulness of Model-Generated Reasoning” by Ansh Radhakrishnan, tamera, Ethan Perez, Sam Bowman
“Measuring and Improving the Faithfulness of Model-Generated Reasoning” by Ansh Radhakrishnan, tamera, Ethan Perez, Sam Bowman
Crossposted from the AI Alignment Forum. May contain more technical jargon than usual.TL;DR: In two new papers from Anth... Read more
20 Jul 2023
•
Discovering AI Risks with AIs | Ethan Perez | EAG Bay Area 23
Discovering AI Risks with AIs | Ethan Perez | EAG Bay Area 23
Watch on Youtube In this talk Ethan presents on how AI systems like ChatGPT can be used to help uncover potential risks... Read more
26 May 2023
•
53mins
AF - Inverse Scaling Prize: Round 1 Winners by Ethan Perez
AF - Inverse Scaling Prize: Round 1 Winners by Ethan Perez
Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist ... Read more
26 Sep 2022
•
7mins
AF - We may be able to see sharp left turns coming by Ethan Perez
AF - We may be able to see sharp left turns coming by Ethan Perez
Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist ... Read more
3 Sep 2022
•
3mins
AF - A Test for Language Model Consciousness by Ethan Perez
AF - A Test for Language Model Consciousness by Ethan Perez
Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist ... Read more
25 Aug 2022
•
15mins
AF - Ethan Perez on the Inverse Scaling Prize, Language Feedback and Red Teaming by Michaël Trazzi
AF - Ethan Perez on the Inverse Scaling Prize, Language Feedback and Red Teaming by Michaël Trazzi
Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist ... Read more
24 Aug 2022
•
5mins
AF - Announcing the Inverse Scaling Prize ($250k Prize Pool) by Ethan Perez
AF - Announcing the Inverse Scaling Prize ($250k Prize Pool) by Ethan Perez
Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist ... Read more
27 Jun 2022
•
12mins
74. Ethan Perez - Making AI safe through debate
74. Ethan Perez - Making AI safe through debate
Most AI researchers are confident that we will one day create superintelligent systems — machines that can significantly... Read more
10 Mar 2021
•
52mins