Ethan Perez

9 Podcast Episodes

“Model Organisms of Misalignment: The Case for a New Pillar of Alignment Research” by evhub, Nicholas Schiefer, Carson Denison, Ethan Perez

“Model Organisms of Misalignment: The Case for a New Pillar of Alignment Research” by evhub, Nicholas Schiefer, Carson Denison, Ethan Perez

Crossposted from the AI Alignment Forum. May contain more technical jargon than usual.TL;DR: This document lays out the ... Read more

8 Aug 2023

“Measuring and Improving the Faithfulness of Model-Generated Reasoning” by Ansh Radhakrishnan, tamera, Ethan Perez, Sam Bowman

“Measuring and Improving the Faithfulness of Model-Generated Reasoning” by Ansh Radhakrishnan, tamera, Ethan Perez, Sam Bowman

Crossposted from the AI Alignment Forum. May contain more technical jargon than usual.TL;DR: In two new papers from Anth... Read more

20 Jul 2023

Similar People

Discovering AI Risks with AIs | Ethan Perez | EAG Bay Area 23

Discovering AI Risks with AIs | Ethan Perez | EAG Bay Area 23

Watch on Youtube In this talk Ethan presents on how AI systems like ChatGPT can be used to help uncover potential risks... Read more

26 May 2023

53mins

AF - Inverse Scaling Prize: Round 1 Winners by Ethan Perez

AF - Inverse Scaling Prize: Round 1 Winners by Ethan Perez

Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist ... Read more

26 Sep 2022

7mins

Most Popular

AF - We may be able to see sharp left turns coming by Ethan Perez

AF - We may be able to see sharp left turns coming by Ethan Perez

Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist ... Read more

3 Sep 2022

3mins

AF - A Test for Language Model Consciousness by Ethan Perez

AF - A Test for Language Model Consciousness by Ethan Perez

Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist ... Read more

25 Aug 2022

15mins

AF - Ethan Perez on the Inverse Scaling Prize, Language Feedback and Red Teaming by Michaël Trazzi

AF - Ethan Perez on the Inverse Scaling Prize, Language Feedback and Red Teaming by Michaël Trazzi

Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist ... Read more

24 Aug 2022

5mins

AF - Announcing the Inverse Scaling Prize ($250k Prize Pool) by Ethan Perez

AF - Announcing the Inverse Scaling Prize ($250k Prize Pool) by Ethan Perez

Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist ... Read more

27 Jun 2022

12mins

74. Ethan Perez - Making AI safe through debate

74. Ethan Perez - Making AI safe through debate

Most AI researchers are confident that we will one day create superintelligent systems — machines that can significantly... Read more

10 Mar 2021

52mins

“Podium: AI tools for podcasters. Generate show notes, transcripts, highlight clips, and more with AI. Try it today at https://podium.page”