OwlTail

Is now Fathom

Geoffrey Irving

6 Podcast Episodes

“Does Circuit Analysis Interpretability Scale? Evidence from Multiple Choice Capabilities in Chinchilla” by Neel Nanda, Tom Lieberum, Matthew Rahtz, János Kramár, Geoffrey Irving, Rohin Shah, vlad_m

Crossposted from the AI Alignment Forum. May contain more technical jargon than usual. Cross-posting a paper from the Goo...

20 Jul 2023

Neel Nanda

Rohin Shah

Geoffrey Irving

[Week 5] “AI safety via debate” by Geoffrey Irving, Paul Christiano and Dario Amodei

Abstract: To make AI systems broadly useful for challenging real-world tasks, we need them to learn complex human goals ...

12 May 2023

Dario Amodei

Paul Christiano

Geoffrey Irving

Similar People

Evan Hubinger

Andrew Critch

Rohin Shah

Kaj Sotala

Buck Shlegeris

Stuart Armstrong

Dylan Hadfield-Menell

Victoria Krakovna

Allan Dafoe

John Wentworth

William MacAskill

Stefan Schubert

Paul Christiano

Robin Hanson

Stuart Russell

AF - AXRP Episode 16 - Preparing for Debate AI with Geoffrey Irving by DanielFilan

Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist ...

1 Jul 2022

•

54mins

Geoffrey Irving

16 - Preparing for Debate AI with Geoffrey Irving

Many people in the AI alignment space have heard of AI safety via debate - check out AXRP episode 6 if you need a prime...

1 Jul 2022

•

1hr 4mins

Geoffrey Irving

AF - Learning the smooth prior by Geoffrey Irving

Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist ...

29 Apr 2022

•

18mins

Geoffrey Irving

AIAP: AI Alignment through Debate with Geoffrey Irving

See full article here: https://futureoflife.org/2019/03/06/ai-alignment-through-debate-with-geoffrey-irving/ "To make AI ...

7 Mar 2019

•

1hr 10mins

Geoffrey Irving
