16 Podcast Episodes
“Does Circuit Analysis Interpretability Scale? Evidence from Multiple Choice Capabilities in Chinchilla” by Neel Nanda, Tom Lieberum, Matthew Rahtz, János Kramár, Geoffrey Irving, Rohin Shah, vlad_m
“Does Circuit Analysis Interpretability Scale? Evidence from Multiple Choice Capabilities in Chinchilla” by Neel Nanda, Tom Lieberum, Matthew Rahtz, János Kramár, Geoffrey Irving, Rohin Shah, vlad_m
Crossposted from the AI Alignment Forum. May contain more technical jargon than usual.Cross-posting a paper from the Goo... Read more
20 Jul 2023
•
#154 - Rohin Shah on DeepMind and trying to fairly hear out both AI doomers and doubters
#154 - Rohin Shah on DeepMind and trying to fairly hear out both AI doomers and doubters
Can there be a more exciting and strange place to work today than a leading AI lab? Your CEO has said they're worried yo... Read more
9 Jun 2023
•
3hr 9mins
Four: Rohin Shah on DeepMind and trying to fairly hear out both AI doomers and doubters
Four: Rohin Shah on DeepMind and trying to fairly hear out both AI doomers and doubters
Can there be a more exciting and strange place to work today than a leading AI lab? Your CEO has said they're worried yo... Read more
16 May 2023
•
3hr 9mins
Rohin Shah
Rohin Shah
Dr. Rohin Shah is a Research Scientist at DeepMind, and the editor and main contributor of the Alignment Newsletter.Feat... Read more
12 Apr 2022
•
1hr 37mins
AF - Shah and Yudkowsky on alignment failures by Rohin Shah
AF - Shah and Yudkowsky on alignment failures by Rohin Shah
Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist ... Read more
28 Feb 2022
•
2hr 24mins
Rohin Shah on the State of AGI Safety Research in 2021
Rohin Shah on the State of AGI Safety Research in 2021
Rohin Shah, Research Scientist on DeepMind's technical AGI safety team, joins us to discuss: AI value alignment; how an ... Read more
2 Nov 2021
•
1hr 43mins
AF - [AN #168]: Four technical topics for which Open Phil is soliciting grant proposals by Rohin Shah
AF - [AN #168]: Four technical topics for which Open Phil is soliciting grant proposals by Rohin Shah
Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist ... Read more
28 Oct 2021
•
12mins
AF - [AN #167]: Concrete ML safety problems and their relevance to x-risk by Rohin Shah
AF - [AN #167]: Concrete ML safety problems and their relevance to x-risk by Rohin Shah
Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist ... Read more
20 Oct 2021
•
14mins
AF - [AN #166]: Is it crazy to claim we're in the most important century? by Rohin Shah
AF - [AN #166]: Is it crazy to claim we're in the most important century? by Rohin Shah
Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist ... Read more
8 Oct 2021
•
10mins
55. Rohin Shah - Effective altruism, AI safety, and learning human preferences from the state of the world
55. Rohin Shah - Effective altruism, AI safety, and learning human preferences from the state of the world
If you walked into a room filled with objects that were scattered around somewhat randomly, how important or expensive w... Read more
28 Oct 2020
•
51mins