Joe Carlsmith Audio
Audio versions of essays by Joe Carlsmith. Philosophy, futurism, and other topics. Text versions at joecarlsmith.com.
Joe Carlsmith Audio
Introduction and summary of "Scheming AIs: Will AIs fake alignment during training in order to get power?"
•
Joe Carlsmith
Use Left/Right to seek, Home/End to jump to start or end. Hold shift to jump forward or backward.
This is a recording of the introductory section of my report "Scheming AIs: Will AIs fake alignment during training in order to get power?". This section includes a summary of the full report. The summary covers most of the main points and technical terminology, and I'm hoping that it will provide much of the context necessary to understand individual sections of the report on their own. (Note: the text of the report itself may not be public by the time this episode goes live.)
0. Introduction
0.1 Preliminaries
0.2 Summary of the report
0.2.1 Summary of section 1
0.2.2 Summary of section 2
0.2.3 Summary of section 3
0.2.4 Summary of section 4
0.2.5 Summary of section 5
0.2.6 Summary of section 6