Listen "LessWrong: "How might we align transformative AI if it’s developed very soon?" by Holden Karnofsky"
Episode Synopsis
https://www.lesswrong.com/posts/rCJQAkPTEypGjSJ8X/how-might-we-align-transformative-ai-if-it-s-developed-very

This post is part of my AI strategy nearcasting series: trying to answer key strategic questions about transformative AI, under the assumption that key events will happen very soon, and/or in a world that is otherwise very similar to today's.

This post gives my understanding of what the set of available strategies for aligning transformative AI would be if it were developed very soon, and why they might or might not work. It is heavily based on conversations with Paul Christiano, Ajeya Cotra and Carl Shulman, and its background assumptions correspond to the arguments Ajeya makes in this piece (abbreviated as "Takeover Analysis").

I premise this piece on a nearcast in which a major AI company ("Magma," following Ajeya's terminology) has good reason to think that it can develop transformative AI very soon (within a year), using what Ajeya calls "human feedback on diverse tasks" (HFDT), and has some time (more than 6 months, but less than 2 years) to set up special measures to reduce the risks of misaligned AI before there's much chance of someone else deploying transformative AI.