Listen ""Cyborgism" by Nicholas Kees & Janus"
Episode Synopsis
https://www.lesswrong.com/posts/bxt7uCiHam4QXrQAA/cyborgism

There is a lot of disagreement and confusion about the feasibility and risks of automating alignment research. Some see it as the default path toward building aligned AI, while others expect limited benefit from near-term systems, anticipating that the ability to significantly speed up progress will arrive only well after misalignment and deception. Furthermore, progress in this area may directly shorten timelines or enable the creation of dual-purpose systems which significantly speed up capabilities research.

OpenAI recently released their alignment plan. It focuses heavily on outsourcing cognitive work to language models, transitioning us to a regime where humans mostly provide oversight to automated research assistants. While there have been many objections to and concerns about this plan, there hasn't been a strong alternative approach that aims to automate alignment research while also taking the many risks seriously.

The intention of this post is not to propose a cure-all for the tricky problem of accelerating alignment using GPT models. Instead, the purpose is to explicitly put another point on the map of possible strategies, and to add nuance to the overall discussion.
More episodes of the podcast LessWrong (Curated & Popular)
"Little Echo" by Zvi (09/12/2025)
"AI in 2025: gestalt" by technicalities (08/12/2025)