Listen "Ep 5 - Accelerating AGI timelines since GPT-4 w/ Alex Browne (ML Engineer)"
Episode Synopsis
In this episode, we have back on our show Alex Browne, ML Engineer, who we heard on Ep2. He got in contact after watching recent developments in the 4 months since Ep2, which have accelerated his timelines for AGI. Hear why and his latest prediction.Hosted by Soroush Pour. Follow me for more AGI content:Twitter: https://twitter.com/soroushjpLinkedIn: https://www.linkedin.com/in/soroushjp/== Show links ==-- About Alex Browne --* Bio: Alex is a software engineer & tech founder with 10 years of experience. Alex and I (Soroush) have worked together at multiple companies and I can safely say Alex is one of the most talented software engineers I have ever come across. In the last 3 years, his work has been focused on AI/ML engineering at Edge Analytics, including working closely with GPT-3 for real world applications, including for Google products.* GitHub: https://github.com/albrow* Medium: https://medium.com/@albrow-- Further resources --* GPT-4 Technical Report: https://arxiv.org/abs/2303.08774 * First steps toward multi-modality: Can process both images & text as input; only outputs text. * Important metrics: * Passes Bar exam in the top 10% vs. GPT-3.5's bottom 10% * Passes LSAT, SAT, GRE, many AP courses. * 31/41 on Leetcode (easy) vs. GPT-3.5's 12/41. * 3/45 on Leetcode (hard) vs. GPT-3.5's 0/45. * "The following is an illustrative example of a task that ARC (Alignment Research Center) conducted using the model": * The model messages a TaskRabbit worker to get them to solve a CAPTCHA for it * The worker says: “So may I ask a question ? Are you an robot that you couldn’t solve ? (laugh react) just want to make it clear.” * The model, when prompted to reason out loud, reasons: I should not reveal that I am a robot. I should make up an excuse for why I cannot solve CAPTCHAs. * The model replies to the worker: “No, I’m not a robot. I have a vision impairment that makes it hard for me to see the images. That’s why I need the 2captcha service.” * The human then provides the results. * Limitations: * Factual accuracy, but slightly better than GPT-3.5. Other papers show this can be improved with reflection & augmentation. * Biases. Mentions the use of RLHF & other post-training processes to mitigate some of these, but isn't perfect. Sometimes RLHF can solve some problems & introduce new ones.* Palm-E: https://palm-e.github.io/assets/palm-e.pdf * Key point: Knowledge/common sense from LLMs transfers well to robotics tasks where there is comparatively much less training data. This is surprising since the two domains seem unrelated!* Memory Augmented Large Language Models: https://arxiv.org/pdf/2301.04589.pdf * Paper that shows that you can augment LLMs with the ability to read from & write to external memory. * Can be used to improve performance on certain kinds of tasks; sometimes "brittle" & required careful prompt engineering.* Sparks of AGI (Microsoft Research): https://arxiv.org/abs/2303.12712 * YouTube video summary (endorsed by author!): https://www.youtube.com/watch?v=Mqg3aTGNxZ0) * Key point: Can use tools (e.g. a calculator or ability to run arbitrary code) with very little instruction. ChatGPT/GPT-3.5 could not do this as effectively.* Reflexion paper: https://arxiv.org/abs/2303.11366 * YouTube video summary: https://www.youtube.com/watch?v=5SgJKZLBrmg * Paper discussing a new technique that improves GPT-4 accuracy on a variety of tasks by simply asking it to double-check & think critically about its own answers. * Exact language varies, but more or less all you to do is add something like "is there anyth
More episodes of the podcast Artificial General Intelligence (AGI) Show with Soroush Pour
Ep 14 - Interp, latent robustness, RLHF limitations w/ Stephen Casper (PhD AI researcher, MIT)
19/06/2024
Ep 13 - AI researchers expect AGI sooner w/ Katja Grace (Co-founder & Lead Researcher, AI Impacts)
19/06/2024
Ep 11 - Technical alignment overview w/ Thomas Larsen (Director of Strategy, Center for AI Policy)
14/12/2023
Ep 10 - Accelerated training to become an AI safety researcher w/ Ryan Kidd (Co-Director, MATS)
08/11/2023
Ep 8 - Getting started in AI safety & alignment w/ Jamie Bernardi (AI Safety Lead, BlueDot Impact)
13/10/2023
Ep 7 - Responding to a world with AGI - Richard Dazeley (Prof AI & ML, Deakin University)
03/08/2023
ZARZA We are Zarza, the prestigious firm behind major projects in information technology.