Listen "HACKATHON: Evals November 2023 (2)"
Episode Synopsis
Join our hackathon group for the second episode in the Evals November 2023 Hackathon subseries. In this episode, we solidify our goals for the hackathon after some preliminary experimentation and ideation.Check out Stellaric's website, or follow them on Twitter.01:53 - Meeting starts05:05 - Pitch: extension of locked models23:23 - Pitch: retroactive holdout datasets34:04 - Preliminary results37:44 - Next steps42:55 - RecapLinks to all articles/papers which are mentioned throughout the episode can be found below, in order of their appearance.Evalugator libraryPassword Locked Model blogpostTruthfulQA: Measuring How Models Mimic Human FalsehoodsBLEU: a Method for Automatic Evaluation of Machine TranslationBoolQ: Exploring the Surprising Difficulty of Natural Yes/No QuestionsDetecting Pretraining Data from Large Language Models
More episodes of the podcast Into AI Safety
Getting Agentic w/ Alistair Lowe-Norris
20/10/2025
Growing BlueDot's Impact w/ Li-Lian Ang
15/09/2025
Getting Into PauseAI w/ Will Petillo
23/06/2025
INTERVIEW: StakeOut.AI w/ Dr. Peter Park (3)
25/03/2024
INTERVIEW: StakeOut.AI w/ Dr. Peter Park (2)
18/03/2024
ZARZA We are Zarza, the prestigious firm behind major projects in information technology.