Sobering up on AI Progress w/ Dr. Sean McGregor

Dec 29, 2025 · Into AI Safety
Sean McGregor and I discuss why evaluating AI systems has become so difficult; we cover the breakdown of benchmarking, how incentives shape safety work, and what approaches like BenchRisk (his recent paper at NeurIPS) and AI auditing aim to fix as systems move into the real world. We also talk about his history and journey in AI safety, including his PhD on ML for public policy, how he started the AI Incident Database, and what he's working on now: AVERI, a non-profit for frontier model auditing.

2025.12.29: AVERI will be officially launching in January of 2026. If you’re interested, consider checking out their listings to see if you’d be a good fit; if you do end up applying, let them know you found out about the opportunity from the podcast!

As part of my effort to make this whole podcasting thing more sustainable, I have created a Kairos.fm Patreon that includes an extended version of this episode. Supporting it gets you access to these extended cuts, as well as other perks in development.

INTERVIEW RECORDED 2025.11.25; ASIDES RECORDED 2025.12.20; TRANSCRIPT

Chapters

00:00:00 ❙ Intro
00:02:36 ❙ What's broken about benchmarking
00:03:41 ❙ Sean’s wild PhD
00:14:28 ❙ The phantom internship
00:19:25 ❙ Sean's journey
00:22:25 ❙ Market-vs-regulatory modes and AIID
00:32:13 ❙ Drunk on AI progress
00:38:34 ❙ BenchRisk
00:43:20 ❙ Moral hazards and Master Hand
00:50:34 ❙ Liability, Section 230, and open source
00:59:20 ❙ AVERI
01:11:30 ❙ Closing thoughts & outro

BenchRisk

  • BenchRisk website
  • NeurIPS paper - Risk Management for Mitigating Benchmark Failure Modes: BenchRisk
  • NeurIPS paper - AI and the Everything in the Whole Wide World Benchmark
  • CACM paper - Datasheets for Datasets

AIID

  • AI Incident Database website
  • IAAI paper - Preventing Repeated Real World AI Failures by Cataloging Incidents: The AI Incident Database
  • Preprint - Lessons for Editors of AI Incidents from the AI Incident Database
  • AIAAIC website (another incident tracker)

Hot AI Summer

  • CACM article - A Few Useful Things to Know About Machine Learning
  • CACM article - How the AI Boom Went Bust
  • Undergraduate Thesis - Analyzing the Prospect of an Approaching AI Winter
  • Tech Genies article - AI History: The First Summer and Winter of AI
  • CACM article - There Was No ‘First AI Winter’

Measuring Generalization

  • Neural Computation article - The Lack of A Priori Distinctions Between Learning Algorithms
  • ICLR paper - Understanding deep learning requires rethinking generalization
  • ICML paper - Model-agnostic Measure of Generalization Difficulty
  • Radiology Artificial Intelligence article - Generalizability of Machine Learning Models: Quantitative Evaluation of Three Methodological Pitfalls
  • Preprint - Quantifying Generalization Complexity for Large Language Models

Insurers Exclude AI

  • Financial Times article - Insurers retreat from AI cover as risk of multibillion-dollar claims mounts
  • Tom’s Hardware article - Major insurers move to avoid liability for AI lawsuits as multi-billion dollar risks emerge — Recent public incidents have lead to costly repercussions
  • Insurance Newsnet article - Insurers Scale Back AI Coverage Amid Fears of Billion-Dollar Claims
  • Insurance Business article - Insurance’s gen AI reckoning has come

Section 230

  • Section 230 overview
  • Legal sidebar - Section 230 Immunity and Generative Artificial Intelligence
  • Bad Internet Bills website
  • TechDirt article - Section 230 Faces Repeal. Support The Coverage That’s Been Getting It Right All Along.
  • Privacy Guides video - Dissecting Bad Internet Bills with Taylor Lorenz: KOSA, SCREEN Act, Section 230
  • Journal of Technology in Behavioral Health article - Social Media and Mental Health: Benefits, Risks, and Opportunities for Research and Practice
  • Time article - Lawmakers Unveil New Bills to Curb Big Tech’s Power and Profit
  • House Hearing transcript - Legislative Solutions to Protect Children and Teens Online

Relevant Kairos.fm Episodes

  • Into AI Safety episode - Growing BlueDot’s Impact w/ Li-Lian Ang
  • muckrAIkers episode - NeurIPS 2024 Wrapped 🌯