Machine Learning

NeurIPS 2024 Wrapped 🌯

Machine Learning

NeurIPS 2024 Wrapped 🌯

The largest conference in machine learning had over 15,000 people in attendance, and so much tea!

Dec 30, 2024

Rethinking CyberSecEval: An LLM-Aided Approach to Evaluation Critique

Machine Learning

Rethinking CyberSecEval: An LLM-Aided Approach to Evaluation Critique

Investigating flaws in Meta Platforms' CyberSecEval series.

Nov 14, 2024

Benchmark Inflation: Revealing LLM Performance Gaps Using Retro-Holdouts

Machine Learning

Benchmark Inflation: Revealing LLM Performance Gaps Using Retro-Holdouts

Landing page for the Benchmark Inflation research paper.

Oct 11, 2024

Strawberry

Machine Learning

Strawberry

Introducing muckraikers, a podcast that tries to find meaning in the AI muck. This episode we discuss the release of OpenAI's o1 model, aka. Strawberry.

Sep 23, 2024

A simple technical explanation of RLH(AI)F

Machine Learning

A simple technical explanation of RLH(AI)F

Understanding reinforcement learning from human, or AI, feedback.

Sep 21, 2024

Let’s Talk About Emergence

Machine Learning

Let’s Talk About Emergence

"Emergence" has found its way into machine learning vocabulary, but current use as a machine learning specific keyword has resulted in a circular definition, further confused an already complex domain.

May 7, 2024

Open Source AI is a lie, but it doesn't have to be

Machine Learning

Open Source AI is a lie, but it doesn't have to be

Big Tech is attempting to redefine "Open Source" to their advantage; at the very least, we should know about it.

Apr 30, 2024