BlueDot Project

A simple technical explanation of RLH(AI)F

Machine Learning

A simple technical explanation of RLH(AI)F

Understanding reinforcement learning from human, or AI, feedback.

Sep 21, 2024

The US Government's AI Safety Gambit: A Step Forward or Just Another Voluntary Commitment?

The US Government's AI Safety Gambit: A Step Forward or Just Another Voluntary Commitment?

US AI Safety Institute signs voluntary agreement with OpenAI and Anthropic for model testing. Effectiveness uncertain; challenges in transparency, expertise, and implementation.

Sep 20, 2024