AI Red Teaming & Scalable Evaluation
Who does the work of keeping AI systems safe? This line of research examines the human labor behind AI red teaming and safety evaluation — who is hired to stress-test models, how their work is structured, and what gets lost when evaluation is automated at scale.
Jan 1, 2024