Reading List

Anthropic and other researchers detail "subliminal learning", where LLMs learn traits from model-generated data that is semantically unrelated to those traits (Anthropic) from Techmeme RSS feed.