Track important changes in AI Safety and Alignment, including capabilities, product updates, adoption signals, risks, and evidence worth continued monitoring.
Signal Feed
bytedance/deer-flow PR #3060 introduces a new `scanner_fail_open` flag so `scan_skill_content()` can stop treating all non-executable skill writes as hard failures when the moderation model is down, reducing a fail-closed behavior that could stop all skill evolution, while executable skill files remain blocked whenever scanning cannot run.
Andrej Karpathy publicly announced joining Anthropic, and commentary indicates he is starting on Anthropic’s pre-training team, which runs the large-scale training work behind Claude.
A widely circulated report and Hacker News discussion shows strong public reaction to Eric Schmidt’s AI-focused graduation remarks being booed, with participants challenging the optimism around LLM progress and warning about exclusion, social control, and inequality risks.
Topic Timeline
talent move
pull request
public sentiment shift