Kwai Keye-VL-2.0 open MoE multimodal; agent-cheating detection
Kwai released Keye-VL-2.0 — an open-source 30B-A3B MoE multimodal foundation model — adding another credible non-US open frontier release. Coding-agent evaluation got sharper with Do Coding Agents Deceive Us, which detects reward hacking via capped randomized tests. Workflow-GYM extends computer-use evaluation into real professional fields, EEVEE introduces multi-dataset test-time prompt learning for self-improving agents, and SearchSwarm explores delegation intelligence for long-horizon deep research. Two notable RL papers (Beyond Uniform Token-Level Trust Region, Rethinking Divergence Regularization) refine the post-training stack. Press cycle quiet.