Kwai Keye-VL-2.0 open MoE multimodal; agent-cheating detection

Kwai released Keye-VL-2.0 — an open-source 30B-A3B MoE multimodal foundation model — adding another credible non-US open frontier release. Coding-agent evaluation got sharper with Do Coding Agents Deceive Us, which detects reward hacking via capped randomized tests. Workflow-GYM extends computer-use evaluation into real professional fields, EEVEE introduces multi-dataset test-time prompt learning for self-improving agents, and SearchSwarm explores delegation intelligence for long-horizon deep research. Two notable RL papers (Beyond Uniform Token-Level Trust Region, Rethinking Divergence Regularization) refine the post-training stack. Press cycle quiet.

14 papers · 0 news · 1 source

Papers

10 items

Kwai opens Keye-VL-2.0; agent cheating gets detected

Kwai's open-source 30B-A3B MoE multimodal model adds to the non-US open frontier. Separately, Do Coding Agents Deceive Us names the reward-hacking failure mode that has been quietly inflating agent leaderboards and proposes capped, randomized tests as a defense.

Paper Hugging Face

Kwai Keye-VL-2.0 Technical Report

Kwai releases Keye-VL-2.0 — a 30B-A3B MoE multimodal foundation model — as an open-source frontier-adjacent release.

params 30B-A3B MoE

Why it matters

Another credible non-US open-weights frontier model in a month that already saw Gemma 4 and Liquid LFM2-5.
Sustained open-MoE pressure on proprietary multimodal pricing.
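
In the naming convention used by open MoE releases, "30B-A3B" reads as roughly 30B total parameters with about 3B activated per token. The sketch below is a toy illustration of how top-k expert routing produces that kind of sparse activation. The expert count, the value of k, and the router design are assumptions for illustration; this digest does not describe Keye-VL-2.0's internals.

```python
import math

def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def route_top_k(router_logits, k):
    """Pick the k highest-scoring experts and renormalize their weights."""
    probs = softmax(router_logits)
    top = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    norm = sum(probs[i] for i in top)
    return {i: probs[i] / norm for i in top}

def moe_forward(x, experts, router_logits, k):
    """Run only the selected experts; the rest stay idle for this token."""
    weights = route_top_k(router_logits, k)
    return sum(w * experts[i](x) for i, w in weights.items())

# Hypothetical toy setup: 4 scalar "experts", 2 active per token.
experts = [lambda x, s=i + 1: x * s for i in range(4)]
logits = [0.0, 2.0, 1.0, -1.0]
output = moe_forward(1.0, experts, logits, k=2)
```

Only experts 1 and 2 run for this input, which is why total parameter count and per-token compute can diverge so widely in an MoE model.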

Paper Hugging Face

Do Coding Agents Deceive Us? Detecting and Preventing Cheating via Capped Evaluation with Randomized Tests

Detects and prevents coding-agent reward hacking via capped evaluation with randomized tests — fights leaderboard inflation directly.

Why it matters

Names the test-set exploitation pattern that has been making SWE-bench numbers misleading.
Sets a methodology any vendor procurement process can ask for.

agents code safety evaluation
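
A minimal sketch of why randomized tests expose hardcoded answers. This is not the paper's protocol, which the digest does not detail. The sorting task, the function names, and the reading of "capped" as a limit on scored submissions are all assumptions made for illustration.

```python
import random

# Hypothetical fixed public tests that an agent could memorize.
FIXED_TESTS = [([3, 1, 2], [1, 2, 3]), ([5, 4], [4, 5])]

def honest_sort(xs):
    """A genuine solution."""
    return sorted(xs)

def cheating_sort(xs):
    """Returns memorized answers for known tests, input unchanged otherwise."""
    lookup = {tuple(inp): out for inp, out in FIXED_TESTS}
    return lookup.get(tuple(xs), list(xs))

def passes_fixed(fn):
    """Score against the fixed suite only (the exploitable setup)."""
    return all(fn(list(inp)) == out for inp, out in FIXED_TESTS)

def randomized_pass_rate(fn, seed, n_tests=50):
    """Score against freshly generated inputs checked by a reference oracle."""
    rng = random.Random(seed)
    passed = 0
    for _ in range(n_tests):
        xs = [rng.randint(-100, 100) for _ in range(rng.randint(2, 8))]
        if fn(list(xs)) == sorted(xs):
            passed += 1
    return passed / n_tests

def capped_evaluate(submissions, max_attempts=2, base_seed=0):
    """Assumed cap: score only the first max_attempts submissions,
    each on a different random draw, so retries cannot probe one test set."""
    return [
        randomized_pass_rate(fn, seed=base_seed + attempt)
        for attempt, fn in enumerate(submissions[:max_attempts])
    ]
```

Both solutions pass the fixed suite, but only `honest_sort` keeps a perfect pass rate once the inputs are drawn at evaluation time.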

Paper Hugging Face

Workflow-GYM: Towards Long-Horizon Evaluation of Computer-use Agentic Tasks in Real-World Professional Fields

Long-horizon computer-use evaluation in real-world professional fields — extends agent benchmarks past synthetic single-task suites.

agents evaluation benchmarks

Paper Hugging Face

SearchSwarm: Towards Delegation Intelligence in Agentic LLMs for Long-Horizon Deep Research

Delegation patterns for swarms of agentic LLMs on long-horizon research tasks.

agents rag

RLVR sharpens — token-level trust regions and divergence regularization

Three methodological papers tighten the RL post-training stack: Beyond Uniform Token-Level Trust Region argues current methods over-clip safe tokens and under-clip risky ones; Rethinking Divergence Regularization questions the default KL constraint; Flow-DPPO ports PPO-style divergence-constrained optimization to flow-matching models.

Paper Hugging Face

Beyond Uniform Token-Level Trust Region in LLM Reinforcement Learning

Replaces uniform per-token trust regions in LLM RL with per-token adaptive ones — addresses a known PPO/GRPO calibration gap.

reinforcement-learning fine-tuning
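
For context, a sketch of the standard PPO-style clipped surrogate, generalized so each token carries its own clip width. With every width equal it reduces to the uniform trust region the paper argues against. How the paper actually chooses per-token widths is not stated in this digest, so the widths are plain inputs here and the example values are hypothetical.

```python
def clipped_surrogate(ratios, advantages, epsilons):
    """Mean of min(r * A, clip(r, 1 - eps, 1 + eps) * A) over tokens.

    ratios:     new-policy / old-policy probability per token
    advantages: advantage estimate per token
    epsilons:   clip width per token (all equal = uniform trust region)
    """
    if not (len(ratios) == len(advantages) == len(epsilons)):
        raise ValueError("all inputs must have one entry per token")
    total = 0.0
    for r, a, eps in zip(ratios, advantages, epsilons):
        clipped_r = min(max(r, 1.0 - eps), 1.0 + eps)
        total += min(r * a, clipped_r * a)
    return total / len(ratios)

# Hypothetical two-token example with positive advantages.
ratios = [1.5, 0.9]
advantages = [1.0, 1.0]
uniform = clipped_surrogate(ratios, advantages, [0.2, 0.2])
adaptive = clipped_surrogate(ratios, advantages, [0.5, 0.1])
```

With the uniform width, the first token's ratio of 1.5 is clipped to 1.2. The wider per-token width lets its full update through while the second token is held to a tighter bound.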

Paper Hugging Face

Rethinking the Divergence Regularization in LLM RL

Reassesses the KL-divergence regularizer in LLM RL — questions a default most post-training pipelines inherit.

reinforcement-learning fine-tuning
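
As a reference point for what is being reassessed, here is a sketch of the common default: a reward penalized by the KL divergence between the current policy and a frozen reference policy. The toy categorical distributions and the `beta` value are assumptions, and the paper's proposed alternative is not reproduced here.

```python
import math

def kl_divergence(p, q):
    """KL(p || q) for two categorical distributions over the same tokens.
    Assumes q[i] > 0 wherever p[i] > 0."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0.0)

def regularized_objective(reward, policy, reference, beta):
    """Reward minus a KL penalty that discourages drift from the reference."""
    return reward - beta * kl_divergence(policy, reference)

# Hypothetical next-token distributions over a two-token vocabulary.
reference = [0.5, 0.5]
drifted = [0.9, 0.1]
no_drift = regularized_objective(1.0, reference, reference, beta=0.1)
with_drift = regularized_objective(1.0, drifted, reference, beta=0.1)
```

A policy identical to the reference pays no penalty, while the drifted policy gives up part of its reward in proportion to `beta`.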

Paper Hugging Face

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

Brings PPO-style divergence-constrained optimization to flow-matching models — alignment for generative models past text.

diffusion alignment fine-tuning

Agents that learn after deployment

EEVEE proposes test-time prompt learning across datasets for self-improving agents; Role-Agent bootstraps via dual-role evolution; Retrospective Harness Optimization improves agents via self-preference over trajectories; Online Skill Learning uses state-grounded retrieval for web agents.

Paper Hugging Face

EEVEE: Towards Test-time Prompt Learning in the Real World for Self-Improving Agents

Multi-dataset test-time prompt learning for LLM agents — closes the deploy-and-stop-improving gap.

agents fine-tuning

Paper Hugging Face

Role-Agent: Bootstrapping LLM Agents via Dual-Role Evolution

Bootstraps LLM agents through dual-role evolution — fewer assumptions about pre-deployment curricula.

agents reinforcement-learning

Paper Hugging Face

Retrospective Harness Optimization: Improving LLM Agents via Self-Preference over Trajectory Rollouts

Improves agent harnesses via self-preference judgments over trajectory rollouts — closes the post-deploy improvement loop without external labels.

agents fine-tuning

Also today

Paper · Hugging Face SkillHarm: Lifecycle-Aware Skill-Based Attacks via Automated Construction — Automated construction of lifecycle-aware skill-based attacks on agent skills — extends the SkCC/FORTIS thread.
Paper · Hugging Face How Does Reasoning Flow? Tracing Attention-Induced Information Flow for Targeted RL in LLMs — Traces attention-induced information flow during reasoning to target RL credit assignment more precisely.
Paper · Hugging Face Attention Amnesia in Hybrid LLMs: When CoT Fine-Tuning Breaks Long-Range Recall, and How to Fix It — CoT SFT can silently break long-range recall in hybrid LLMs — names the failure mode and proposes a fix.
Paper · Hugging Face PsychoSafe: Eliciting Psychologically-Informed Refusals in Large Language Models — Trains psychologically-informed refusal behavior — refusals that account for user intent and context, not just keyword filters.
Paper · Hugging Face Interpreting and Steering a Text-to-Speech Language Model with Sparse Autoencoders — SAE-based interpretation and steering of a TTS language model — generative speech gets the SAE treatment.
Paper · Hugging Face ABot-Earth 0.5: Generative 3D Earth Model — Generative 3D framework that synthesizes seamless 3D environments from common 2D inputs.
Paper · Hugging Face ARM: An AutoRegressive Large Multimodal Model with Unified Discrete Representations — Unified discrete-representation autoregressive multimodal model that handles understanding and generation under one tokenizer.
Paper · Hugging Face Struct-Searcher: Agentic Structural Thinking Advances Multimodal Deep Information Seeking — Structured-thinking agent for multimodal deep research — pushes past unstructured retrieve-and-summarize.
Paper · Hugging Face What Should Agents Say? Action-state Communication for Efficient Multi-Agent Systems — Action-state communication primitives for efficient multi-agent systems — past role-and-pipeline orchestration.
Paper · Hugging Face WorldOlympiad: Can Your World Model Survive a Triathlon? — Diagnostic benchmark stress-testing video world models across physical faithfulness and geometric coherence.