Sarmadi AI Digest May 17, 2026 Updated 6:55 AM CT Today Archive Topics Saved Subscribe RSS

Thin Saturday — NVIDIA opens a 2.6B world model; arXiv slop ban hardens

A quiet weekend cycle with two real signals. NVIDIA's SANA-WM dropped on Hacker News with 348 points — a 2.6B open-source world model that generates a full minute of 720p video on commodity GPUs, the kind of open release that resets the cost floor for video-AI startups overnight. ArXiv's slop ban tightened with a clearer enforcement story: a yearlong submission ban for authors who let LLMs do the writing. The AI 'haves and have nots' framing landed in TechCrunch — useful read on why the industry's mood has soured even as the numbers grow. Underneath, a thin but practical research drop: few-shot guidance for verifiable-reward RL, a fresh look at debiased omni-modal evaluation, and a sphere-aware flow-matching tweak that fixes a quiet bug in latent image diffusion.

6 papers 5 news 4 sources ← Latest

News

4 items

NVIDIA opens a 2.6B video world model

NVIDIA Labs released SANA-WM, a 2.6B open-source world model that generates a minute of 720p video on commodity hardware. Open weights at this size and quality compress the moat for closed video-AI vendors and give SMB builders a credible foundation to fine-tune.

News Hacker News

SANA-WM, a 2.6B open-source world model for 1-minute 720p video

NVIDIA Labs released SANA-WM — a 2.6B-parameter open-source world model that generates one minute of 720p video at consumer-GPU scale (348 HN points).

params 2.6Bvideo length 1 minresolution 720p
Why it matters
  • Open weights at this size make video-AI viable to fine-tune without lab-scale compute.
  • Compresses the moat for closed video-generation vendors (Runway, Pika, OpenAI Sora) on the consumer tier.
  • Sets a new reference point for what 'small' world models can do — relevant for any team training on game/sim data.

arXiv's slop ban gets a sharper edge

TechCrunch's writeup of arXiv's new policy makes the enforcement explicit: a yearlong submission ban for authors who let LLMs do all the work. After this week's first wave of coverage, the policy is now being read as a model other preprint servers and journals will follow.

The AI gold rush's haves and have-nots

TechCrunch crystallized the mood with a 'haves and have nots' read on the current cycle — the industry is making money and laying people off at the same time, and the dissonance is becoming the dominant story for non-flagship operators. Greg Brockman's product consolidation at OpenAI is one face of it; SMBs trying to keep up are the other.

Papers

3 items

Thin but practical research drop

A short list of HF papers worth a scan: few-shot guidance for verifiable-reward RL on hard problems, a debiased evaluation audit of omni-modal benchmarks, and a sphere-aware flow-matching fix that addresses a quiet correctness gap in latent image diffusion.

Also today