Sarmadi AI Digest May 18, 2026 Updated 6:55 AM CT Today Archive Topics Saved Subscribe RSS

Eric Schmidt booed at graduation; Apple bets on Siri privacy

The AI backlash crystallized this weekend. Eric Schmidt was booed delivering an AI-heavy commencement address at the University of Arizona, NBC tracked the same pattern at multiple ceremonies, and Stratechery published a serious read on data-center discontent. A widely-shared post arguing AI does not actually speed up engineering processes pulled 609 points on Hacker News — the operator mood is no longer just discomfort, it's a measurable headwind. Apple's response is to ship Siri with auto-deleting chats, betting privacy can be the wedge that lets it catch up. Underneath, the research wave is consolidating on harder problems: hardware-adaptive attention that frees inference from H100-only optimization, RLVR methods that learn from failures and explore strategically, and a new GUI-agent benchmark that breaks the forgiving region-tolerant assumption with pixel-precise tasks.

9 papers 10 news 6 sources ← Latest

News

8 items

The AI backlash crystallizes

Eric Schmidt was booed at the University of Arizona for an AI-heavy commencement, NBC found the same pattern at multiple graduations, and a viral essay argued AI does not actually speed up engineering processes. Stratechery's data-center discontent piece is the most serious read of the cycle — opposition is now treated as a problem to be solved, not waved off.

News The Verge AI

University of Arizona students boo Eric Schmidt's AI cheerleading during commencement

Former Google CEO Eric Schmidt was booed at the University of Arizona commencement after a speech leaning hard on AI's future.

Why it matters
  • Visible college-age rejection of AI triumphalism — the workforce-entry cohort is signaling.
  • Forces speakers, recruiters, and companies to retire the 'AI is the future' applause line.
  • Pairs with the data-center backlash to mark a clear public-opinion turn.
News Hacker News

I don't think AI will make your processes go faster

Practitioner post (609 HN points) arguing AI coding tools don't speed up engineering processes — the bottleneck is review, coordination, and intent.

Why it matters
  • Reframes the productivity story away from token-throughput toward team coordination cost.
  • Plain-spoken counter to the executive talking point that AI mechanically multiplies output.

Apple bets on privacy as the Siri wedge

Apple's revamped Siri will reportedly include auto-deleting chats — a privacy posture aimed at differentiating against ChatGPT, Gemini, and Meta AI rather than competing on raw capability. After a year of being seen as behind, Apple's strategy is becoming legible.

News The Verge AI

Revamped Siri will reportedly offer autodeleting chats

Apple is positioning the next Siri around privacy — auto-deleting chats and on-device defaults as the wedge against rivals.

Why it matters
  • Differentiates Apple Intelligence on policy, not parameters — a fight Apple can credibly win.
  • Forces Google, OpenAI, and Meta to address chat retention as a competitive issue, not just a setting.

Inference frees itself from the H100

GQLA reworks DeepSeek's multi-head latent attention to expose multiple decoding paths, breaking the H100-only optimization that has constrained where the most efficient attention can run. A small but strategic shift for teams trying to deploy frontier-quality reasoning on non-NVIDIA accelerators.

Papers

7 items

RLVR grows up: failure signal and strategic exploration

Three new RLVR papers attack the dominant failure modes in verifiable-reward training: weak credit assignment on failed trajectories, brute-force exploration that scales rollouts without intelligence, and a deeper look at why on-policy distillation actually works. Together they argue RLVR's next gains will come from smarter signal, not more compute.

Visual agents push past the forgiving regime

PAGER breaks the region-tolerant assumption that has propped up GUI-agent benchmarks with a pixel-precise geometric task. MMSkills treats visual procedural knowledge as multimodal — not just text. CiteVQA scores document MLLMs on whether they ground their answers in the right passage, not just whether the answer happens to be correct.

Paper Hugging Face

PAGER: Bridging the Semantic-Execution Gap in Point-Precise Geometric GUI Control

GUI-agent benchmark that requires pixel-precise actions in continuous canvas space, breaking the region-tolerant assumption behind current scores.

Why it matters
  • Exposes that GUI-agent SOTA has been measured on forgiving targets — most fail under precision.
  • Practical implication: production GUI automation needs different model behavior than current evals reward.
Paper Hugging Face

CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence

Document VQA benchmark that scores whether MLLMs ground answers in the correct supporting passage — exposes right-answer-wrong-evidence as a real failure mode.

Why it matters
  • Critical for legal, financial, and medical AI use cases where the citation matters as much as the answer.
  • Builds a direct evaluation lever for compliance-driven document intelligence buyers.

Inference frees itself from the H100

GQLA reworks DeepSeek's multi-head latent attention to expose multiple decoding paths, breaking the H100-only optimization that has constrained where the most efficient attention can run. A small but strategic shift for teams trying to deploy frontier-quality reasoning on non-NVIDIA accelerators.

Also today