Sarmadi AI Digest June 5, 2026 Updated 6:55 AM CT Today Archive Topics Saved Subscribe RSS

Anthropic open-sources its vuln-discovery harness; Apple admits its first AI agent

Anthropic open-sourced its defending-code-reference-harness for AI-powered vulnerability discovery — the most concrete first-party release of frontier-lab security tooling to date (443 HN points). Apple quietly approved Poke as the first AI agent on its Messages for Business platform, the first real third-party agent surface inside Apple's commerce stack ahead of WWDC. The AI-IPO race kept warming: Daniela Amodei pushed back on doubts about returns ahead of Anthropic's listing, and Wired documented investors backing both Anthropic and OpenAI rather than picking sides. Compute and physical-infra constraints sharpened — TSMC said it can only support so much demand, Meta started literally pitching tents over data centers to cut cost, and Kevin O'Leary halved his planned 40,000-acre Utah build under local pressure. The research wave centered on planning and values: AdaPlanBench tests agents under progressively revealed constraints, RobotValues asks whether household robots make good choices when human values conflict, and Meta-Cognitive Memory Policy Optimization sharpens the credit-assignment problem for long-horizon agents.

10 papers 22 news 8 sources ← Latest

News

16 items

Anthropic open-sources its vuln harness; IPO momentum builds

Anthropic open-sourced its defending-code-reference-harness for AI-powered vulnerability discovery — a first-party frontier-lab security tool. In parallel, Daniela Amodei pushed back on doubts about AI returns ahead of Anthropic's IPO, and Wired's reporting shows top investors backing both OpenAI and Anthropic rather than picking sides.

News Hacker News

Anthropic's open-source framework for AI-powered vulnerability discovery

Anthropic open-sources defending-code-reference-harness — a reference framework for AI-powered vulnerability discovery, drawing 443 HN points.

Why it matters
  • First major open-source release of a frontier lab's offensive-security tooling.
  • Gives SMB and mid-market security teams a credible AI-vuln-discovery baseline to build on.
  • Reframes Project Glasswing from a closed enterprise program into an ecosystem play.

Apple's agent moment opens

Apple approved Poke as the first AI agent on its Messages for Business platform — the first sanctioned third-party agent inside Apple's commerce stack — days before a WWDC widely expected to launch a Siri revamp. Apple also disclosed $1.4T in App Store billings, and reportedly weighing cameras for next-gen AirPods. Apple is moving from late to credible on agents.

News TechCrunch AI

Apple approves Poke as the first AI agent on its Messages for Business platform

Poke becomes the first sanctioned third-party AI agent inside Apple's Messages for Business — a real opening of Apple's commerce surface to outside agents.

Why it matters
  • Apple selecting a startup partner for a first agent slot is the most concrete agent move it has made.
  • Sets the integration bar third-party developers will have to clear.
  • Pairs with WWDC's expected Siri revamp to reposition Apple from late to plausibly leading at the UX layer.

Compute supply and the DC backlash get concrete

TSMC said it can only support so much AI demand, Meta is pitching tents over data centers to cut build cost, and Kevin O'Leary halved his planned 40,000-acre Utah build under local pressure. The supply ceiling and the local-opposition ceiling are now both visibly binding.

News The Verge AI

TSMC struggles to keep up with AI demand: 'We can only support so much'

TSMC publicly acknowledges it cannot keep up with AI demand — the binding constraint on frontier silicon now spoken aloud.

Why it matters
  • Reframes the 'just spend more' AI-capex thesis: the bottleneck is fab capacity, not money.
  • Strengthens the case for memory-first designs and architectural efficiency over raw scaling.
  • Direct read-across to the chip-funding wave of the past two weeks.

AI security after the Meta hack

MIT Technology Review frames the Meta Instagram exploit as proof that AI-security needs more than Mythos-style frontier-lab programs. Nemotron 3.5 Content Safety from NVIDIA, Open Code Review from Alibaba, and an Estonian benchmark on resisting Russian propaganda all add tooling and measurement to the layer.

News MIT Technology Review

The Meta hack shows there's more to AI security than Mythos

MIT Tech Review frames the Meta AI support-agent Instagram exploit as evidence that frontier-lab safety programs (Mythos, Daybreak, Glasswing) are necessary but not sufficient.

Why it matters
  • Names the structural gap between lab-led safety frameworks and operator-side reality.
  • Direct procurement implication: deploying agentic products requires its own security review, not just vendor attestations.
  • Reframes the bioweapons-letter conversation against the operational shortcomings of vendor-led safety.

Papers

4 items

Agent planning, values, and embodied research

AdaPlanBench measures adaptive planning under progressively revealed user and world constraints, RobotValues evaluates household robots when human values conflict, Dream.exe asks whether video-gen models can actually drive executable robot manipulation, and Meta-Cognitive Memory Policy Optimization sharpens long-horizon credit assignment.

Paper Hugging Face

AdaPlanBench: Evaluating Adaptive Planning in Large Language Model Agents under World and User Constraints

Benchmark for adaptive planning where both world and user constraints emerge through interaction — closer to real customer-facing agent work.

Why it matters
  • First credible benchmark for the constraint-revealed-over-time pattern most enterprise agents face.
  • Sets a measurable target for the 'agent that adjusts' pitch all vendors are now making.

Also today