Anthropic open-sources its vuln-discovery harness; Apple approves its first AI agent

Anthropic open-sourced its defending-code-reference-harness for AI-powered vulnerability discovery — the most concrete first-party release of frontier-lab security tooling to date (443 HN points). Apple quietly approved Poke as the first AI agent on its Messages for Business platform, the first real third-party agent surface inside Apple's commerce stack ahead of WWDC. The AI-IPO race kept warming: Daniela Amodei pushed back on doubts about returns ahead of Anthropic's listing, and Wired documented investors backing both Anthropic and OpenAI rather than picking sides. Compute and physical-infra constraints sharpened — TSMC said it can only support so much demand, Meta started literally pitching tents over data centers to cut cost, and Kevin O'Leary halved his planned 40,000-acre Utah build under local pressure. The research wave centered on planning and values: AdaPlanBench tests agents under progressively revealed constraints, RobotValues asks whether household robots make good choices when human values conflict, and Meta-Cognitive Memory Policy Optimization sharpens the credit-assignment problem for long-horizon agents.

10 papers · 22 news · 8 sources

News

16 items

Anthropic open-sources its vuln harness; IPO momentum builds

Anthropic open-sourced its defending-code-reference-harness for AI-powered vulnerability discovery — a first-party frontier-lab security tool. In parallel, Daniela Amodei pushed back on doubts about AI returns ahead of Anthropic's IPO, and Wired's reporting shows top investors backing both OpenAI and Anthropic rather than picking sides.

News Hacker News

Anthropic's open-source framework for AI-powered vulnerability discovery

Anthropic open-sources defending-code-reference-harness — a reference framework for AI-powered vulnerability discovery, drawing 443 HN points.

Why it matters

First major open-source release of a frontier lab's offensive-security tooling.
Gives SMB and mid-market security teams a credible AI-vuln-discovery baseline to build on.
Reframes Project Glasswing from a closed enterprise program into an ecosystem play.

Source →

News TechCrunch AI

Ahead of its IPO, Anthropic's Daniela Amodei shrugs off doubts about AI's returns

Anthropic president Daniela Amodei publicly counters skepticism about AI returns, preparing the IPO market for breakneck-growth disclosures.

Why it matters

First major on-the-record narrative-setting by an executive ahead of the S-1.
Direct counter to the 'AI bubble' framing several Wall Street analysts have advanced.

funding market products

Source →

News Wired AI

OpenAI and Anthropic May Be Rivals, but Investors Aren't Picking Sides

Wired finds top venture investors are taking positions in both OpenAI and Anthropic — 'why wouldn't you want to be in both Pepsi and Coke?'

funding market

Source →

News TechCrunch AI

Mira Murati steps back into the spotlight, carefully

Thinking Machines' Mira Murati re-engages publicly — the 'remain heads-down' window is closing as the cycle intensifies.

market products

Source →

Apple's agent moment opens

Apple approved Poke as the first AI agent on its Messages for Business platform, the first sanctioned third-party agent inside Apple's commerce stack, days before a WWDC widely expected to launch a Siri revamp. Apple also disclosed $1.4T in App Store billings and is reportedly weighing cameras for next-gen AirPods. Apple is moving from late to credible on agents.

News TechCrunch AI

Apple approves Poke as the first AI agent on its Messages for Business platform

Poke becomes the first sanctioned third-party AI agent inside Apple's Messages for Business — a real opening of Apple's commerce surface to outside agents.

Why it matters

Apple selecting a startup partner for a first agent slot is the most concrete agent move it has made.
Sets the integration bar third-party developers will have to clear.
Pairs with WWDC's expected Siri revamp to reposition Apple from late to plausibly leading at the UX layer.

products agents market

Source →

News TechCrunch AI

What to expect from WWDC 2026: Siri's highly anticipated revamp and Apple Intelligence updates

WWDC 2026 preview: Siri revamp expected to land, Apple Intelligence updates across the OS lineup.

products

Source →

News TechCrunch AI

Apple touts $1.4 trillion in App Store billings and sales, 90% without a commission

Apple discloses $1.4T in App Store billings (up from $1.3T), with 90% commission-free — useful framing ahead of antitrust and AI-distribution arguments.

billings $1.4T · commission-free share 90%

market products

Source →

News Wired AI

Why Apple Might Put Cameras Into Its Next AirPods

Wired examines the rumored next-gen AirPods with cameras — a possible Apple wedge into ambient AI hardware.

products

Source →

Compute supply and the DC backlash get concrete

TSMC said it can only support so much AI demand, Meta is pitching tents over data centers to cut build cost, and Kevin O'Leary halved his planned 40,000-acre Utah build under local pressure. The supply ceiling and the local-opposition ceiling are now both visibly binding.

News The Verge AI

TSMC struggles to keep up with AI demand: 'We can only support so much'

TSMC publicly acknowledges it cannot keep up with AI demand — the binding constraint on frontier silicon now spoken aloud.

Why it matters

Reframes the 'just spend more' AI-capex thesis: the bottleneck is fab capacity, not money.
Strengthens the case for memory-first designs and architectural efficiency over raw scaling.
Direct read-across to the chip-funding wave of the past two weeks.

compute infrastructure market

Source →

News TechCrunch AI

Meta steals a tactic from Tesla and builds data centers in tents

Meta is using tent structures over data-center buildouts to slash construction time and cost — Tesla-style improvisation.

infrastructure market

Source →

News The Verge AI

Kevin O'Leary agrees to downsize massive Utah data center

Kevin O'Leary halved his planned 40,000-acre Utah data center under sustained local opposition.

Why it matters

Concrete example of a celebrity-backed AI infra project shrinking in response to community pressure.
Validates the political force of the Brockovich-led backlash narrative.

infrastructure policy regulation

Source →

News Ars Technica AI

How some data center operators are tackling their water use problems

Ars details operator-side cooling and water-recovery efforts as scrutiny over AI's water footprint intensifies.

infrastructure policy

Source →

AI security after the Meta hack

MIT Technology Review frames the Meta Instagram exploit as proof that AI security needs more than Mythos-style frontier-lab programs. Nemotron 3.5 Content Safety from NVIDIA, Open Code Review from Alibaba, and an Estonian benchmark on resisting Russian propaganda all add tooling and measurement to the security layer.

News MIT Technology Review

The Meta hack shows there's more to AI security than Mythos

MIT Tech Review frames the Meta AI support-agent Instagram exploit as evidence that frontier-lab safety programs (Mythos, Daybreak, Glasswing) are necessary but not sufficient.

Why it matters

Names the structural gap between lab-led safety frameworks and operator-side reality.
Direct procurement implication: deploying agentic products requires its own security review, not just vendor attestations.
Reframes the bioweapons-letter conversation against the operational shortcomings of vendor-led safety.

safety products policy

Source →

News Hugging Face

Nemotron 3.5 Content Safety: Customizable Multimodal Safety for Global Enterprise AI

NVIDIA ships Nemotron 3.5 Content Safety — customizable multimodal content-safety models for enterprise deployments.

safety open-weights multimodal

Source →

News Hacker News

Open Code Review — An AI-powered code review CLI tool

Alibaba open-sources an AI-powered code-review CLI tool (191 HN points).

code open-weights tools

Source →

News Ars Technica AI

These LLMs are the best at resisting Russian propaganda

Estonian government benchmark scores dozens of models on resistance to Russia's strategic narratives — first state-issued geopolitical-bias eval.

safety evaluation policy

Source →

Papers

4 items

Agent planning, values, and embodied research

AdaPlanBench measures adaptive planning under progressively revealed user and world constraints, RobotValues evaluates household robots when human values conflict, Dream.exe asks whether video-gen models can actually drive executable robot manipulation, and Meta-Cognitive Memory Policy Optimization sharpens long-horizon credit assignment.

Paper Hugging Face

AdaPlanBench: Evaluating Adaptive Planning in Large Language Model Agents under World and User Constraints

Benchmark for adaptive planning where both world and user constraints emerge through interaction — closer to real customer-facing agent work.

Why it matters

First credible benchmark for the constraint-revealed-over-time pattern most enterprise agents face.
Sets a measurable target for the 'agent that adjusts' pitch all vendors are now making.

agents evaluation benchmarks reasoning

Source → Arc

Paper Hugging Face

RobotValues: Evaluating Household Robots When Human Values Conflict

Evaluates the actions household robots choose when values conflict: task success vs. autonomy, efficiency vs. social appropriateness.

Why it matters

Direct relevance to Amazon Proteus, Hello Robot Stretch, and the consumer robot wave.
First serious benchmark for value-conflict choices in domestic settings.

robotics alignment evaluation

Source → Arc

Paper Hugging Face

Dream.exe: Can Video Generation Models Dream Executable Robot Manipulation?

Tests whether video-generation outputs can be turned into executable robot manipulation rather than just plausible footage.

video-generation robotics world-models

Source → Arc

Paper Hugging Face

Meta-Cognitive Memory Policy Optimization for Long-Horizon LLM Agents

Trains agent memory policies with meta-cognitive credit assignment — localizes where intermediate memory quality matters.

agents memory reinforcement-learning

Source → Arc

Also today

News · TechCrunch AI · Airbnb's Brian Chesky plans to launch a new AI lab · Airbnb's reasoning is that existing LLM partnerships haven't met its product needs.
News · TechCrunch AI · Is Silicon Valley ready to put robots in people's homes? Hello Robot is · Hello Robot ships the fourth-generation Stretch home-assistance robot as consumer robotics keeps inching past the hobbyist stage.
News · TechCrunch AI · Meta rolls out a new AI creator assistant on Facebook · The assistant helps Facebook creators read their analytics in natural language.
News · The Verge AI · Elon Musk is steamrolling Wall Street to become a trillionaire · Decoder podcast on Musk's path through SpaceX, X, xAI, and index-fund leverage toward a trillion-dollar net worth.
News · Ars Technica AI · Elon Musk tries again to escape FTC audits of X data handling · Musk renews his legal effort while the privacy regulator pushes back.
News · The Verge AI · AI leaders call for tougher protections against AI-aided bioweapons · The Verge details the OpenAI/Anthropic-led bioweapons letter to Congress.
News · The Verge AI · Let us filter AI slop, you cowards · The Verge demands AI-content filters across major platforms, reflecting user pressure for an opt-out from AI-generated content as a UX feature.
News · Ars Technica AI · The skeptic's guide to humanoid robots going viral on the Internet · Ars Technica primer on how to read viral humanoid-robot demos with skepticism.
News · Hugging Face · EVA-Bench Data 2.0: 3 Domains, 121 Tools, 213 Scenarios · ServiceNow ships EVA-Bench Data 2.0, a broader agent-evaluation benchmark.
News · Hacker News · Fine-tuning an LLM to write docs like it's 1995 · Practitioner walkthrough of fine-tuning an LLM on 1995-era documentation style.
News · Wired AI · AI Has Come for Serif Fonts · Wired on the surge of serif-font branding among AI companies, which critics call 'tasteslop'.
News · Wired AI · The AI IPO Race Heats Up, DOGE Whistleblower Sues Elon Musk · Uncanny Valley podcast covers the AI IPO race, the DOGE whistleblower suit, and the Meta Instagram hack.
Paper · Hugging Face · EvoDS: Self-Evolving Autonomous Data Science Agent with Skill Learning and Context Management · Autonomous data-science agent that learns reusable skills and manages long-horizon context, moving past static action sets.
Paper · Hugging Face · ArcANE: Do Role-Playing Language Agents Stay in Character at the Right Time? · Benchmark for role-playing language agents that evaluates whether character behavior evolves with story arc, not just factual recall.
Paper · Hugging Face · SePO: Self-Evolving Prompt Agent for System Prompt Optimization · Self-evolving prompt agent for system-prompt optimization that automates the most-tweaked surface in production AI apps.
Paper · Hugging Face · Combinatorial Synthesis: Scaling Code RLVR via Atomic Decomposition and Recombination · Scales code RLVR by decomposing solutions into atomic components and recombining them across training.
Paper · Hugging Face · TIDE: Proactive Multi-Problem Discovery via Template-Guided Iteration · Template-guided iterative discovery of multiple problems, useful for agents that should surface issues, not just answer prompts.