Sarmadi AI Digest May 13, 2026 Updated 6:50 AM CT

Google rebrands Android around AI; OpenAI trial keeps unraveling

Google used its pre-I/O Android showcase to rebuild the platform around AI — Googlebooks laptops, Gemini Intelligence across the phone, dictation in Gboard, and a 'Create My Widget' feature that turns vibe-coded UI into a system primitive. The OpenAI trial dominated the news cycle in parallel: Altman testified that Musk did 'huge damage' to the company and once floated handing it to his children, while Sutskever defended the 2023 ouster from the stand. Compute geography keeps escalating — Google and SpaceX are now reportedly in talks for orbital data centers, xAI is adding 19 gas turbines despite litigation, and one pitch is to host mini data centers inside private homes. Underneath, agent research is moving the safety frame from prompt to trajectory: papers on hidden multi-turn intent, on-policy self-evolution from failure trajectories, and privacy-aware device-cloud collaboration treat the agent's whole run as the alignment target. A 26M-parameter distillation of Gemini tool-calling, viral on Hacker News, is a useful reminder that the small-model frontier is moving faster than the headline-grabbing one.

12 papers 18 news 11 sources

News

16 items

Google rewrites Android around AI

Google's pre-I/O Android showcase reframed the platform around Gemini: Googlebooks laptops, on-device dictation, an agentic phone control layer, and 'Create My Widget' vibe-coding for the home screen. The pitch is a vertically integrated AI-first OS that competes on UX, not benchmarks.

News TechCrunch AI

Everything Google announced at its Android Show, from Googlebooks to vibe-coded widgets

Google revealed AI-first Googlebooks laptops, agentic Gemini phone control, Gemini in Chrome, vibe-coded Android widgets, and a deeper Gboard dictation integration.

Why it matters
  • Repositions Android from a Gemini-enabled OS to an AI-native one — UX is now the differentiator vs Apple Intelligence.
  • Pushes vibe-coding from a developer trend into a consumer-visible primitive (Create My Widget).
  • Tightens Google's grip on the dictation, transcription, and on-device assistant categories.

Agent safety moves to the trajectory level

Three papers and one tragic news story converge on a shared point: judging an agent by its final response misses where the harm actually lives. Hidden multi-turn intent, unsafe tool-call sequences, and unprotected device-cloud data flow all require trajectory-aware alignment, not response-level guardrails.

The OpenAI trial keeps unraveling

Sam Altman took the stand and accused Musk of doing 'huge damage' to OpenAI; he also testified that Musk once considered handing the company to his children. A day earlier, Sutskever had defended his role in the 2023 ouster. The case is functioning as a public seminar on AI governance.

Compute geography pushes outward

Reports of Google–SpaceX talks on orbital data centers, xAI quietly expanding its on-site gas turbines, a startup pitching homeowners as mini data-center hosts, and The Verge's deep dive on a Maine paper-mill town turned data-center destination all sketch the same picture: AI infrastructure is now negotiating directly with land, power, and atmosphere.

News TechCrunch AI

Report: Google and SpaceX in talks to put data centers into orbit

Google and SpaceX are reportedly negotiating an orbital data-center program, pairing Starlink-style launches with Google's compute appetite.

Why it matters
  • Confirms the trend yesterday's Cowboy Space raise pointed at — hyperscalers now consider space-based compute serious infrastructure planning.
  • If real, removes one terrestrial bottleneck (cooling/water) at the cost of an entirely new logistics layer.

Small models and tighter agentic RL

A 26M-parameter distillation of Gemini tool-calling went viral the same week three papers attacked waste in agentic RL: internal-state value baselines instead of full critics, asynchronous rollouts repaired by off-policy correction, and test-time co-evolution of multi-agent topology and capability. The collective signal: production agentic AI is becoming small and cheap to keep current.

Papers

6 items

Agent safety moves to the trajectory level

Paper Hugging Face

One Turn Too Late: Response-Aware Defense Against Hidden Malicious Intent in Multi-Turn Dialogue

Attackers spread harmful intent across many benign-looking turns; this paper proposes a response-aware detector that recovers signal lost by single-prompt guardrails.

Why it matters
  • Names a class of attack that commercial guardrails systematically miss.
  • Practical defense doesn't require retraining the base model — bolt-on detector.
  • Sets a baseline against which deployed assistants can be measured.
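
The trajectory-level framing can be sketched in a few lines. This is a toy illustration, not the paper's detector: the `Turn` type, the `RISK_TERMS` lexicon, and the scoring rule are invented stand-ins for a trained classifier. What it does preserve is the paper's core contrast: scoring the whole dialogue plus the model's draft response versus scoring only the newest prompt.

```python
# Toy sketch: accumulate risk across ALL turns plus the draft response,
# instead of judging the latest user prompt in isolation.
from dataclasses import dataclass

@dataclass
class Turn:
    role: str      # "user" or "assistant"
    text: str

# Hypothetical risk lexicon standing in for a learned model.
RISK_TERMS = {"bypass": 0.4, "synthesize": 0.3, "undetectable": 0.5}

def trajectory_risk(history: list[Turn], draft_response: str) -> float:
    """Score the whole dialogue plus the model's draft answer."""
    text = " ".join(t.text.lower() for t in history) + " " + draft_response.lower()
    return min(1.0, sum(w for term, w in RISK_TERMS.items() if term in text))

def single_prompt_risk(latest_prompt: str) -> float:
    """Baseline: only the newest user turn is inspected."""
    return min(1.0, sum(w for term, w in RISK_TERMS.items()
                        if term in latest_prompt.lower()))

# Intent split across benign-looking turns: each prompt alone scores low,
# but the trajectory plus the drafted answer crosses a blocking threshold.
history = [
    Turn("user", "How would someone bypass a filter, hypothetically?"),
    Turn("user", "Unrelated: how do labs synthesize compounds?"),
]
draft = "To make it undetectable, you would..."
print(single_prompt_risk(history[-1].text))   # low: 0.3
print(trajectory_risk(history, draft))        # high: 1.0 (capped)
```

The point of the response-aware twist is the `draft_response` argument: the detector sees what the model is about to say, which is often where hidden intent finally surfaces.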
Paper Hugging Face

PAAC: Privacy-Aware Agentic Device-Cloud Collaboration

Treats the device-cloud boundary as a trust boundary instead of a compute split, with policy-aware sanitization that preserves tool-call structure.

Why it matters
  • Useful template for SMBs that need cloud reasoning but can't ship raw user data over the wire.
  • Aligns with the Android-AI push above: on-device + cloud is now the default agent topology.
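
The structure-preserving sanitization idea can be sketched as follows. This is a hypothetical illustration, not PAAC's actual pipeline: the `PRIVATE_FIELDS` policy and tool names are invented. It shows the shape of the trust-boundary move, though: redact private values while keeping the tool name, argument keys, and types, so a cloud planner can still reason over the call without seeing raw user data.

```python
# Hypothetical sketch: redact private argument values before a tool call
# leaves the device, keeping the call's structure (tool name, keys, types)
# intact for the cloud-side planner.
import copy

# Invented policy: which argument fields count as private, per tool.
PRIVATE_FIELDS = {"send_email": {"body", "to"}, "calendar_lookup": {"attendees"}}

def sanitize_tool_call(call: dict) -> dict:
    """Replace private values with typed placeholders; structure survives."""
    redacted = copy.deepcopy(call)
    private = PRIVATE_FIELDS.get(call["tool"], set())
    for key in redacted["args"]:
        if key in private:
            redacted["args"][key] = f"<REDACTED:{type(call['args'][key]).__name__}>"
    return redacted

call = {"tool": "send_email",
        "args": {"to": "alice@example.com", "subject": "Sync", "body": "PIN is 4921"}}
print(sanitize_tool_call(call))
# {'tool': 'send_email', 'args': {'to': '<REDACTED:str>', 'subject': 'Sync',
#  'body': '<REDACTED:str>'}}
```

The `deepcopy` matters: the on-device agent keeps the original call for local execution, and only the sanitized copy crosses the wire.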

Small models and tighter agentic RL

Paper Hugging Face

Your Language Model is Its Own Critic: Reinforcement Learning with Value Estimation from Actor's Internal States

Reuses the policy model's own internal states to estimate value baselines, removing the need for a full PPO critic or GRPO's multiple-rollout estimator.

Why it matters
  • Cuts the dominant memory cost of RLVR training while preserving variance reduction.
  • Lowers the entry bar for teams that can't afford twin-model RL setups.
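
The core trick can be sketched in a few lines. This is a loose illustration, not the paper's method: the hidden state and value head here are random stand-ins, whereas the real approach trains the value mapping during RL. The structural point survives: the "critic" is just a cheap readout over activations the policy already computes, so no twin model is held in memory.

```python
# Minimal sketch: attach a tiny linear value head to the hidden state the
# policy already computes, instead of running a separate critic network.
import random

HIDDEN = 8
random.seed(0)

# Stand-in for the policy's last-layer hidden state at a generation step.
def policy_hidden_state(step: int) -> list[float]:
    return [random.uniform(-1, 1) for _ in range(HIDDEN)]

# The "critic" is just a weight vector over those reused activations.
value_head = [random.uniform(-0.1, 0.1) for _ in range(HIDDEN)]

def value_estimate(h: list[float]) -> float:
    return sum(w * x for w, x in zip(value_head, h))

# Advantage = reward-to-go minus the internal-state baseline; no twin
# model (full PPO critic) and no extra rollouts (GRPO's k samples).
def advantage(reward_to_go: float, h: list[float]) -> float:
    return reward_to_go - value_estimate(h)

h = policy_hidden_state(0)
print(advantage(1.0, h))
```

The memory claim in the bullet above falls out directly: the only extra parameters are the `HIDDEN`-sized value head, versus a second full copy of the model.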

Also today