Câbles réseau bleus entremêlés représentant l'infrastructure des datacenters IA

Press Review — AI Agents, Infrastructure & Regulation — May 26, 2026

Three structural signals run through this edition: the AI agent ecosystem is expanding fast — Docker, LangChain, and Google Gemini Spark all shipping within the same week — while the hardware layer is under growing pressure from HBM memory shortages driven by AI datacenter demand. And as AI-powered platforms grow in reach, regulators are starting to push back.


Quoting SpaceX S-1

Source: Simon Willison  ·  Published: May 20, 2026  ·  evergreen

SpaceX’s S-1 filing documents Cloud Services Agreements with Anthropic PBC for access to compute capacity across COLOSSUS and COLOSSUS II. The numbers are notable: $1.25 billion per month from May 2029, with a reduced rate during the ramp-up in May and June 2026. Either party can terminate with 90 days’ notice, which makes the arrangement structurally fragile despite its scale. What the filing reveals above all is the industrial magnitude of frontier AI training compute — and which actors are positioned to provide it.

Read the full article →


Google I/O, Gemini Spark, Antigravity

Source: Simon Willison  ·  Published: May 20, 2026  ·  evergreen

Google I/O’s headline was Gemini Spark — a personal AI agent integrated natively into Gmail, Drive, and Google Workspace, running on Gemini 3.5 Flash and the still-opaque Antigravity model. Security is addressed explicitly: isolated VMs per task on Google Cloud, Data Loss Prevention policies, encrypted credentials. The product direction is clear: the future of AI assistants isn’t a separate chat window, it’s an agent that acts within your existing workflow without requiring you to context-switch out of your tools.

Read the full article →


From Token Streams to Agent Streams

Source: LangChain  ·  Published: May 21, 2026  ·  evergreen

LangChain and LangGraph are moving beyond token streaming to agent event streaming: typed events, scoped subscriptions per subagent, multimodal outputs, and improved frontend resilience. These additions matter for production agent applications, where you need to know not just what the model is generating, but which agent is generating it, in response to what event, and whether a downstream component can handle the output. Moving from token streams to agent event streams is an architectural shift that unlocks real-time observability into multi-agent workflows.

Read the full article →


We built SmithDB, the data layer for agent observability

Source: LangChain  ·  Published: May 15, 2026  ·  evergreen

LangSmith’s SmithDB is a distributed database purpose-built for agent observability — designed to store and query the trace data generated by running agents at scale. The claimed performance improvement is 12x over previous solutions, with full data portability. Observability is one of the practical bottlenecks in agentic deployments: agents generate dense, nested traces, and querying them at speed matters for debugging and monitoring in production. A dedicated data layer rather than a general-purpose database with an agent schema is a sound architectural choice at this volume.

Read the full article →


Get Real-Time Visibility into GPU Usage Across Kubernetes Clusters

Source: NVIDIA Developer  ·  Published: May 21, 2026  ·  evergreen

Real-time GPU visibility across Kubernetes clusters addresses a practical gap: many teams running AI workloads can’t reliably track who’s consuming GPU capacity, how much memory is in use, or whether pods are pending or idle. NVIDIA Developer walks through the tooling to instrument this properly. The stakes are high — GPU clusters are expensive, and systematic underutilization is a direct cost. Getting the observability layer right isn’t infrastructure overhead; it’s a prerequisite for responsible AI infrastructure management.

Read the full article →


Meet Gordon: Docker’s AI Agent For Your Entire Container Workflow

Source: Docker  ·  Published: May 19, 2026  ·  critique

Docker’s Gordon is an AI agent embedded directly in Docker Desktop 4.74 and the CLI, capable of reading your logs, images, and Compose files to provide context-specific troubleshooting without you having to copy-paste output into a separate interface. Available for free with any Docker account. The broader significance: Gordon understands your environment, not just your prompts. For container workflows, that contextual awareness closes a real gap between what a generic coding assistant can do and what an operator in a running environment actually needs.

Read the full article →


Google just redesigned the search box for the first time in 25 years — here’s why it matters more than you think.

Source: VentureBeat AI  ·  Published: May 19, 2026  ·  evergreen

Google’s search box — unchanged since 1998 — is getting its first significant redesign, announced at Google I/O. The new interface is an adaptive input field that handles text, images, and video, with native integration of AI Overviews and AI Mode. Rather than running AI Mode as a parallel product, Google is merging the two interaction paradigms at the interface level. Whether users adapt naturally or experience this as disrupting a 25-year-old habit remains to be seen — but the intent to collapse “search” and “AI” into a single entry point is now explicit.

Read the full article →


The memory shortage is causing a repricing of consumer electronics

Source: Simon Willison  ·  Published: May 22, 2026  ·  evergreen

David Oks’s analysis of the HBM memory crunch is worth reading carefully. The share of wafer production allocated to high-bandwidth memory for AI GPUs is projected to jump from 2% to 20% by 2026. Since each GB of HBM consumes more than three times the silicon capacity of a GB of DDR or LPDDR, and with only three major memory manufacturers globally, the production available for consumer DRAM is being structurally squeezed. The downstream effect: consumer electronics — especially sub-$100 smartphones in emerging markets — will get more expensive or less capable. It’s the direct hardware externality of the AI datacenter boom.

Read the full article →


Deceptive Ads: Consumer Group Takes on Meta, TikTok, and Google

Source: Silicon.fr  ·  Published: May 21, 2026  ·  annonce

The BEUC report quantifies the problem: €4.2 billion stolen from European consumers in 2024 through fraudulent financial ads on Meta, TikTok, and Google. Over half of suspicious ad reports by investigators were dismissed or ignored — a direct violation of the Digital Services Act’s requirements for effective reporting and removal. Que Choisir and 30 European consumer organizations have filed a complaint with France’s Arcom demanding enforcement. The underlying question is whether the DSA has real teeth, or whether its requirements remain normative without consequence for platforms that systematically fail to comply.

Article in French.

Read the full article →


Similar Posts

Leave a Reply

Your email address will not be published. Required fields are marked *