Latest

Red Hat AI 3.4 Boosts Inference Speed 3x

Mason

12 May 2026 — 1 min read

At Red Hat Summit 2026, Red Hat unveiled AI 3.4, a major update claiming up to threefold inference acceleration through speculative decoding, alongside new model serving capabilities and an MCP gateway. The release deepens partnerships with NVIDIA, Voyager Technologies, and Nissan, targeting enterprise AI workloads spanning edge computing, automotive, and even orbital infrastructure.

Speculative decoding, a technique that generates multiple candidate tokens in parallel and verifies them against the target model, is the headline performance feature. Red Hat asserts this yields up to 3x faster inference without sacrificing accuracy, a critical gain for latency-sensitive enterprise applications. The update also introduces a dedicated model serving framework, agent management tools, and an MCP (Model Control Plane) gateway designed to orchestrate and monitor distributed AI inference fleets.

On the infrastructure side, AI 3.4 brings native support for NVIDIA's Blackwell architecture, enabling enterprises to leverage the latest GPU generation for large-scale model serving. A more unexpected deployment involves Voyager Technologies: RHEL 10.1 is now running on the International Space Station as an edge computing node, marking a pilot for space-based AI inference. Meanwhile, Red Hat's collaboration with Nissan targets a software-defined vehicle platform, suggesting the company is pushing its AI stack into automotive real-time decision systems. These partnerships underscore Red Hat's strategy of embedding open-source AI capabilities into non-traditional environments, from orbit to the assembly line.

Kimi K3 Launch: Open-Source Giant Shakes AI Landscape

Moonshot AI released Kimi K3, an open-source model with 2.8 trillion parameters and 100 million token context, delivering performance comparable to top-tier closed-source systems at a fraction of the cost. The release signals a strategic pivot in the AI arms race, where competitive advantage now hinges on cost efficiency

Microsoft Taps AWS as GitHub AI Agents Break SLAs

In an unprecedented move to stabilize its platform under a relentless deluge of AI coding agent traffic, Microsoft has quietly routed core GitHub operations through rival Amazon Web Services (AWS), following a series of crippling outages that saw availability dip below 99% in June and nine incidents in May alone.

Codenotary flags 210,000 risky AI agent actions daily

Codenotary's AgentMon platform now monitors over 3 million AI-agent interactions daily across enterprise clients, flagging approximately 210,000—or 7%—as potentially unsafe or non-compliant, a signal that runtime security gaps in production AI systems are far more widespread than previously recognized. According to the company, the vast

AWS shows agentic AI future in advertising at Cannes

AWS is returning to the Cannes Lions International Festival of Creativity this June with a hands-on activation called Rue Visionnaire, placing AI agents directly in the hands of advertisers. From June 22 to 26, 2026, attendees can guide these agents through the complete creative workflow—starting with ideation and progressing

Read more

Kimi K3 Launch: Open-Source Giant Shakes AI Landscape

Microsoft Taps AWS as GitHub AI Agents Break SLAs

Codenotary flags 210,000 risky AI agent actions daily

AWS shows agentic AI future in advertising at Cannes