InfoQ Homepage DevOps Content on InfoQ

News

RSS Feed

Newer Older

Cloud

AlloyDB Ships Proxy Models That Replace LLM Calls with Local Inference Inside the Database

Google shipped AlloyDB AI functions GA with a proxy model architecture that trains a lightweight local model from LLM outputs, then runs queries at database speed without external calls. Smart batching delivers 2,400x throughput improvement. The proxy model reaches 100,000 rows per second in preview, but benchmark numbers apply only to ai.if in internal testing.

Steef-Jan Wiggers
on Jul 09, 2026
Architecture & Design

AWS Details How One Customer Scaled to One Million Lambda Functions

AWS has outlined how ProGlove, an industrial-wearables manufacturer, was able to scale its SaaS platform to run more than one million AWS Lambda functions spread across thousands of dedicated customer accounts.

Matt Foster
on Jul 09, 2026
Architecture & Design

Airbnb Shares Architecture Behind Sitar-Agent Dynamic Configuration Sidecar for Kubernetes Services

Airbnb engineers detailed Sitar-agent, a Kubernetes sidecar for dynamic configuration delivery across tens of thousands of pods, processing updates several times per minute. The system was redesigned with Java, Amazon S3 snapshot bootstrapping, and a migration from Sparkey to SQLite to improve reliability, startup performance, and configuration availability at scale.

Leela Kumili
on Jul 08, 2026
DevOps

AWS Expands DevOps Agent with AI-Powered Release Management to Validate Code before Production

Amazon Web Services (AWS) has announced a major expansion of its AWS DevOps Agent, introducing new release management capabilities designed to assess code changes and autonomously test software before it reaches production.

Craig Risi
on Jul 07, 2026
DevOps

How HubSpot Scaled Semantic Search to 20 Billion Vectors

SaaS software vendor HubSpot has described how its semantic search platform grew from a proof of concept into an internal service that now manages more than 20 billion vectors across 38-plus teams. The company says the system now supports agents, RAG, and contact deduplication, and that the increase in agent usage has made retrieval quality and latency more important than before.

Matt Saunders
on Jul 07, 2026
Cloud

Cloudflare and AWS Embed x402 Agent Payments at the Edge

Cloudflare and AWS both implemented x402 stablecoin micropayments at their edge networks within two weeks. The open protocol under the Linux Foundation revives HTTP 402 for agent-to-service payments with sub-cent transaction costs. Coinbase reports 169 million transactions in year one. Enterprise tax and invoicing gaps remain unresolved.

Steef-Jan Wiggers
on Jul 06, 2026
Cloud

Claude Reaches GA on Microsoft Foundry: European Enterprises Cannot Deploy It

Claude models reached GA on Microsoft Foundry with Azure-native billing and governance, but no European data zone exists. Anthropic's own documentation confirms data residency guarantees apply to Bedrock and Vertex AI but not Foundry. European practitioners from banking and healthcare report the offering is unapproved for production.

Steef-Jan Wiggers
on Jul 05, 2026
Cloud

Cycle Introduces EU Control Plane as Sovereignty Debate Continues

Cycle recently introduced a separate EU-based control plane, allowing European customers to keep platform management data and telemetry within Europe. The new offering is designed to improve compliance, operational isolation, and responsiveness for European organizations.

Renato Losio
on Jul 04, 2026
DevOps

OpenTelemetry Graduates to CNCF's Highest Maturity Level

The Cloud Native Computing Foundation (CNCF) has announced the graduation of OpenTelemetry, elevating the project to the foundation's highest level of maturity and formally recognizing it as production-ready for enterprise use.

Craig Risi
on Jul 03, 2026
Cloud

Oracle Quietly Halves Free Tier Ampere A1 Compute Limits with No Public Announcement

Oracle halved the Always Free Ampere A1 compute allowance from 4 OCPUs and 24 GB RAM to 2 OCPUs and 12 GB RAM with no public announcement. Support agents gave conflicting answers on whether PAYG accounts are affected. Documentation states the new limits apply to "all tenancies" while support emails say only free-tier accounts.

Steef-Jan Wiggers
on Jul 03, 2026
Cloud

Apple Extends Private Cloud Compute to Google Cloud for the First Time

Apple chose Google Cloud to run Private Cloud Compute outside its own data centers for the first time, using NVIDIA Blackwell GPUs, Intel TDX, and Google's Titan chip. Apple maintains an independent append-only hardware ledger and dual-vendor attestation roots. AWS and Azure are not part of the collaboration.

Steef-Jan Wiggers
on Jul 02, 2026
DevOps

Microsoft Brings AI-Powered Vulnerability Remediation to Azure DevOps with Copilot Autofix

Microsoft has announced the limited public preview of Copilot Autofix for GitHub Advanced Security for Azure DevOps, extending AI-powered vulnerability remediation to teams using Azure Repos.

Craig Risi
on Jun 30, 2026
Cloud

AWS Launches Lambda MicroVMs for Isolated Agent and User Code Execution

AWS launched Lambda MicroVMs, a new serverless compute primitive that runs each user session or AI agent in its own Firecracker virtual machine with hardware-level isolation, snapshot-based rapid launch, and state preservation for up to eight hours. Reddit community analysis found the minimum setup costs $3.03/day, roughly 9x Fargate spot pricing.

Steef-Jan Wiggers
on Jun 30, 2026
Java

Eliya 25 Brings a JVM-Level Diagnostic Profile to OpenJDK 25 LTS

Asymm Systems has released Eliya 25.0.3, an OpenJDK 25 LTS distribution aimed at improving production diagnostics in Java environments. It consolidates several HotSpot features into an opt-in Production profile. Eliya is designed for teams needing reliable diagnostic data, especially in regulated settings. Future enhancements are planned for Phase 2.

A N M Bazlur Rahman
on Jun 29, 2026
DevOps

AI Tools Accelerates Coding, But Not Overall Software Delivery, GitLab Research Finds

GitLab's 2026 AI Accountability Report highlights an AI Paradox: although 78% of developers say they code faster, overall software delivery has not accelerated due to downstream testing and review bottlenecks and new challenges for enterprise governance and traceability.

Sergio De Simone
on Jun 29, 2026

Newer News

Older News

InfoQ Software Architects' Newsletter

News