InfoQ Homepage News
-
Gemma 4 Multi-Token Prediction Delivers Up to ~3x Faster Token Generation
Gemma 4 can be paired with multi-token prediction (MTP) drafters that use speculative decoding to generate multiple tokens in parallel, allowing the model to verify them in a single pass and achieve up to ~3× faster inference without quality loss.
-
NodeJS Proposes Built-In Virtual File System, Sparking Debate Over AI-Generated Contributions
Matteo Collina has proposed a Virtual File System (VFS) for Node.js core through the node:vfs module. The proposal includes about 19,000 lines of code and addresses common workflow challenges. While it has community support, concerns have arisen regarding the use of AI in its development, prompting debates about its implications for code verification and necessity in the Node.js ecosystem.
-
OpenJDK News Roundup: Vector API, Compact Object Headers and G1GC as Default in JDK 27
There was a flurry of activity in the OpenJDK ecosystem during the week of May 18th, 2026, highlighting three JEPs elevated from Proposed to Target to Targeted and three JEPs elevated from Candidate to Proposed to Target for JDK 27. The proposed release schedule has also been finalized.
-
Google Introduces Middleware Architecture for Genkit Applications
Google has introduced Middleware for Genkit, its open-source framework for building AI-powered and agentic applications. The update adds a programmable interception layer around model calls, tool execution, and generation loops, giving developers more control over reliability, safety, and orchestration inside production AI systems.
-
AWS MCP Server Reaches GA with Full API Coverage and IAM-Based Governance
AWS has recently made its managed Model Context Protocol (MCP) server generally available, giving AI coding agents controlled access to AWS APIs, documentation, and operational workflows through a standard interface. It provides a safer and more auditable way to connect AI agents to AWS services without handing over broad credentials.
-
Google Cloud Introduces Cross-Engine Iceberg Support in BigQuery
At the Apache Iceberg Summit last month, Google announced new interoperability features for Apache Iceberg in BigQuery. The preview of the serverless Iceberg REST catalog lets teams create, update, and query the same Apache Iceberg tables in BigQuery and in engines like Spark, Flink, and Trino without duplicating data.
-
Uber Improves Restaurant Recommendations Using Real-Time Signals and Listwise Ranking
Uber updates its Uber Eats Home Feed recommendation system using near real-time user sequence features and a Generative Recommender model. The system evolves from hand-crafted features to transformer-based sequence modeling, reduces feature freshness from 24 hours to seconds, and shifts from pointwise scoring to listwise GenRec for improved contextual ranking and real-time personalization.
-
InfoQ Launches Online AI Engineering Cohort and Certification for Senior Software Practitioners
InfoQ has launched a five-week online AI Engineering certification for senior practitioners working on production AI systems, covering RAG, agents, AI platforms, evals, reliability, and operational trade-offs.
-
Discord Rebuilds Database Operations Around Automation to Manage ScyllaDB at Massive Scale
Discord has detailed how it rebuilt its database operations around a new internal orchestration framework called the Scylla Control Plane (SCP), enabling its small infrastructure team to automate large-scale ScyllaDB cluster management tasks that previously took days of manual work.
-
xAI Releases Grok Skills and Updates Tool Calling Responses API
xAI has released Grok Skills together with enhancements to the Responses API for Grok 4.3, enabling persistent custom expertise that the model retains across all conversations.
-
Cloudflare Completes Its Agent Infrastructure Stack with Browser Run Rebuild and Six-Layer Platform
Cloudflare rebuilt Browser Run on its own Containers platform, delivering 4x higher concurrency and 50% faster response times. The upgrade completes a six-layer agent infrastructure stack: compute (Dynamic Workers + Sandboxes), orchestration (Dynamic Workflows), memory (Agent Memory), browsing (Browser Run), and commerce (Stripe Projects).
-
Bintrail: MySQL Time-Travel Queries Using Indexed Binlogs
Bintrail is a recently introduced layer that brings point-in-time queries and row-history lookups to MySQL, the only major relational database lacking native temporal querying. Using indexed binlogs behind ProxySQL and without modifying MySQL or application code, Bintrail supports querying data as of a past timestamp and reviewing change history, primarily for recovery and audit scenarios.
-
Six Sessions at QCon AI Boston 2026 That Take Productionizing AI Seriously
QCon AI Boston 2026 is close to selling out. Discover six sessions where speakers engage directly with the gap between AI working in a demo and AI working in production.
-
How Platform Engineering Using Golden Bricks Can Enable Fast and Smooth Delivery
Platform engineering should have a product focus, as developers are customers; they must provide composable, self-service capabilities, golden bricks rather than rigid golden paths, so teams can move quickly while maintaining consistency. Success is measured through adoption, developer experience, and business outcomes such as deployment frequency and change failure rate.
-
With Android CLI, Google is Making the Android Toolchain Agent-Friendly
Google introduced new Android development tools that enable building apps up to 3x faster by using AI agents, including a redesigned Android command-line interface (CLI), structured skills", and an integrated knowledge base. These tools are designed to support agent-driven workflows and are compatible with third-party agents such as Claude Code and Codex, in addition to Google Gemini.