InfoQ Homepage Development Content on InfoQ
-
Cactus v1: Cross-Platform LLM Inference on Mobile with Zero Latency and Full Privacy
Cactus, a Y Combinator-backed startup, enables local AI inference to mobile phones, wearables, and other low-power devices through cross-platform, energy-efficient kernels and a native runtime. It delivers sub-50ms time-to-first-token for on-device inference, eliminates network latency, and defaults to complete privacy.
-
Python Workers Redux: Wasm Snapshots and Native uv Tooling
Cloudflare's latest advancements in Python Workers revolutionize serverless performance with near-instant cold starts, expanded package compatibility, and streamlined workflows via the uv package manager. By leveraging memory snapshots and WebAssembly, Cloudflare drastically reduces startup times, making Python a prime choice for AI and data science applications.
-
Nuxt Introduces Native Request Cancellation and Async Handler Extraction for Performance Gains
Nuxt 4.2 elevates the developer experience with native abort control for data fetching, improved error handling, and experimental TypeScript support. With a 39% reduction in bundle sizes and a streamlined app directory, this release enhances performance and project organization, positioning Nuxt as a leading choice for full-stack web applications built on Vue.js.
-
Target Improves Add to Cart Interactions by 11 Percent with Generative AI Recommendations
Target has deployed GRAM, a GenAI-powered accessory recommendation system for the Home category, using large language models to prioritize product attributes and capture aesthetic cohesion. The system helps shoppers find compatible accessories, integrates human-in-the-loop curation, and achieved measurable improvements in engagement and conversion.
-
Toad: a Unified CLI Tool for All Your LLMs That Promises Improved UX from Existing Ones
During his sabbatical, Will McGugan, maker of Rich and Textual( frameworks for making Textual User Interfaces (TUI)), put his UI skills to work to build Toad. The newly publicly released tool aims to provide a unified, “beautiful” GUI for multiple coding agents in your terminal, accessible via the same tool via the Agent Communication Protocol (ACP).
-
Java News Roundup: GlassFish, TornadoVM, Spring Shell, WildFly, Hibernate, Kotlin
This week's Java roundup for December 15th, 2025, features news highlighting: the fifteenth milestone release of GlassFish 8.0; the first release candidate of Spring Shell 4.0; point releases of TornadoVM, Hibernate Reactive, Hibernate Search and Kotlin; the first beta release of WildFly 39; and maintenance releases of Micronaut Helidon and Vert.x.
-
QConAI NY 2025 - Designing AI Platforms for Reliability: Tools for Certainty, Agents for Discovery
Aaron Erickson at QCon AI NYC 2025 emphasized treating agentic AI as an engineering challenge, focusing on reliability through the blend of probabilistic and deterministic systems. He argued for clear operational structures to minimize risks and optimize performance, highlighting the importance of specialized agents and deterministic paths to enhance accuracy and control in AI workflows.
-
Google Metrax Brings Predefined Model Evaluation Metrics to JAX
Recently open-sourced by Google, Metrax is a JAX library providing standardized, performant metrics implementations for classification, regression, NLP, vision, and audio models.
-
AWS Introduces Regional Availability for NAT Gateway
AWS has recently introduced regional availability for the managed NAT Gateway service. The new capability allows developers to create a single NAT Gateway that automatically spans multiple availability zones (AZs) in a VPC, providing high availability, eliminating the need to define separate gateways and public subnets in each zone.
-
AWS Expands Well-Architected Framework with Responsible AI and Updated ML and Generative AI Lenses
At AWS re:Invent 2025, AWS expanded its Well-Architected Framework with a new Responsible AI Lens and updated Machine Learning and Generative AI Lenses. The updates provide guidance on governance, bias mitigation, scalable ML workflows, and trustworthy AI system design across the full AI lifecycle.
-
oRPC Releases Version 1.0 with OpenAPI Support and End-to-End Type Safety
Introducing oRPC 1.0, a cutting-edge TypeScript library for building typesafe APIs, offering a stable, production-ready solution with full OpenAPI integration. Key features include enterprise-grade type safety, complex type support, and seamless integration with popular frameworks. With superior performance and comprehensive migration guides, oRPC emerges as a choice for modern API development.
-
QCon AI New York 2025: AI Platform Scaling at LinkedIn
At QCon AI NY 2025, LinkedIn's Prince Valluri and Karthik Ramgopal unveiled an internal platform for AI agents, prioritizing execution over intelligence. By using structured specifications within a robust orchestration layer, they enhance agent observability and interoperability while ensuring human accountability.
-
JEP 500: Java to Enforce Strict Final Field Immutability by Restricting Reflection
JEP 500 prepares the Java ecosystem for final field integrity in JDK 26, restricting deep reflection mutations. This crucial update aims to enhance safety and performance by closing a long-standing loophole, transitioning toward stricter encapsulation. Developers can now anticipate warnings when attempting these mutations, ensuring a reliable path for future optimizations.
-
Google Cloud Launches Managed MCP Support
Google Cloud's introduction of fully-managed Model Context Protocol (MCP) servers revolutionizes its API infrastructure, streamlining access for developers. This enterprise-ready solution enhances AI integration across services such as Google Maps and BigQuery while promoting wide-scale adoption. New tools ensure governance and security, and are currently in public preview.
-
QCon AI New York 2025: Moving Mountains: Migrating Legacy Code in Weeks instead of Years
David Stein, principal AI engineer at ServiceTitan, presented “Moving Mountains: Migrating Legacy Code in Weeks instead of Years” at QCon AI New York 2025. Stein demonstrated how migrations don’t have to be synonymous to “moving mountains” and introduced the concepts of the Principle of Acceleration and the Assembly Line Pattern.