InfoQ Homepage News
-
Target Improves Add to Cart Interactions by 11 Percent with Generative AI Recommendations
Target has deployed GRAM, a GenAI-powered accessory recommendation system for the Home category, using large language models to prioritize product attributes and capture aesthetic cohesion. The system helps shoppers find compatible accessories, integrates human-in-the-loop curation, and achieved measurable improvements in engagement and conversion.
-
Toad: A Unified CLI Tool for All Your LLMs That Promises Improved UX From Existing Ones
During his sabbatical, Will McGugan, maker of Rich and Textual( frameworks for making Textual User Interfaces (TUI)), put his UI skills to work to build Toad. The newly publicly released tool aims to provide a unified, “beautiful” GUI for multiple coding agents in your terminal, accessible via the same tool via the Agent Communication Protocol (ACP).
-
Neptune Combines AI‑Assisted Infrastructure as Code and Cloud Deployments
Now available in beta, Neptune is a conversational AI agent designed to act like an AI platform engineer, handling the provisioning, wiring, and configuration of the cloud services needed to run a containerized app. Neptune is both language and cloud-agnostic, with support for AWS, GCP, and Azure.
-
Meta Details GEM Ads Model Using LLM-Scale Training, Hybrid Parallelism, and Knowledge Transfer
Meta released details about its Generative Ads Model (GEM), a foundation model designed to improve ads recommendation across its platforms. The model addresses core challenges in recommendation systems (RecSys) by processing billions of daily user-ad interactions where meaningful signals such as clicks and conversions are very sparse.
-
Java News Roundup: GlassFish, TornadoVM, Spring Shell, WildFly, Hibernate, Kotlin
This week's Java roundup for December 15th, 2025, features news highlighting: the fifteenth milestone release of GlassFish 8.0; the first release candidate of Spring Shell 4.0; point releases of TornadoVM, Hibernate Reactive, Hibernate Search and Kotlin; the first beta release of WildFly 39; and maintenance releases of Micronaut Helidon and Vert.x.
-
IBM Research Introduces CUGA, an Open-Source Configurable Agent Framework on Hugging Face
IBM Research has released CUGA (Configurable Generalist Agent) on Hugging Face Spaces, making its enterprise-oriented agent framework easier to evaluate with open models and real workflows. The move positions CUGA as a practical alternative to brittle, tightly coupled agent frameworks that often struggle with tool misuse, long-horizon reasoning, and recovery from failure.
-
AWS Launches ECS Express Mode to Simplify Containerised Application Deployment
AWS has released Amazon ECS Express Mode, bringing a simplified process to deploying containerised web applications and APIs. Express Mode lets users deploy production-ready services in one shot, bypassing the usual detail required around ancillary requirements such as IAM roles, load-balancers and scaling.
-
QConAI NY 2025 - Designing AI Platforms for Reliability: Tools for Certainty, Agents for Discovery
Aaron Erickson at QCon AI NYC 2025 emphasized treating agentic AI as an engineering challenge, focusing on reliability through the blend of probabilistic and deterministic systems. He argued for clear operational structures to minimize risks and optimize performance, highlighting the importance of specialized agents and deterministic paths to enhance accuracy and control in AI workflows.
-
Google Metrax Brings Predefined Model Evaluation Metrics to JAX
Recently open-sourced by Google, Metrax is a JAX library providing standardized, performant metrics implementations for classification, regression, NLP, vision, and audio models.
-
AWS Introduces Regional Availability for NAT Gateway
AWS has recently introduced regional availability for the managed NAT Gateway service. The new capability allows developers to create a single NAT Gateway that automatically spans multiple availability zones (AZs) in a VPC, providing high availability, eliminating the need to define separate gateways and public subnets in each zone.
-
Decathlon Switches to Polars to Optimize Data Pipelines and Infrastructure Costs
Decathlon, one of the world's leading sports retailers, recently shared why it adopted the open source library Polars to optimize its data pipelines. The Decathlon Digital team found that migrating from Apache Spark to Polars for small input datasets provides significant speed and cost savings.
-
AWS Expands Well-Architected Framework with Responsible AI and Updated ML and Generative AI Lenses
At AWS re:Invent 2025, AWS expanded its Well-Architected Framework with a new Responsible AI Lens and updated Machine Learning and Generative AI Lenses. The updates provide guidance on governance, bias mitigation, scalable ML workflows, and trustworthy AI system design across the full AI lifecycle.
-
oRPC Releases Version 1.0 with OpenAPI Support and End-to-End Type Safety
Introducing oRPC 1.0, a cutting-edge TypeScript library for building typesafe APIs, offering a stable, production-ready solution with full OpenAPI integration. Key features include enterprise-grade type safety, complex type support, and seamless integration with popular frameworks. With superior performance and comprehensive migration guides, oRPC emerges as a choice for modern API development.
-
QCon AI New York 2025: AI Platform Scaling at LinkedIn
At QCon AI NY 2025, LinkedIn's Prince Valluri and Karthik Ramgopal unveiled an internal platform for AI agents, prioritizing execution over intelligence. By using structured specifications within a robust orchestration layer, they enhance agent observability and interoperability while ensuring human accountability.
-
Pinterest Engineering Reduces Android CI Build Times by 36% with Runtime-Aware Sharding
Pinterest published a technical case study detailing how its engineering team cut Android end-to-end (E2E) continuous integration (CI) build times by more than 36 percent by adopting a runtime-aware test-sharding strategy and building an internal testing platform.