-
CodeClash Benchmarks LLMs through Multi-Round Coding Competitions
Researchers from Stanford, Princeton, and Cornell have developed a new benchmark to better evaluate the coding abilities of large language models (LLMs). Called CodeClash, the benchmark pits LLMs against each other in multi-round tournaments to assess their capacity to achieve competitive, high-level objectives rather than narrowly defined, task-specific problems.
-
Dev Proxy v1.3.0: HAR File Generation, LLM Usage Tracking, and Enhanced API Testing Feature
Dev Proxy v1.3.0 has been released, introducing several new features aimed at improving debugging, testing, and API monitoring. The update includes two new plugins: HAR file generation for standardized network tracing, and OpenAI usage tracking for better visibility into AI-related costs. It also brings enhancements to permissions analysis, OpenAPI specification generation, and overall stability.
-
OpenJDK News Roundup: Vector API, Ahead-of-Time Object Caching, Prepare to Make Final Mean Final
There was a flurry of activity in the OpenJDK ecosystem during the week of November 3, 2025, highlighting three JEPs elevated from Proposed to Target to Targeted and three JEPs elevated from Candidate to Proposed to Target for JDK 26. The proposed release schedule has also been finalized.
-
Embedding Atlas: Apple’s Open-Source Tool for Exploring Large-Scale Embeddings Locally
Apple has introduced Embedding Atlas, a new open-source tool for visualizing and exploring large-scale embeddings interactively. Designed for researchers, data scientists, and developers, the platform provides a fast and intuitive way to analyze complex, high-dimensional data—from text embeddings to multimodal representations—without requiring any backend infrastructure or external data upload.
-
GitHub Expands Copilot Ecosystem with AgentHQ
GitHub has announced AgentHQ, a new addition to its platform that aims to unify the fragmented landscape of AI tools within the software development process.
-
Redis Critical Remote Code Execution Vulnerability Discovered after 13 Years
Redis recently released a security advisory regarding CVE-2025-49844. This critical (CVSS 10.0) use-after-free (UAF) vulnerability in Lua scripting could allow authenticated attackers to execute remote code on older versions of Redis and Valkey with Lua scripting enabled. Developers are urged to upgrade to patched releases as soon as possible.
-
AWS Launches Capabilities by Region Tool
AWS has launched "AWS Capabilities by Region," a tool that gives architects and developers visibility into service availability. Instead of checking manually, users can compare AWS services across regions interactively and plan deployments accordingly. The added transparency and automated capability checks are intended to simplify global projects and reduce delays.
-
Microsoft Moves Azure DevOps MCP Server from Preview to General Availability
Microsoft announced in October 2025 that its Azure DevOps MCP Server, a local Model Context Protocol (MCP) server designed to bring richer context to AI assistants like GitHub Copilot, has exited public preview and become generally available.
-
TanStack Start: A New Meta Framework Powered by React or SolidJS
TanStack Start v1 is a full-stack framework for React and Solid applications. Built on TanStack Router and Vite, it offers type-safe APIs, streaming SSR, and universal deployment. With its focus on performance, flexibility, and incremental adoption, TanStack Start is positioned as an alternative to Next.js.
-
Grails 7.0, Now under the Apache Software Foundation, Delivers Enhancements for Micronaut and GORM
The release of Apache Grails 7.0.0 delivers notable changes, including the ability to disable Micronaut auto configuration through the Grails plugin, improved reproducibility with implementations of GORM services, and the temporary removal of GORM for Neo4j until it is compatible with Grails 7.0 or 8.0. InfoQ spoke to Søren Berg Glasius and James Fredley about this release.
-
Android GenAI Prompt API Enables Natural Language Requests with Gemini Nano
The ML Kit GenAI Prompt API, now available in alpha, enables Android developers to send natural language and multimodal requests to Gemini Nano running on-device, extending the text summarization and image description capabilities introduced with the initial GenAI release.
-
Inside Uber’s Query Architecture: Simplifying Layers and Improving Observability
Uber rebuilt its Apache Pinot query architecture, replacing the Presto-based Neutrino system with a lightweight proxy called Cellar and Pinot’s Multi-Stage Engine Lite Mode. The redesign simplifies SQL execution, improves resource management, and ensures predictable performance for large-scale analytics workloads.
-
Grafana and GitLab Introduce Serverless CI/CD Observability Integration
In a move to streamline development workflows, Daniel Fritzgerald of Grafana Labs has published a new open-source solution that feeds GitLab CI/CD events into Grafana's observability stack via a serverless architecture.
-
How AI with Prompt Engineering Supports Software Testing
AI is becoming a key QA tool, aiding in faster scenario generation, risk detection, and test planning. Arbaz Surti showed how effective prompting that specifies a role, context, and output format helps produce clear, relevant, and actionable test scenarios. AI can support testers, but human judgment is still needed to ensure relevance and quality.
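As a rough illustration of the role, context, and output-format structure mentioned above, the sketch below assembles such a prompt in Python. The function name `build_test_prompt`, the section labels, and the sample checkout scenario are all hypothetical and are not taken from Surti's talk; the resulting string could be sent to any LLM of choice.

```python
def build_test_prompt(role: str, context: str, task: str, output_format: str) -> str:
    """Assemble a prompt from labelled sections, skipping any that are empty."""
    sections = [
        ("Role", role),
        ("Context", context),
        ("Task", task),
        ("Output format", output_format),
    ]
    # Each non-empty section becomes "Label:\ntext"; sections are separated by a blank line.
    return "\n\n".join(
        f"{label}:\n{text.strip()}" for label, text in sections if text.strip()
    )


# Hypothetical example values, for illustration only.
prompt = build_test_prompt(
    role="You are a senior QA engineer reviewing a web checkout flow.",
    context="The payment step accepts credit cards and vouchers; vouchers expire after 30 days.",
    task="List negative test scenarios for the payment step.",
    output_format="A numbered list; each item has a title, steps, and an expected result.",
)
print(prompt)
```

Keeping the sections separate makes it easy to vary one part, such as the output format, while reusing the same role and context across test-planning requests.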
-
Azure APIM Simplifies Event-Driven Architecture with Native Service Bus Policy
Microsoft's new feature in API Management (APIM) enables seamless messaging to Azure Service Bus, simplifying API connections in event-driven architectures. By using the send-service-bus-message policy, developers can easily route HTTP requests to Service Bus for asynchronous processing, enhancing integration, security, and control without additional components.