InfoQ Homepage Artificial Intelligence Content on InfoQ
-
Microsoft Open Sources Evals for Agent Interop Starter Kit to Benchmark Enterprise AI Agents
Microsoft's Evals for Agent Interop is an open-source starter kit that enables developers to evaluate AI agents in realistic work scenarios. It features curated scenarios, datasets, and an evaluation harness to assess agent performance across tools like email and calendars.
-
Microsoft Agent Framework RC Simplifies Agentic Development in .NET and Python
Microsoft has announced that the Microsoft Agent Framework has reached Release Candidate status for both .NET and Python. This milestone indicates that the API surface is stable and feature-complete for what is planned in version 1.0, setting the stage for an upcoming general availability release.
-
Google Brings its Developer Documentation into the Age of AI Agents
Google has announced the public preview of the Developer Knowledge API. It comes with a Model Context Protocol (MCP) server. This gives AI development tools a simple, machine-readable way to reach Google's official developer documentation.
-
AI "Vibe Coding" Threatens Open Source as Maintainers Face Crisis
Daniel Stenberg shut down cURL's bug bounty after AI submissions hit 20%. Mitchell Hashimoto banned AI code from Ghostty. Steve Ruiz closed all external PRs to tldraw. Economic research shows "vibe coding" weakens the user engagement that sustains open source. As developers delegate to AI agents, documentation visits and bug reports collapse—threatening the ecosystem's viability.
-
Anthropic Study: AI Coding Assistance Reduces Developer Skill Mastery by 17%
Anthropic research shows developers using AI assistance scored 17% lower on comprehension tests when learning new coding libraries, though productivity gains were not statistically significant. Those who used AI for conceptual inquiry scored 65% or higher, while those delegating code generation to AI scored below 40%.
-
Hugging Face Introduces Community Evals for Transparent Model Benchmarking
Hugging Face has launched Community Evals, a feature that enables benchmark datasets on the Hub to host their own leaderboards and automatically collect evaluation results from model repositories.
-
Leapwork Research Shows Why AI in Testing Still Depends on Reliability, Not Just Innovation
Leapwork recently released new research showing that while confidence in AI-driven software testing is growing rapidly, accuracy, stability, and ongoing manual effort remain decisive factors in how far teams are willing to trust automation.
-
Does AI Make the Agile Manifesto Obsolete?
Capgemini's Steve Jones argues AI agents building apps in hours have killed the Agile Manifesto, as its human-centric principles don't fit agentic SDLCs. While Forrester reports 95% still find Agile relevant, Kent Beck proposes "augmented coding" and AWS suggests "Intent Design" over sprint planning. The debate: Is Agile dead, or evolving for AI collaboration?
-
Teleport Launches Agentic Identity Framework to Secure AI Agents across Enterprise Infrastructure
Teleport recently unveiled the Teleport Agentic Identity Framework, a new AI-centered security model designed to help enterprises safely deploy autonomous and semi-autonomous AI agents across cloud and on-premises environments.
-
QCon Previews 20th Anniversary Conferences: Production AI, Resilience, and Staff+ Engineering
Celebrating its 20th anniversary, QCon’s 2026 conferences in London and San Francisco will focus on the engineering realities of agentic AI, resilient architectures, and platform ROI. The programs continue the series' two-decade tradition of practitioner-led content, curated by senior engineers from companies like Zoox, UBS, and LinkedIn.
-
Next Moca Releases Agent Definition Language as an Open Source Specification
Moca has open-sourced Agent Definition Language (ADL), a vendor-neutral specification intended to standardize how AI agents are defined, reviewed, and governed across frameworks and platforms. The project is released under the Apache 2.0 license and is positioned as a missing “definition layer” for AI agents, comparable to the role OpenAPI plays for APIs.
-
Cloudflare Demonstrates Moltworker, Bringing Self-Hosted AI Agents to the Edge
Cloudflare has introduced Moltworker, an open-source solution for running Moltbot—a self-hosted personal AI agent—on its Developer Platform, eliminating the need for local hardware, such as Mac minis. Rebranded from Clawdbot, Moltbot serves as a personal assistant in chat applications, integrating with AI models, browsers, and third-party tools while maintaining user control.
-
Vercel Introduces Skills.sh, an Open Ecosystem for Agent Commands
Vercel has released Skills.sh, an open-source tool designed to provide AI agents with a standardized way to execute reusable actions, or skills, through the command line.
-
Agent Trace: Cursor Proposes an Open Specification for AI Code Attribution
Cursor has published Agent Trace, a draft open specification aimed at standardizing how AI-generated code is attributed in software projects. Released as a Request for Comments (RFC), the proposal defines a vendor-neutral format for recording AI contributions alongside human authorship in version-controlled codebases.
-
Daggr Introduced as an Open-Source Python Library for Inspectable AI Workflows
The Gradio team has released Daggr, a new open-source Python library designed to simplify the construction and debugging of multi-step AI workflows. Daggr allows developers to define workflows programmatically in Python while automatically generating a visual canvas that exposes intermediate states, inputs, and outputs for each step in the pipeline.