InfoQ Homepage AI, ML & Data Engineering Content on InfoQ

News

RSS Feed

Newer Older

AI, ML & Data Engineering

Google Launches Code Wiki, an AI-Driven System for Continuous, Interactive Code Documentation

Google has introduced Code Wiki, a new platform designed to keep software documentation continuously synchronized with the code it describes. The system generates a structured wiki for each repository, automatically updates it after every change, and powers an integrated chat interface that understands the entire codebase.

Robert Krzaczyński
on Nov 21, 2025
AI, ML & Data Engineering

QConSF 2025 - Developing Claude Code at Anthropic at AI Speed

At QCon San Francisco 2025, Adam Wolff showcased Claude Code at Anthropic, where AI powers 90% of production code. With a focus on speed over planning, Claude Code's design evolved through experimentation, addressing challenges like Unicode issues and shell command bottlenecks. Discover successful iterations and lessons learned in real-time software development.

Andrew Hoblitzell
on Nov 20, 2025
AI, ML & Data Engineering

Google Announces Gemini 3

Google's Gemini 3, unveiled on November 18, 2025, sets a new standard for multimodal AI, integrating seamlessly across platforms like Search and Vertex AI. With capabilities for text, code, and rich media, it empowers both consumer and enterprise applications. Gemini 3 Pro and its advanced Deep Think mode enhance reasoning and task execution, revolutionizing workflows and analytics.

Andrew Hoblitzell
on Nov 20, 2025
AI, ML & Data Engineering

AI Agents Fail Manipulation Tests in Microsoft's Magentic Marketplace Simulation

Researchers at Microsoft, working in collaboration with Arizona State University, have introduced Magentic Marketplace, an open-source simulation environment designed to study how LLM-based agents behave in multi-agent economic systems. The platform addresses a growing need in AI research as autonomous agents gain capabilities in software development.

Vinod Goje
on Nov 20, 2025
Architecture & Design

AI-Generated Code Creates New Wave of Technical Debt, Report Finds

AI-generated code is “highly functional but systematically lacking in architectural judgment”, a new report from Ox Security has found. In a report released in late October called Army of Juniors: The AI Code Security Crisis, AI application security (AppSec) company Ox Security outlined 10 architecture and security anti-patterns that are commonly found in AI-generated code.

Patrick Farry
on Nov 18, 2025
AI, ML & Data Engineering

New IBM Granite 4 Models to Reduce AI Costs with Inference-Efficient Hybrid Mamba-2 Architecture

IBM recently announced the Granite 4.0 family of small language models. The model family aims to deliver faster speeds and significantly lower operational costs at acceptable accuracy vs. larger models. Granite 4.0 features a new hybrid Mamba/transformer architecture that largely reduces memory requirements, enabling Granite to run on significantly cheaper GPUs and at significantly reduced costs.

Bruno Couriol
on Nov 18, 2025
AI, ML & Data Engineering

KubeCon NA 2025 - Erica Hughberg and Alexa Griffith on Tools for the Age of GenAI

Generative AI technologies need to support new workloads, traffic patterns, and infrastructure demands and require a new set of tools for the age of GenAI. Erica Hughberg from Tetrate and Alexa Griffith from Bloomberg spoke last week at KubeCon + CloudNativeCon North America 2025 Conference about what it takes to build GenAI platforms capable of serving model inference at scale.

Srini Penchikala
on Nov 17, 2025
AI, ML & Data Engineering

Code Arena Launches as a New Benchmark for Real-World AI Coding Performance

LMArena has launched Code Arena, a new evaluation platform that measures AI models' performance in building complete applications instead of just generating code snippets. It emphasizes agentic behavior, allowing models to plan, scaffold, iterate, and refine code within controlled environments that replicate actual development workflows.

Robert Krzaczyński
on Nov 17, 2025
AI, ML & Data Engineering

Kimi's K2 Opensource Language Model Supports Dynamic Resource Availability and New Optimizer

Kimi released K2, a Mixture-of-Experts large language model with 32 billion activated parameters and 1.04 trillion total parameters, trained on 15.5 trillion tokens. The release introduces MuonClip, a new optimizer that builds on the Muon optimizer by adding a QK-clip technique designed to address training instability, which the team reports resulted in "zero loss spike" during pre-training.

Vinod Goje
on Nov 17, 2025
AI, ML & Data Engineering

Anthropic Adds Sandboxing and Web Access to Claude Code for Safer AI-Powered Coding

Anthropic released sandboxing capabilities for Claude Code and launched a web-based version of the tool that runs in isolated cloud environments. The company introduced these features to address security risks that arise when Claude Code writes, tests, and debugs code with broad access to developer codebases and files.

Vinod Goje
on Nov 14, 2025
AI, ML & Data Engineering

Google Unveils Project Suncatcher, Envisioning AI Models Running in Space

Google has unveiled Project Suncatcher, a research initiative exploring how solar powered satellite constellations equipped with Tensor Processing Units TPUs could one day enable large scale artificial intelligence computation in space.

Daniel Dominguez
on Nov 14, 2025
AI, ML & Data Engineering

KubeCon NA 2025 - Salesforce’s Approach to Self-Healing Using AIOps and Agentic AI

AIOps and Agentic AI technologies can help in developing solutions to intelligently analyze Kubernetes cluster health, automatically diagnose problems, and orchestrate issue resolutions with minimal human intervention. Vikram Venkataraman and Srikanth Rajan spoke at KubeCon + CloudNativeCon NA 2025 Conference about Salesforce’s approach to self-healing systems using AIOps and AI Agents.

Srini Penchikala
on Nov 12, 2025
AI, ML & Data Engineering

New Claude Haiku 4.5 Model Promises Faster Performance at One-Third the Cost

Anthropic released Claude Haiku 4.5, making the model available to all users as its latest entry in the small, fast model category. The company positions the new model as delivering performance levels comparable to Claude Sonnet 4, which launched five months ago as a state-of-the-art model, but at "one-third the cost and more than twice the speed."

Vinod Goje
on Nov 12, 2025
AI, ML & Data Engineering

Anthropic Finds LLMs Can Be Poisoned Using Small Number of Documents

Anthropic's Alignment Science team released a study on poisoning attacks on LLM training. The experiments covered a range of model sizes and datasets, and found that only 250 malicious examples in pre-training data were needed to create a "backdoor" vulnerability. Anthropic concludes that these attacks actually become easier as models scale up.

Anthony Alford
on Nov 11, 2025
AI, ML & Data Engineering

CodeClash Benchmarks LLMs through Multi-Round Coding Competitions

Researchers from Standford, Princeton, and Cornell have developed a new benchmark to better evaluate coding abilities of large language models (LLMs). Called CodeClash, the new benchmark pits LLMs against each other in multi-round tournaments to assess their capacity to achieve competitive, high-level objectives beyond narrowly defined, task-specific problems.

Sergio De Simone
on Nov 10, 2025

Newer News

Older News

InfoQ Software Architects' Newsletter

News