InfoQ Homepage AI, ML & Data Engineering Content on InfoQ
-
Kimi's K2 Opensource Language Model Supports Dynamic Resource Availability and New Optimizer
Kimi released K2, a Mixture-of-Experts large language model with 32 billion activated parameters and 1.04 trillion total parameters, trained on 15.5 trillion tokens. The release introduces MuonClip, a new optimizer that builds on the Muon optimizer by adding a QK-clip technique designed to address training instability, which the team reports resulted in "zero loss spike" during pre-training.
-
Anthropic Adds Sandboxing and Web Access to Claude Code for Safer AI-Powered Coding
Anthropic released sandboxing capabilities for Claude Code and launched a web-based version of the tool that runs in isolated cloud environments. The company introduced these features to address security risks that arise when Claude Code writes, tests, and debugs code with broad access to developer codebases and files.
-
Google Unveils Project Suncatcher, Envisioning AI Models Running in Space
Google has unveiled Project Suncatcher, a research initiative exploring how solar powered satellite constellations equipped with Tensor Processing Units TPUs could one day enable large scale artificial intelligence computation in space.
-
KubeCon NA 2025 - Salesforce’s Approach to Self-Healing Using AIOps and Agentic AI
AIOps and Agentic AI technologies can help in developing solutions to intelligently analyze Kubernetes cluster health, automatically diagnose problems, and orchestrate issue resolutions with minimal human intervention. Vikram Venkataraman and Srikanth Rajan spoke at KubeCon + CloudNativeCon NA 2025 Conference about Salesforce’s approach to self-healing systems using AIOps and AI Agents.
-
New Claude Haiku 4.5 Model Promises Faster Performance at One-Third the Cost
Anthropic released Claude Haiku 4.5, making the model available to all users as its latest entry in the small, fast model category. The company positions the new model as delivering performance levels comparable to Claude Sonnet 4, which launched five months ago as a state-of-the-art model, but at "one-third the cost and more than twice the speed."
-
Anthropic Finds LLMs Can Be Poisoned Using Small Number of Documents
Anthropic's Alignment Science team released a study on poisoning attacks on LLM training. The experiments covered a range of model sizes and datasets, and found that only 250 malicious examples in pre-training data were needed to create a "backdoor" vulnerability. Anthropic concludes that these attacks actually become easier as models scale up.
-
CodeClash Benchmarks LLMs through Multi-Round Coding Competitions
Researchers from Standford, Princeton, and Cornell have developed a new benchmark to better evaluate coding abilities of large language models (LLMs). Called CodeClash, the new benchmark pits LLMs against each other in multi-round tournaments to assess their capacity to achieve competitive, high-level objectives beyond narrowly defined, task-specific problems.
-
Embedding Atlas: Apple’s Open-Source Tool for Exploring Large-Scale Embeddings Locally
Apple has introduced Embedding Atlas, a new open-source tool for visualizing and exploring large-scale embeddings interactively. Designed for researchers, data scientists, and developers, the platform provides a fast and intuitive way to analyze complex, high-dimensional data—from text embeddings to multimodal representations—without requiring any backend infrastructure or external data upload.
-
GitHub Expands Copilot Ecosystem with AgentHQ
GitHub has announced AgentHQ, a new addition to its platform that aims to unify the fragmented landscape of AI tools within the software development process.
-
Redis Critical Remote Code Execution Vulnerability Discovered after 13 Years
Redis recently released a security advisory regarding CVE-2025-49844. This critical (CVSS 10.0) use-after-free (UAF) vulnerability in Lua scripting could allow authenticated attackers to execute remote code on older versions of Redis and Valkey with Lua scripting enabled. Developers are urged to upgrade to patched releases as soon as possible.
-
Android GenAI Prompt API Enables Natural Language Requests with Gemini Nano
The ML Kit GenAI Prompt API, now available in alpha, enables Android developers to send natural language and multimodal requests to Gemini Nano running on-device, extending the text summarization and image description capabilities introduced with the initial GenAI release.
-
Meta and Hugging Face Launch OpenEnv, a Shared Hub for Agentic Environments
Meta's PyTorch team and Hugging Face have launched OpenEnv, an open-source platform for standardizing AI agent environments. The OpenEnv Hub features secure sandboxes that define the necessary tools and APIs for safe, predictable AI operation. Developers can explore, contribute, and refine environments, paving the way for scalable agent development in the open-source RL ecosystem.
-
Cursor 2.0 Expands Composer Capabilities for Context-Aware Development
Cursor has launched version 2.0 of its AI-driven code editor, featuring Composer, a new model that enables developers to write and modify code through natural language interaction.
-
Apple Releases Pico-Banana-400K Dataset to Advance Text-Guided Image Editing
Pico-Banana-400K is a curated dataset of 400,000 images developed by Apple researchers to make it easier to create text-guided image editing models. The images were generated using Google's Nano-Banana to modify real photographs from the Open Images collecion and were then filtered using Gemini-2.5-Pro based on their overall quality and prompt compliance.
-
QCon London 2026 Announces Tracks: AI Engineering, Building Teams, Tech of Finance, and More
The QCon London 2026 tracks are live: 15 practitioner-curated deep dives on AI adoption, resilient architectures, distributed systems, performance, modern languages, data, security, and Staff+ leadership, rooted in real production lessons.