InfoQ Homepage Artificial Intelligence Content on InfoQ

News

RSS Feed

Newer Older

AI, ML & Data Engineering

New IBM Granite 4 Models to Reduce AI Costs with Inference-Efficient Hybrid Mamba-2 Architecture

IBM recently announced the Granite 4.0 family of small language models. The model family aims to deliver faster speeds and significantly lower operational costs at acceptable accuracy vs. larger models. Granite 4.0 features a new hybrid Mamba/transformer architecture that largely reduces memory requirements, enabling Granite to run on significantly cheaper GPUs and at significantly reduced costs.

Bruno Couriol
on Nov 18, 2025
AI, ML & Data Engineering

KubeCon NA 2025 - Erica Hughberg and Alexa Griffith on Tools for the Age of GenAI

Generative AI technologies need to support new workloads, traffic patterns, and infrastructure demands and require a new set of tools for the age of GenAI. Erica Hughberg from Tetrate and Alexa Griffith from Bloomberg spoke last week at KubeCon + CloudNativeCon North America 2025 Conference about what it takes to build GenAI platforms capable of serving model inference at scale.

Srini Penchikala
on Nov 17, 2025
AI, ML & Data Engineering

Anthropic Adds Sandboxing and Web Access to Claude Code for Safer AI-Powered Coding

Anthropic released sandboxing capabilities for Claude Code and launched a web-based version of the tool that runs in isolated cloud environments. The company introduced these features to address security risks that arise when Claude Code writes, tests, and debugs code with broad access to developer codebases and files.

Vinod Goje
on Nov 14, 2025
AI, ML & Data Engineering

Google Unveils Project Suncatcher, Envisioning AI Models Running in Space

Google has unveiled Project Suncatcher, a research initiative exploring how solar powered satellite constellations equipped with Tensor Processing Units TPUs could one day enable large scale artificial intelligence computation in space.

Daniel Dominguez
on Nov 14, 2025
AI, ML & Data Engineering

KubeCon NA 2025 - Salesforce’s Approach to Self-Healing Using AIOps and Agentic AI

AIOps and Agentic AI technologies can help in developing solutions to intelligently analyze Kubernetes cluster health, automatically diagnose problems, and orchestrate issue resolutions with minimal human intervention. Vikram Venkataraman and Srikanth Rajan spoke at KubeCon + CloudNativeCon NA 2025 Conference about Salesforce’s approach to self-healing systems using AIOps and AI Agents.

Srini Penchikala
on Nov 12, 2025
AI, ML & Data Engineering

New Claude Haiku 4.5 Model Promises Faster Performance at One-Third the Cost

Anthropic released Claude Haiku 4.5, making the model available to all users as its latest entry in the small, fast model category. The company positions the new model as delivering performance levels comparable to Claude Sonnet 4, which launched five months ago as a state-of-the-art model, but at "one-third the cost and more than twice the speed."

Vinod Goje
on Nov 12, 2025
AI, ML & Data Engineering

Embedding Atlas: Apple’s Open-Source Tool for Exploring Large-Scale Embeddings Locally

Apple has introduced Embedding Atlas, a new open-source tool for visualizing and exploring large-scale embeddings interactively. Designed for researchers, data scientists, and developers, the platform provides a fast and intuitive way to analyze complex, high-dimensional data—from text embeddings to multimodal representations—without requiring any backend infrastructure or external data upload.

Robert Krzaczyński
on Nov 08, 2025
AI, ML & Data Engineering

GitHub Expands Copilot Ecosystem with AgentHQ

GitHub has announced AgentHQ, a new addition to its platform that aims to unify the fragmented landscape of AI tools within the software development process.

Daniel Dominguez
on Nov 08, 2025
Culture & Methods

How AI with Prompt Engineering Supports Software Testing

AI is becoming a key QA tool, aiding in faster scenario generation, risk detection, and test planning. Arbaz Surti showed how effective prompting using roles, context, and output format helps to get clear, relevant, and actionable test scenarios. AI can boost testers, but human judgment is needed to ensure relevance and quality.

Ben Linders
on Nov 06, 2025
AI, ML & Data Engineering

Cursor 2.0 Expands Composer Capabilities for Context-Aware Development

Cursor has launched version 2.0 of its AI-driven code editor, featuring Composer, a new model that enables developers to write and modify code through natural language interaction.

Daniel Dominguez
on Nov 04, 2025
AI, ML & Data Engineering

QCon London 2026 Announces Tracks: AI Engineering, Building Teams, Tech of Finance, and More

The QCon London 2026 tracks are live: 15 practitioner-curated deep dives on AI adoption, resilient architectures, distributed systems, performance, modern languages, data, security, and Staff+ leadership, rooted in real production lessons.

Artenisa Chatziou
on Nov 03, 2025
AI, ML & Data Engineering

PyTorch Foundation Welcomes Ray, Unveils Monarch for Simplified Distributed AI

At the 2025 PyTorch Conference, the PyTorch Foundation unveiled new initiatives to advance open AI infrastructure, introducing frameworks like Ray and PyTorch Monarch for streamlined distributed workloads. Highlighting transparency in AI development, new projects from Stanford and AI2 aim to enhance reproducibility. The foundation is solidifying its role as a central hub for scalable AI solutions.

Hien Luu
on Oct 30, 2025
AI, ML & Data Engineering

Inside the Architectures Powering Modern AI Systems: QCon San Francisco 2025

Senior engineers face fast-moving AI adoption without clear patterns. QCon SF 2025 brings real-world lessons from teams at Netflix, Meta, Intuit, Anthropic & more, showing how to build reliable AI systems at scale. Early bird ends Nov 11.

Artenisa Chatziou
on Oct 30, 2025
Cloud

Amazon Timestream for InfluxDB Adds Support for InfluxDB 3 Core and Enterprise

InfluxData has launched InfluxDB 3 Core and Enterprise on Amazon Timestream, offering a high-speed, open-source time-series database for real-time applications. With enhanced security, scalability, and performance, developers can seamlessly integrate with AWS services. InfluxDB 3 redefines data management for AI-driven environments, enabling rapid analytics and decision-making.

Steef-Jan Wiggers
on Oct 30, 2025
DevOps

HashiCorp Previews “Agentic Infrastructure” Future with Project Infragraph

At its annual conference, HashiConf 2025, the now-IBM-owned HashiCorp, revealed a new strategic initiative: Project Infragraph, a real-time infrastructure graph designed to underpin an era of agent-driven automation for hybrid clouds.

Craig Risi
on Oct 28, 2025

Newer News

Older News

InfoQ Software Architects' Newsletter

News