InfoQ Homepage Large language models Content on InfoQ
-
State Space Models Can Enable AI in Low-Power Edge Computing
At the the 2025 Embedded Vision Summit, Tony Lewis, chief technology officer at BrainChip, presented research done by his company into state space models (SSMs) and how they can provide LLM capabilities with very low power consumption in limited computing environments, such as those found on dashcams, medical devices, security cameras, and even toys.
-
Perplexity Launches Comet: a Browser Designed around AI-Assisted Interaction
Perplexity has introduced Comet, a new web browser designed to integrate natural language interaction directly into the browsing experience. Unlike conventional browsers built around navigation and search, Comet aims to support users in research, comparison, and task execution by combining browsing with persistent context and AI assistance.
-
Amazon Launches Bedrock AgentCore for Enterprise AI Agent Infrastructure
Amazon announced the preview of Amazon Bedrock AgentCore, a collection of enterprise-grade services that help developers deploy and operate AI agents at scale across frameworks and foundation models. The platform addresses infrastructure challenges developers face when building production AI agents.
-
Wix Adds Chaos to CI/CD Pipelines with AI and Improves Reliability
Cloud-based web development service Wix has written about a new approach to integrating artificial intelligence into continuous integration and continuous deployment (CI/CD) systems. In a blog post, Wix demonstrates how probabilistic AI can coexist with deterministic development processes, adding chaos without compromising reliability.
-
Inaugural MCP Dev Summit Charts AI Integration's Future
Developers and contributors of the Model Context Protocol (MCP) converged in San Francisco in May 2025 for their first developer summit, charting the future of this rapidly adopted open standard to enable seamless integration between LLM applications and external data sources and tools. Discussions focused on a roadmap for MCP, including critical enterprise features.
-
Docker Expands Compose for Agent Development and Ties in Cloud Offload Support
Docker launched a new feature to let developers define, build, and run agents using Docker Compose, with the aim to streamline agent development process and reduce repetitive tasks. Additionally, Docker Offload, now in beta, provides a way to seamlessly offload building and running models to remote GPU compute.
-
Anthropic Introduces Economic Futures Program to Address the Economic Impact of AI
Anthropic has announced the launch of its Economic Futures Program, an initiative designed to address the economic impact of AI.
-
Docker Desktop 4.43 Expands Model Runner and Brings New Compose-Kubernetes Bridge
Following the introduction of Model Runner a few months ago, Docker Desktop 4.43 expands its capabilities with improved model management and broader OpenAI compatibility. The release also debuts a new Compose Bridge to simplify the generation of Kubernetes configurations and upgrade the Gordon AI agent.
-
LM Studio 0.3.17 Adds Model Context Protocol (MCP) Support for Tool-Integrated LLMs
LM Studio has released version 0.3.17, introducing support for the Model Context Protocol (MCP) — a step forward in enabling language models to access external tools and data sources. Originally developed by Anthropic, MCP defines a standardized interface for connecting LLMs to services such as GitHub, Notion, or Stripe, enabling more powerful, contextual reasoning.
-
Gemma 3n Introduces Novel Techniques for Enhanced Mobile AI Inference
Launched in early preview last May, Gemma 3n is now officially available. It targets mobile-first, on-device AI applications, using new techniques designed to increase efficiency and improve performance, such as per-layer embeddings and transformer nesting.
-
Google Launches Gemini CLI: Open-Source Terminal AI Agent for Developers
Google has released Gemini CLI, a new open-source AI command-line interface that brings the full capabilities of its Gemini 2.5 Pro model directly into developers’ terminals. Designed for flexibility, transparency, and developer-first workflows, Gemini CLI provides high-performance, natural language AI assistance through a lightweight, locally accessible interface.
-
Experiences from Using AI as a Software Architect
Artificial intelligence excels at refining language and processing large text volumes, but lacks human-like contextual reasoning and emotional intelligence, Avraham Poupko said. Many human traits come into play when doing software architecture. As an architect, he suggests using AI for exploring tradeoffs and refining language with clarity and precision.
-
Google's Agent2Agent Protocol Enters the Linux Foundation
Recently open-sourced by Google, the Agent2Agent protocol is now part of the Linux Foundation, along with its accompanying SDKs and developer tools.
-
Apple's Illusion of Thinking Paper Explores Limits of Large Reasoning Models
Apple Machine Learning Research published a paper titled "The Illusion of Thinking," which investigates the abilities of Large Reasoning Models (LRMs) on a set of puzzles. As the complexity of the puzzles increases, the researchers found that LRMs encounter a "collapse" threshold where the models reduce their reasoning effort, indicating a limit to the models' scalability.
-
Anthropic Upgrades App-Building Capabilities to Claude Artifacts
Anthropic has upgraded Claude with new app-building capabilities, allowing users to create, host, and share AI applications directly from text prompts. This functionality, known as Artifacts, enables users to build functional tools like data analyzers, flashcard generators, or study aids by simply describing their ideas.