InfoQ Homepage Large language models Content on InfoQ

News

RSS Feed

Newer Older

AI, ML & Data Engineering

Google Launched LangExtract, a Python Library for Structured Data Extraction from Unstructured Text

Google has introduced LangExtract, an open-source Python library designed to help developers extract structured information from unstructured text using large language models such as the Gemini models.

Daniel Dominguez
on Aug 08, 2025
AI, ML & Data Engineering

GLM-4.5 Launches with Strong Reasoning, Coding, and Agentic Capabilities

Zhipu AI has released GLM-4.5 and GLM-4.5-Air, two new AI models designed to handle reasoning, coding, and agent tasks within a single architecture. They use a dual-mode system to switch between complex problem-solving and faster responses, aiming to improve both accuracy and speed.

Robert Krzaczyński
on Aug 07, 2025
AI, ML & Data Engineering

OpenAI Launches Study Mode in ChatGPT to Support Step-by-Step Learning

OpenAI has introduced Study Mode in ChatGPT, a feature intended to guide users through problems in a step-by-step manner rather than supplying immediate answers. It uses interactive prompts, structured responses, and follow-up questions to encourage active engagement and support comprehension.

Robert Krzaczyński
on Aug 04, 2025
AI, ML & Data Engineering

Anthropic Proposes Transparency Framework to Safeguard Frontier AI Development

Anthropic has proposed a new transparency framework designed to address the growing need for accountability in the development of frontier AI models. This proposal focuses on the largest AI companies that are developing powerful AI models, distinguished by factors such as computing power, cost, evaluation performance, and annual R&D expenditures.

Daniel Dominguez
on Jul 29, 2025
Mobile

Apple Shares Details on Upcoming AI Foundation Models for iOS 26

In a recent tech report, Apple has provided more details on the performance and characteristics of the new Apple Intelligence Foundation Models that will be part of iOS 26, as announced at the latest WWDC 2025.

Sergio De Simone
on Jul 28, 2025
AI, ML & Data Engineering

Databricks Agent Bricks Automates Enterprise AI Development with TAO and ALHF Methods

Databricks introduced Agent Bricks, a new product that changes how enterprises develop domain-specific agents. The automated workflow includes generating task-specific evaluations and LLM judges for quality assessment, creating synthetic data that resembles customer data to supplement agent learning, and searching across optimization techniques to refine agent performance.

Vinod Goje
on Jul 28, 2025
AI, ML & Data Engineering

Qwen Team Releases Qwen3-Coder, a Large Agentic Coding Model with Open Tooling

Qwen Team has announced Qwen3-Coder, a new family of agentic code models designed for long-context, multi-step programming tasks. The most capable variant, Qwen3-Coder-480B-A35B-Instruct, is a Mixture-of-Experts model with a total of 480 billion parameters and 35 billion active parameters per forward pass.

Robert Krzaczyński
on Jul 26, 2025
Architecture & Design

Google Apigee Adds Built-in LLM Governance with Model Armor

Google Cloud has launched the public preview of Model Armor, a native LLM governance framework integrated into the Apigee API management platform. Detailed in a community post, Model Armor introduces out-of-the-box enforcement for LLM-specific policies such as prompt validation, output filtering, and token-level controls at the API layer.

Leela Kumili
on Jul 25, 2025
Architecture & Design

State Space Models Can Enable AI in Low-Power Edge Computing

At the the 2025 Embedded Vision Summit, Tony Lewis, chief technology officer at BrainChip, presented research done by his company into state space models (SSMs) and how they can provide LLM capabilities with very low power consumption in limited computing environments, such as those found on dashcams, medical devices, security cameras, and even toys.

Patrick Farry
on Jul 24, 2025
AI, ML & Data Engineering

Perplexity Launches Comet: a Browser Designed around AI-Assisted Interaction

Perplexity has introduced Comet, a new web browser designed to integrate natural language interaction directly into the browsing experience. Unlike conventional browsers built around navigation and search, Comet aims to support users in research, comparison, and task execution by combining browsing with persistent context and AI assistance.

Robert Krzaczyński
on Jul 23, 2025
AI, ML & Data Engineering

Amazon Launches Bedrock AgentCore for Enterprise AI Agent Infrastructure

Amazon announced the preview of Amazon Bedrock AgentCore, a collection of enterprise-grade services that help developers deploy and operate AI agents at scale across frameworks and foundation models. The platform addresses infrastructure challenges developers face when building production AI agents.

Vinod Goje
on Jul 22, 2025
DevOps

Wix Adds Chaos to CI/CD Pipelines with AI and Improves Reliability

Cloud-based web development service Wix has written about a new approach to integrating artificial intelligence into continuous integration and continuous deployment (CI/CD) systems. In a blog post, Wix demonstrates how probabilistic AI can coexist with deterministic development processes, adding chaos without compromising reliability.

Matt Saunders
on Jul 20, 2025
AI, ML & Data Engineering

Inaugural MCP Dev Summit Charts AI Integration's Future

Developers and contributors of the Model Context Protocol (MCP) converged in San Francisco in May 2025 for their first developer summit, charting the future of this rapidly adopted open standard to enable seamless integration between LLM applications and external data sources and tools. Discussions focused on a roadmap for MCP, including critical enterprise features.

Hien Luu
on Jul 17, 2025
DevOps

Docker Expands Compose for Agent Development and Ties in Cloud Offload Support

Docker launched a new feature to let developers define, build, and run agents using Docker Compose, with the aim to streamline agent development process and reduce repetitive tasks. Additionally, Docker Offload, now in beta, provides a way to seamlessly offload building and running models to remote GPU compute.

Sergio De Simone
on Jul 14, 2025
AI, ML & Data Engineering

Anthropic Introduces Economic Futures Program to Address the Economic Impact of AI

Anthropic has announced the launch of its Economic Futures Program, an initiative designed to address the economic impact of AI.

Daniel Dominguez
on Jul 11, 2025

Newer News

Older News

InfoQ Software Architects' Newsletter

News