InfoQ Homepage Large language models Content on InfoQ

News

RSS Feed

Newer Older

AI, ML & Data Engineering

Anthropic’s Claude Opus 4.1 Improves Refactoring and Safety, Scores 74.5% SWE-bench Verified

Anthropic has launched Claude Opus 4.1, an update that strengthens coding reliability in multi-file projects and improves reasoning across long interactions. The model also raised its SWE-bench Verified score to 74.5%, up from 72.5%. Building on Opus 4, the new version strengthens Claude’s ability to act as a coding assistant, particularly in multi-file contexts.

Hien Luu
on Aug 28, 2025
AI, ML & Data Engineering

Qwen Team Open Sources State-of-the-Art Image Model Qwen-Image

Qwen Team recently open sourced Qwen-Image, an image foundation model. Qwen-Image supports text-to-image (T2I) generation and text-image-to-image (TI2I) editing tasks, and outperforms other models on a variety of benchmarks.

Anthony Alford
on Aug 26, 2025
AI, ML & Data Engineering

Claude Sonnet 4 Expands to 1 Million Token Context Window

Anthropic has upgraded Claude Sonnet 4 to support a context length of up to 1 million tokens, a fivefold increase over its previous limit. The feature, now in public beta, is accessible through the Anthropic API and Amazon Bedrock, with Google Cloud’s Vertex AI support expected soon.

Robert Krzaczyński
on Aug 22, 2025
AI, ML & Data Engineering

DeepMind Launches Genie 3, a Text-to-3D Interactive World Model

DeepMind has introduced Genie 3, the latest version of its “world model” framework for generating interactive 3D environments directly from text prompts.

Daniel Dominguez
on Aug 18, 2025
Architecture & Design

Unsloth Tutorials Aim to Make it Easier to Compare and Fine-tune LLMs

In a recent Reddit post, Unsloth published comprehensive tutorials of all of the open models they support. The tutorials can be used to compare the models’ strengths and weaknesses, as well as their performance benchmarks.

Patrick Farry
on Aug 16, 2025
AI, ML & Data Engineering

Anthropic Investigates How Large Language Models Develop a Character

Recent research by Anthropic engineers explores identifiable patterns of activity that seems to give rise to an emerging personality. These traits, known as persona vectors, help explain how a model's personality shifts over its lifecycle and lay the groundwork for better controlling those changes.

Sergio De Simone
on Aug 12, 2025
AI, ML & Data Engineering

Google Launched LangExtract, a Python Library for Structured Data Extraction from Unstructured Text

Google has introduced LangExtract, an open-source Python library designed to help developers extract structured information from unstructured text using large language models such as the Gemini models.

Daniel Dominguez
on Aug 08, 2025
AI, ML & Data Engineering

GLM-4.5 Launches with Strong Reasoning, Coding, and Agentic Capabilities

Zhipu AI has released GLM-4.5 and GLM-4.5-Air, two new AI models designed to handle reasoning, coding, and agent tasks within a single architecture. They use a dual-mode system to switch between complex problem-solving and faster responses, aiming to improve both accuracy and speed.

Robert Krzaczyński
on Aug 07, 2025
AI, ML & Data Engineering

OpenAI Launches Study Mode in ChatGPT to Support Step-by-Step Learning

OpenAI has introduced Study Mode in ChatGPT, a feature intended to guide users through problems in a step-by-step manner rather than supplying immediate answers. It uses interactive prompts, structured responses, and follow-up questions to encourage active engagement and support comprehension.

Robert Krzaczyński
on Aug 04, 2025
AI, ML & Data Engineering

Anthropic Proposes Transparency Framework to Safeguard Frontier AI Development

Anthropic has proposed a new transparency framework designed to address the growing need for accountability in the development of frontier AI models. This proposal focuses on the largest AI companies that are developing powerful AI models, distinguished by factors such as computing power, cost, evaluation performance, and annual R&D expenditures.

Daniel Dominguez
on Jul 29, 2025
Mobile

Apple Shares Details on Upcoming AI Foundation Models for iOS 26

In a recent tech report, Apple has provided more details on the performance and characteristics of the new Apple Intelligence Foundation Models that will be part of iOS 26, as announced at the latest WWDC 2025.

Sergio De Simone
on Jul 28, 2025
AI, ML & Data Engineering

Databricks Agent Bricks Automates Enterprise AI Development with TAO and ALHF Methods

Databricks introduced Agent Bricks, a new product that changes how enterprises develop domain-specific agents. The automated workflow includes generating task-specific evaluations and LLM judges for quality assessment, creating synthetic data that resembles customer data to supplement agent learning, and searching across optimization techniques to refine agent performance.

Vinod Goje
on Jul 28, 2025
AI, ML & Data Engineering

Qwen Team Releases Qwen3-Coder, a Large Agentic Coding Model with Open Tooling

Qwen Team has announced Qwen3-Coder, a new family of agentic code models designed for long-context, multi-step programming tasks. The most capable variant, Qwen3-Coder-480B-A35B-Instruct, is a Mixture-of-Experts model with a total of 480 billion parameters and 35 billion active parameters per forward pass.

Robert Krzaczyński
on Jul 26, 2025
Architecture & Design

Google Apigee Adds Built-in LLM Governance with Model Armor

Google Cloud has launched the public preview of Model Armor, a native LLM governance framework integrated into the Apigee API management platform. Detailed in a community post, Model Armor introduces out-of-the-box enforcement for LLM-specific policies such as prompt validation, output filtering, and token-level controls at the API layer.

Leela Kumili
on Jul 25, 2025
Architecture & Design

State Space Models Can Enable AI in Low-Power Edge Computing

At the the 2025 Embedded Vision Summit, Tony Lewis, chief technology officer at BrainChip, presented research done by his company into state space models (SSMs) and how they can provide LLM capabilities with very low power consumption in limited computing environments, such as those found on dashcams, medical devices, security cameras, and even toys.

Patrick Farry
on Jul 24, 2025

Newer News

Older News

InfoQ Software Architects' Newsletter

News