InfoQ Homepage Large language models Content on InfoQ
-
DeepSeek Releases v3.1 Model with Hybrid Reasoning Architecture
DeepSeek has released version V3.1 of its large language model, introducing a hybrid architecture that combines thinking and non-thinking modes in a single system.
-
Anthropic’s Claude Opus 4.1 Improves Refactoring and Safety, Scores 74.5% SWE-bench Verified
Anthropic has launched Claude Opus 4.1, an update that strengthens coding reliability in multi-file projects and improves reasoning across long interactions. The model also raised its SWE-bench Verified score to 74.5%, up from 72.5%. Building on Opus 4, the new version strengthens Claude’s ability to act as a coding assistant, particularly in multi-file contexts.
-
Qwen Team Open Sources State-of-the-Art Image Model Qwen-Image
Qwen Team recently open sourced Qwen-Image, an image foundation model. Qwen-Image supports text-to-image (T2I) generation and text-image-to-image (TI2I) editing tasks, and outperforms other models on a variety of benchmarks.
-
Claude Sonnet 4 Expands to 1 Million Token Context Window
Anthropic has upgraded Claude Sonnet 4 to support a context length of up to 1 million tokens, a fivefold increase over its previous limit. The feature, now in public beta, is accessible through the Anthropic API and Amazon Bedrock, with Google Cloud’s Vertex AI support expected soon.
-
DeepMind Launches Genie 3, a Text-to-3D Interactive World Model
DeepMind has introduced Genie 3, the latest version of its “world model” framework for generating interactive 3D environments directly from text prompts.
-
Unsloth Tutorials Aim to Make it Easier to Compare and Fine-tune LLMs
In a recent Reddit post, Unsloth published comprehensive tutorials of all of the open models they support. The tutorials can be used to compare the models’ strengths and weaknesses, as well as their performance benchmarks.
-
Anthropic Investigates How Large Language Models Develop a Character
Recent research by Anthropic engineers explores identifiable patterns of activity that seems to give rise to an emerging personality. These traits, known as persona vectors, help explain how a model's personality shifts over its lifecycle and lay the groundwork for better controlling those changes.
-
Google Launched LangExtract, a Python Library for Structured Data Extraction from Unstructured Text
Google has introduced LangExtract, an open-source Python library designed to help developers extract structured information from unstructured text using large language models such as the Gemini models.
-
GLM-4.5 Launches with Strong Reasoning, Coding, and Agentic Capabilities
Zhipu AI has released GLM-4.5 and GLM-4.5-Air, two new AI models designed to handle reasoning, coding, and agent tasks within a single architecture. They use a dual-mode system to switch between complex problem-solving and faster responses, aiming to improve both accuracy and speed.
-
OpenAI Launches Study Mode in ChatGPT to Support Step-by-Step Learning
OpenAI has introduced Study Mode in ChatGPT, a feature intended to guide users through problems in a step-by-step manner rather than supplying immediate answers. It uses interactive prompts, structured responses, and follow-up questions to encourage active engagement and support comprehension.
-
Anthropic Proposes Transparency Framework to Safeguard Frontier AI Development
Anthropic has proposed a new transparency framework designed to address the growing need for accountability in the development of frontier AI models. This proposal focuses on the largest AI companies that are developing powerful AI models, distinguished by factors such as computing power, cost, evaluation performance, and annual R&D expenditures.
-
Apple Shares Details on Upcoming AI Foundation Models for iOS 26
In a recent tech report, Apple has provided more details on the performance and characteristics of the new Apple Intelligence Foundation Models that will be part of iOS 26, as announced at the latest WWDC 2025.
-
Databricks Agent Bricks Automates Enterprise AI Development with TAO and ALHF Methods
Databricks introduced Agent Bricks, a new product that changes how enterprises develop domain-specific agents. The automated workflow includes generating task-specific evaluations and LLM judges for quality assessment, creating synthetic data that resembles customer data to supplement agent learning, and searching across optimization techniques to refine agent performance.
-
Qwen Team Releases Qwen3-Coder, a Large Agentic Coding Model with Open Tooling
Qwen Team has announced Qwen3-Coder, a new family of agentic code models designed for long-context, multi-step programming tasks. The most capable variant, Qwen3-Coder-480B-A35B-Instruct, is a Mixture-of-Experts model with a total of 480 billion parameters and 35 billion active parameters per forward pass.
-
Google Apigee Adds Built-in LLM Governance with Model Armor
Google Cloud has launched the public preview of Model Armor, a native LLM governance framework integrated into the Apigee API management platform. Detailed in a community post, Model Armor introduces out-of-the-box enforcement for LLM-specific policies such as prompt validation, output filtering, and token-level controls at the API layer.