InfoQ Homepage Large language models Content on InfoQ
-
DeepMind Launches Genie 3, a Text-to-3D Interactive World Model
DeepMind has introduced Genie 3, the latest version of its “world model” framework for generating interactive 3D environments directly from text prompts.
-
Unsloth Tutorials Aim to Make it Easier to Compare and Fine-tune LLMs
In a recent Reddit post, Unsloth published comprehensive tutorials of all of the open models they support. The tutorials can be used to compare the models’ strengths and weaknesses, as well as their performance benchmarks.
-
Anthropic Investigates How Large Language Models Develop a Character
Recent research by Anthropic engineers explores identifiable patterns of activity that seems to give rise to an emerging personality. These traits, known as persona vectors, help explain how a model's personality shifts over its lifecycle and lay the groundwork for better controlling those changes.
-
Google Launched LangExtract, a Python Library for Structured Data Extraction from Unstructured Text
Google has introduced LangExtract, an open-source Python library designed to help developers extract structured information from unstructured text using large language models such as the Gemini models.
-
GLM-4.5 Launches with Strong Reasoning, Coding, and Agentic Capabilities
Zhipu AI has released GLM-4.5 and GLM-4.5-Air, two new AI models designed to handle reasoning, coding, and agent tasks within a single architecture. They use a dual-mode system to switch between complex problem-solving and faster responses, aiming to improve both accuracy and speed.
-
OpenAI Launches Study Mode in ChatGPT to Support Step-by-Step Learning
OpenAI has introduced Study Mode in ChatGPT, a feature intended to guide users through problems in a step-by-step manner rather than supplying immediate answers. It uses interactive prompts, structured responses, and follow-up questions to encourage active engagement and support comprehension.
-
Anthropic Proposes Transparency Framework to Safeguard Frontier AI Development
Anthropic has proposed a new transparency framework designed to address the growing need for accountability in the development of frontier AI models. This proposal focuses on the largest AI companies that are developing powerful AI models, distinguished by factors such as computing power, cost, evaluation performance, and annual R&D expenditures.
-
Apple Shares Details on Upcoming AI Foundation Models for iOS 26
In a recent tech report, Apple has provided more details on the performance and characteristics of the new Apple Intelligence Foundation Models that will be part of iOS 26, as announced at the latest WWDC 2025.
-
Databricks Agent Bricks Automates Enterprise AI Development with TAO and ALHF Methods
Databricks introduced Agent Bricks, a new product that changes how enterprises develop domain-specific agents. The automated workflow includes generating task-specific evaluations and LLM judges for quality assessment, creating synthetic data that resembles customer data to supplement agent learning, and searching across optimization techniques to refine agent performance.
-
Qwen Team Releases Qwen3-Coder, a Large Agentic Coding Model with Open Tooling
Qwen Team has announced Qwen3-Coder, a new family of agentic code models designed for long-context, multi-step programming tasks. The most capable variant, Qwen3-Coder-480B-A35B-Instruct, is a Mixture-of-Experts model with a total of 480 billion parameters and 35 billion active parameters per forward pass.
-
Google Apigee Adds Built-in LLM Governance with Model Armor
Google Cloud has launched the public preview of Model Armor, a native LLM governance framework integrated into the Apigee API management platform. Detailed in a community post, Model Armor introduces out-of-the-box enforcement for LLM-specific policies such as prompt validation, output filtering, and token-level controls at the API layer.
-
State Space Models Can Enable AI in Low-Power Edge Computing
At the the 2025 Embedded Vision Summit, Tony Lewis, chief technology officer at BrainChip, presented research done by his company into state space models (SSMs) and how they can provide LLM capabilities with very low power consumption in limited computing environments, such as those found on dashcams, medical devices, security cameras, and even toys.
-
Perplexity Launches Comet: a Browser Designed around AI-Assisted Interaction
Perplexity has introduced Comet, a new web browser designed to integrate natural language interaction directly into the browsing experience. Unlike conventional browsers built around navigation and search, Comet aims to support users in research, comparison, and task execution by combining browsing with persistent context and AI assistance.
-
Amazon Launches Bedrock AgentCore for Enterprise AI Agent Infrastructure
Amazon announced the preview of Amazon Bedrock AgentCore, a collection of enterprise-grade services that help developers deploy and operate AI agents at scale across frameworks and foundation models. The platform addresses infrastructure challenges developers face when building production AI agents.
-
Wix Adds Chaos to CI/CD Pipelines with AI and Improves Reliability
Cloud-based web development service Wix has written about a new approach to integrating artificial intelligence into continuous integration and continuous deployment (CI/CD) systems. In a blog post, Wix demonstrates how probabilistic AI can coexist with deterministic development processes, adding chaos without compromising reliability.