Large Language Models Content on InfoQ
-
Google DeepMind Introduces QuestBench to Evaluate LLMs in Solving Logic and Math Problems
Google DeepMind’s QuestBench benchmark evaluates whether LLMs can pinpoint the single, crucial question needed to solve logic, planning, or math problems. The DeepMind team recently published an article on QuestBench, a set of underspecified reasoning tasks solvable by asking at most one question.
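To make the idea concrete, here is a minimal, hypothetical sketch of a QuestBench-style task: the problem is underspecified, and the model is credited for asking the one question that makes it solvable. The task fields and scoring below are illustrative assumptions, not the benchmark's actual schema.

```python
# Hypothetical illustration of an underspecified task in the QuestBench spirit:
# the problem cannot be solved as stated, and exactly one clarifying question
# unlocks it. Field names and scoring are assumptions for illustration only.

task = {
    "problem": "x + y = 10. What is x?",   # underspecified: y is unknown
    "candidate_questions": [
        "What is the value of y?",          # the single sufficient question
        "Is y positive?",
        "Is x greater than y?",
    ],
    "sufficient_question_index": 0,
}

def score_choice(task: dict, model_choice: int) -> bool:
    """Return True if the model picked the one question that makes the task solvable."""
    return model_choice == task["sufficient_question_index"]

# A model that asks question 0 would be credited for this task.
print(score_choice(task, model_choice=0))  # True
```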
-
Docker Model Runner Aims to Make it Easier to Run LLM Models Locally
Currently in preview with Docker Desktop 4.40 for macOS on Apple Silicon, Docker Model Runner allows developers to run models locally and iterate on application code against them, without disrupting their container-based workflows.
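Because Model Runner serves pulled models behind an OpenAI-compatible endpoint, existing client code can simply be pointed at it. A minimal sketch, assuming a model pulled with `docker model pull` and the preview's default host-side endpoint; the URL, port, and model name are assumptions that may differ in your setup.

```python
# Minimal sketch: talk to a model served locally by Docker Model Runner through
# its OpenAI-compatible API. Endpoint URL, port, and model name are assumptions
# (check `docker model ls` and the Docker Desktop 4.40 preview docs for yours).
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:12434/engines/v1",  # assumed host-side endpoint
    api_key="not-needed-locally",                  # the local runner ignores the key
)

response = client.chat.completions.create(
    model="ai/smollm2",  # assumed model, pulled earlier with `docker model pull ai/smollm2`
    messages=[{"role": "user", "content": "Summarize what a container image is."}],
)
print(response.choices[0].message.content)
```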
-
AI Continent: European Commission Outlines Strategy for Scaling AI Development
The European Commission has presented the AI Continent Action Plan, a new strategy designed to strengthen the European Union’s capacity for AI development and deployment. The plan outlines coordinated investment in infrastructure, access to high-quality data, AI adoption in strategic sectors, and support for regulatory implementation.
-
FastAPI-MCP: Simplifying the Integration of FastAPI with AI Agents
A new open-source library, FastAPI-MCP, is making it easier for developers to connect traditional FastAPI applications with modern AI agents through the Model Context Protocol (MCP). Designed for zero-configuration setup, FastAPI-MCP allows developers to automatically expose their API endpoints as MCP-compatible tools.
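Given the library's zero-configuration goal, usage reduces to wrapping an existing app. A minimal sketch, assuming the `FastApiMCP` class and `mount()` method exposed by the `fastapi_mcp` package; check the project README for the current API.

```python
# Minimal sketch: expose an existing FastAPI app's endpoints as MCP tools.
# Class and method names follow the fastapi_mcp README at the time of writing
# and should be treated as assumptions.
from fastapi import FastAPI
from fastapi_mcp import FastApiMCP

app = FastAPI()

@app.get("/items/{item_id}", operation_id="get_item")
def get_item(item_id: int) -> dict:
    """A regular REST endpoint; the operation_id becomes the MCP tool name."""
    return {"item_id": item_id, "name": f"Item {item_id}"}

# Wrap the app and mount the MCP server (served under /mcp by default).
mcp = FastApiMCP(app)
mcp.mount()

# Run with `uvicorn main:app`, then point an MCP client at the /mcp path.
```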
-
Google Releases Open-Source Agent Development Kit for Multi-Agent AI Applications
At Google Cloud Next 2025, Google announced the Agent Development Kit (ADK), an open-source framework aimed at simplifying the development of intelligent, multi-agent applications. The toolkit is designed to support developers across the entire lifecycle of agentic systems — from logic design and orchestration to debugging, evaluation, and deployment.
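ADK's quickstart centers on declaring agents with a model, instructions, and plain Python functions as tools. A minimal sketch, assuming the `google.adk.agents.Agent` class and a Gemini model identifier from the documentation; both should be verified against the current release.

```python
# Minimal sketch of an ADK agent definition. The Agent import path, parameter
# names, and model identifier follow the ADK quickstart and are assumptions
# that may change between releases.
from google.adk.agents import Agent

def get_build_status(pipeline: str) -> dict:
    """Toy tool: report a (hard-coded) build status for a named pipeline."""
    return {"pipeline": pipeline, "status": "passing"}

root_agent = Agent(
    name="ci_assistant",
    model="gemini-2.0-flash",          # assumed model identifier
    description="Answers questions about CI pipelines.",
    instruction="Use the get_build_status tool when asked about a pipeline.",
    tools=[get_build_status],
)

# The ADK CLI (`adk run` / `adk web`) can then load this agent for local testing.
```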
-
Datadog Employs LLMs to Assist with Writing Incident Postmortems
Datadog combined structured metadata from its incident management app with Slack messages to create an LLM-driven functionality assisting engineers in composing incident postmortems. While working on this solution, the company dealt with the challenges of using LLMs outside of interactive dialog systems and ensuring that high-quality content was produced.
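The general pattern is worth illustrating, without claiming this is Datadog's implementation: structured incident fields anchor the prompt, and the Slack timeline supplies the narrative raw material. A hypothetical sketch:

```python
# Hypothetical sketch of the general pattern (not Datadog's code): combine
# structured incident metadata with a Slack timeline into a single drafting
# prompt, constrained to a fixed postmortem structure.
incident = {
    "id": "INC-1234",
    "severity": "SEV-2",
    "started_at": "2025-03-02T14:05Z",
    "resolved_at": "2025-03-02T15:40Z",
    "services": ["checkout-api"],
}

slack_messages = [
    "14:07 alice: seeing 5xx spikes on checkout-api",
    "14:20 bob: rolled back deploy 8841, errors dropping",
    "15:35 alice: error rate back to baseline, resolving",
]

prompt = (
    "Draft an incident postmortem with sections: Summary, Impact, Timeline, "
    "Root Cause, Action Items. Use only the facts provided.\n\n"
    f"Incident metadata: {incident}\n\n"
    "Slack timeline:\n" + "\n".join(slack_messages)
)

# `prompt` would then be sent to an LLM to produce a first draft for engineers to edit.
```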
-
Anthropic's "AI Microscope" Explores the Inner Workings of Large Language Models
Two recent papers from Anthropic attempt to shed light on the processes that take place within a large language model. They explore how to locate interpretable concepts and link them to the computational "circuits" that translate them into language, and how to characterize crucial behaviors of Claude 3.5 Haiku, including hallucinations, planning, and other key traits.
-
Claude for Education: Anthropic’s AI Assistant Goes to University
Anthropic has announced the launch of Claude for Education, a specialized version of its AI assistant, Claude, developed specifically for colleges and universities. The initiative aims to support students, faculty, and administrators with secure and responsible AI integration across academics and campus operations.
-
Microsoft Collaborates with Anthropic to Launch C# SDK for MCP Integration
Microsoft has partnered with Anthropic to develop an official C# SDK for the Model Context Protocol (MCP), an open protocol designed to connect large language models (LLMs) with external tools and data sources. The SDK is open-source and available under the modelcontextprotocol GitHub organization.
-
AMD’s Gaia Framework Brings Local LLM Inference to Consumer Hardware
AMD has released Gaia, an open-source project allowing developers to run large language models (LLMs) locally on Windows machines with AMD hardware acceleration. The framework supports retrieval-augmented generation (RAG) and includes tools for indexing local data sources. Gaia is designed to offer an alternative to LLMs hosted by cloud service providers (CSPs).
-
Meta AI Releases Llama 4: Early Impressions and Community Feedback
Meta has officially released the first models in its new Llama 4 family—Scout and Maverick—marking a step forward in its open-weight large language model ecosystem. Designed with a native multimodal architecture and a mixture-of-experts (MoE) framework, these models aim to support a broader range of applications, from image understanding to long-context reasoning.
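For readers unfamiliar with the mixture-of-experts idea these models build on, here is a toy sketch of per-token expert routing. It is purely conceptual and does not reflect Llama 4's actual expert count, router, or weights.

```python
# Toy illustration of mixture-of-experts routing (conceptual only; not the
# actual Llama 4 architecture, expert count, or router implementation).
import numpy as np

rng = np.random.default_rng(0)
hidden_dim, num_experts, top_k = 8, 4, 1

# Each "expert" is a small feed-forward weight matrix; the router scores experts per token.
experts = [rng.standard_normal((hidden_dim, hidden_dim)) for _ in range(num_experts)]
router = rng.standard_normal((hidden_dim, num_experts))

def moe_layer(token: np.ndarray) -> np.ndarray:
    """Route a token to its top-k experts and combine their outputs by router weight."""
    scores = token @ router                        # one score per expert
    top = np.argsort(scores)[-top_k:]              # indices of the best-scoring expert(s)
    weights = np.exp(scores[top]) / np.exp(scores[top]).sum()
    return sum(w * (token @ experts[i]) for w, i in zip(weights, top))

token = rng.standard_normal(hidden_dim)
print(moe_layer(token).shape)  # (8,): only the top_k experts did any work for this token
```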
-
Announcing QCon AI: Focusing on Practical, Scalable AI Implementation for Engineering Teams
QCon AI focuses on practical, real-world AI for senior developers, architects, and engineering leaders. Join us Dec 16-17, 2025, in NYC to learn how teams are building and scaling AI in production—covering MLOps, system reliability, cost optimization, and more. No hype, just actionable insights from those doing the work.
-
How SREs and GenAI Work Together to Decrease eBay's Downtime: an Architect's Insights at KubeCon EU
During his KubeCon EU keynote, Vijay Samuel, Principal MTS Architect at eBay, shared his team’s experience of enhancing incident response capabilities by incorporating ML and LLM building blocks. They realised that GenAI is not a silver bullet but can help engineers work through complex incident investigations by explaining logs, traces, and dashboards.
-
How Observability Can Improve the UX of LLM Based Systems: Insights of Honeycomb's CEO at KubeCon EU
During her KubeCon Europe keynote, Christine Yen, CEO and co-founder of Honeycomb, shared insights on how observability can help teams cope with the rapid shifts introduced by integrating LLMs into software systems, which has transformed not only how we develop software but also how we release it. She explained how to adapt the development feedback loop based on production observations.
-
OpenAI Introduces New Speech Models for Transcription and Voice Generation
OpenAI has introduced new speech-to-text and text-to-speech models in its API, focusing on improving transcription accuracy and offering more control over AI-generated voices. These updates aim to enhance automated speech applications, making them more adaptable to different environments and use cases.
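A minimal sketch of both directions using the official Python SDK; the model names (`gpt-4o-transcribe`, `gpt-4o-mini-tts`) and the `instructions` parameter are taken from the announcement and should be verified against the current API reference.

```python
# Minimal sketch: speech-to-text and text-to-speech with the OpenAI Python SDK.
# Model names and the `instructions` parameter come from the announcement and
# are assumptions to verify against the current API reference.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# Speech-to-text: transcribe a local audio file.
with open("meeting.mp3", "rb") as audio_file:
    transcript = client.audio.transcriptions.create(
        model="gpt-4o-transcribe",
        file=audio_file,
    )
print(transcript.text)

# Text-to-speech: generate audio with a steerable speaking style.
with client.audio.speech.with_streaming_response.create(
    model="gpt-4o-mini-tts",
    voice="coral",
    input="Your order has shipped and should arrive on Thursday.",
    instructions="Speak in a calm, friendly customer-support tone.",
) as response:
    response.stream_to_file("reply.mp3")
```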