InfoQ Homepage Hugging Face Content on InfoQ

News

RSS Feed

Newer Older

AI, ML & Data Engineering

Mistral AI Releases Magistral, Its First Reasoning-Focused Language Model

Mistral AI has released Magistral, a new model family built for transparent, multi-step reasoning. Available in open and enterprise versions, it supports structured logic, multilingual output, and traceable decision-making.

Robert Krzaczyński
on Jun 16, 2025
AI, ML & Data Engineering

Hugging Face to Democratize Robotics with Open-Source Reachy 2 Robot

Hugging Face has acquired Pollen Robotics, a French startup that developed the humanoid robot Reachy 2. The acquisition aims to make robotics more accessible by open-sourcing the robot’s design and allowing developers to modify and improve its code.

Daniel Dominguez
on May 10, 2025
AI, ML & Data Engineering

Meta AI Releases Llama 4: Early Impressions and Community Feedback

Meta has officially released the first models in its new Llama 4 family—Scout and Maverick—marking a step forward in its open-weight large language model ecosystem. Designed with a native multimodal architecture and a mixture-of-experts (MoE) framework, these models aim to support a broader range of applications, from image understanding to long-context reasoning.

Robert Krzaczyński
on Apr 07, 2025
AI, ML & Data Engineering

Roblox Releases Cube 3D, an AI Open-Source Model for 3D Model Generation

Roblox has introduced Cube 3D, a generative AI system designed for creating 3D and 4D objects and environments.

Daniel Dominguez
on Mar 22, 2025
AI, ML & Data Engineering

Hugging Face Publishes Guide on Efficient LLM Training across GPUs

Hugging Face has published the Ultra-Scale Playbook: Training LLMs on GPU Clusters, an open-source guide that provides a detailed exploration of the methodologies and technologies involved in training LLMs across GPU clusters.

Daniel Dominguez
on Mar 04, 2025
AI, ML & Data Engineering

Hugging Face Expands Serverless Inference Options with New Provider Integrations

Hugging Face has launched the integration of four serverless inference providers Fal, Replicate, SambaNova, and Together AI, directly into its model pages. These providers are also integrated into Hugging Face's client SDKs for JavaScript and Python, allowing users to run inference on various models with minimal setup.

Daniel Dominguez
on Feb 04, 2025
AI, ML & Data Engineering

DeepSeek Release Another Open-Source AI Model, Janus Pro

DeepSeek has released Janus-Pro, an updated version of its multimodal model, Janus. The new model improves training strategies, data scaling, and model size, enhancing multimodal understanding and text-to-image generation.

Daniel Dominguez
on Jan 31, 2025
AI, ML & Data Engineering

Synthetic Data Generator Simplifies Dataset Creation with Large Language Models

Hugging Face has introduced the Synthetic Data Generator, a new tool leveraging Large Language Models (LLMs), that offers a streamlined, no-code approach to creating custom datasets. The tool facilitates the creation of text classification and chat datasets through a clear and accessible process, making it usable for both non-technical users and experienced AI practitioners.

Robert Krzaczyński
on Jan 27, 2025
AI, ML & Data Engineering

Microsoft Phi-4 is a Small Language Model Specialized for Complex Math Reasoning

Phi-4 is 14B parameter model from Microsoft Research that aims to improve the state of the art for math reasoning. Previously available on Azure AI Foundry, Phi-4 has recently become available on Hugging Face under the MIT license.

Sergio De Simone
on Jan 24, 2025
AI, ML & Data Engineering

Hugging Face Smolagents is a Simple Library to Build LLM-Powered Agents

Smolagents is a library created at Hugging Face to build agents based on large language models (LLMs). Hugging Faces says its new library aims to be simple and LLM-agnostic. It supports secure "agents that write their actions in code" and is integrated with Hugging Face Hub.

Sergio De Simone
on Jan 04, 2025
AI, ML & Data Engineering

NVIDIA Unveils Hymba 1.5B: a Hybrid Approach to Efficient NLP Models

NVIDIA researchers have unveiled Hymba 1.5B, an open-source language model that combines transformer and state-space model (SSM) architectures to achieve unprecedented efficiency and performance. Designed with NVIDIA’s optimized training pipeline, Hymba addresses the computational and memory limitations of traditional transformers while enhancing the recall capabilities of SSMs.

Robert Krzaczyński
on Jan 03, 2025
AI, ML & Data Engineering

LLaMA-Mesh: NVIDIA’s Breakthrough in Unifying 3D Mesh Generation and Language Models

NVIDIA researchers have introduced LLaMA-Mesh, a groundbreaking approach that extends large language models (LLMs) to generate and interpret 3D mesh data in a unified, text-based framework. LLaMA-Mesh tokenizes 3D meshes as plain text, enabling the seamless integration of spatial and textual information.

Robert Krzaczyński
on Jan 02, 2025
AI, ML & Data Engineering

Hugging Face and Entalpic Unveil LeMaterial: Transforming Materials Science through AI

Entalpic, in collaboration with Hugging Face, has launched LeMaterial, an open-source initiative to tackle key challenges in materials science. By unifying data from major resources into LeMat-Bulk, a harmonized dataset with 6.7 million entries, LeMaterial aims to streamline materials discovery and accelerate innovation in areas such as LEDs, batteries, and photovoltaic cells.

Robert Krzaczyński
on Dec 19, 2024
AI, ML & Data Engineering

Meta Releases Llama 3.3: a Multilingual Model with Enhanced Performance and Efficiency

Meta has released Llama 3.3, a multilingual large language model aimed at supporting a range of AI applications in research and industry. Featuring a 128k-token context window and architectural improvements for efficiency, the model demonstrates strong performance in benchmarks for reasoning, coding, and multilingual tasks. It is available under a community license on Hugging Face.

Robert Krzaczyński
on Dec 14, 2024
AI, ML & Data Engineering

Mistral AI Releases Pixtral Large: a Multimodal Model for Advanced Image and Text Analysis

Mistral AI released Pixtral Large, a 124-billion-parameter multimodal model designed for advanced image and text processing with a 1-billion-parameter vision encoder. Built on Mistral Large 2, it achieves leading performance on benchmarks like MathVista and DocVQA, excelling in tasks that require reasoning across text and visual data.

Robert Krzaczyński
on Dec 04, 2024

Newer News

Older News

InfoQ Software Architects' Newsletter

News