Gemma Content on InfoQ
News
Google BigQuery Adds SQL-Native Managed Inference for Hugging Face Models
Google has launched SQL-native managed inference for 180,000+ Hugging Face models in BigQuery. The preview release collapses the ML lifecycle into a unified SQL interface, eliminating the need for separate Kubernetes or Vertex AI management. Key features include automated resource governance via endpoint_idle_ttl and secure identity-based execution using existing data warehouse permissions.
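As a rough illustration of what that unified SQL flow could look like from Python, the sketch below creates a model over a Hugging Face checkpoint and then queries it with ML.PREDICT. Apart from endpoint_idle_ttl, which the announcement names, the option names, dataset, and model identifiers here are assumptions rather than confirmed BigQuery syntax.

```python
# Hypothetical sketch: create a managed-inference model over a Hugging Face
# checkpoint and query it, entirely through BigQuery SQL issued from Python.
# Except for endpoint_idle_ttl, the OPTIONS names below are illustrative
# assumptions, not confirmed BigQuery syntax.
from google.cloud import bigquery

client = bigquery.Client()  # uses application-default credentials

create_model_sql = """
CREATE OR REPLACE MODEL `my_dataset.hf_embedder`          -- hypothetical names
OPTIONS (
  model_provider = 'HUGGING_FACE',                        -- assumed option name
  hugging_face_model_id = 'sentence-transformers/all-MiniLM-L6-v2',
  endpoint_idle_ttl = '30m'                               -- auto-teardown of idle endpoints
);
"""
client.query(create_model_sql).result()

# Run inference with plain SQL; ML.PREDICT is BigQuery ML's generic entry point.
predict_sql = """
SELECT *
FROM ML.PREDICT(
  MODEL `my_dataset.hf_embedder`,
  (SELECT review_text AS content FROM `my_dataset.reviews` LIMIT 10)
);
"""
for row in client.query(predict_sql).result():
    print(dict(row))
```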
Google Introduces TranslateGemma Open Models for Multilingual Translation
Google has released TranslateGemma, a set of open translation models based on the Gemma 3 architecture. The models come in 4B, 12B, and 27B parameter variants, support machine translation across 55 languages, and are designed to run on platforms ranging from mobile and edge devices to consumer hardware and cloud accelerators.
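The sketch below shows how such an open checkpoint could be run with the Hugging Face transformers library. The repository name and prompt format are illustrative assumptions; the official TranslateGemma model cards should be consulted for the real identifiers.

```python
# Illustrative sketch only: run one of the open TranslateGemma checkpoints with
# Hugging Face transformers. The repository name and the prompt wording below
# are assumptions for illustration, not the documented usage.
from transformers import pipeline

translator = pipeline(
    "text-generation",
    model="google/translategemma-4b-it",  # hypothetical model ID
    device_map="auto",
)

prompt = "Translate the following English sentence into German: The weather is lovely today."
result = translator(prompt, max_new_tokens=64)
print(result[0]["generated_text"])
```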
Gemma 3n Available for On-Device Inference Alongside RAG and Function Calling Libraries
Google has announced that Gemma 3n is now available in preview on the new LiteRT Hugging Face community, alongside many previously released models. Gemma 3n is a multimodal small language model that supports text, image, video, and audio inputs. It also supports fine-tuning, as well as customization through retrieval-augmented generation (RAG) and function calling using new AI Edge SDKs.
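As a hedged illustration of the distribution side, the following Python snippet fetches an on-device model bundle from the Hugging Face Hub with huggingface_hub. The repository and file names are assumptions, and inference, RAG, and function calling would run on the device through the AI Edge SDKs rather than in this script.

```python
# Hypothetical sketch: fetch an on-device Gemma 3n bundle from the LiteRT
# Hugging Face community so it can be packaged into a mobile app. The repo and
# file names are assumptions; the actual inference (plus RAG and function
# calling) happens on device through the AI Edge SDKs, not in this script.
from huggingface_hub import hf_hub_download

model_path = hf_hub_download(
    repo_id="litert-community/gemma-3n-e2b-it",   # assumed repository name
    filename="gemma-3n-e2b-it.task",              # assumed LiteRT task bundle
)
print(f"Downloaded on-device model bundle to {model_path}")
```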
Gemma 3 Supports Vision-Language Understanding, Long Context Handling, and Improved Multilinguality
Google’s generative artificial intelligence (AI) model Gemma 3 supports vision-language understanding, long context handling, and improved multilinguality. In a recent blog post, the Google DeepMind and AI Studio teams discussed the new features in Gemma 3. The model also features reduced KV-cache memory usage, a new tokenizer, better performance, and higher-resolution vision encoders.
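A minimal sketch of the vision-language side, assuming the instruction-tuned 4B checkpoint and the transformers image-text-to-text pipeline; the image URL is a placeholder, and access to the gated checkpoint plus a capable GPU is assumed.

```python
# Sketch under assumptions: describe an image with a Gemma 3 instruction-tuned
# checkpoint via the transformers image-text-to-text pipeline. The image URL is
# a placeholder; the checkpoint is gated and needs license acceptance.
from transformers import pipeline

pipe = pipeline("image-text-to-text", model="google/gemma-3-4b-it", device_map="auto")

messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://example.com/cat.png"},  # placeholder image
            {"type": "text", "text": "Describe this image in one sentence."},
        ],
    }
]

output = pipe(text=messages, max_new_tokens=50)
print(output[0]["generated_text"])  # chat history with the model's reply appended
```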
Docker Model Runner Aims to Make it Easier to Run LLM Models Locally
Currently in preview with Docker Desktop 4.40 for macOS on Apple Silicon, Docker Model Runner allows developers to run models locally and iterate on application code against those local models, without disrupting their container-based workflows.
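A short sketch, assuming Model Runner's OpenAI-compatible endpoint is enabled on the default host port and that a Gemma model has already been pulled; the base URL and model name are assumptions to adjust for a local setup.

```python
# Sketch under assumptions: Docker Model Runner exposes an OpenAI-compatible
# endpoint. The base URL (host-side TCP access on port 12434) and the
# "ai/gemma3" model name are assumptions typical of a local setup, not
# guaranteed defaults, so adjust them to your configuration.
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:12434/engines/v1",  # assumed Model Runner endpoint
    api_key="not-needed-locally",                  # the local runner ignores the key
)

response = client.chat.completions.create(
    model="ai/gemma3",  # assumes the model was pulled beforehand, e.g. `docker model pull ai/gemma3`
    messages=[{"role": "user", "content": "Summarize what a container image is."}],
)
print(response.choices[0].message.content)
```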
Google Launches Gemma 3 1B for Mobile and Web Apps
Requiring a "mere" 529MB, Gemma 3 1B is a small language model (SLM) specifically meant for distribution across mobile and Web apps, where models must download quickly and be responsive to keep user engagement high.