Model Fine Tuning Content on InfoQ
Thinking Machines Releases Tinker API for Flexible Model Fine-Tuning
Thinking Machines has released Tinker, an API for fine-tuning open-weight language models. The service is designed to reduce infrastructure overhead for developers, providing managed scheduling, GPU allocation, and checkpoint handling. By abstracting away cluster management, Tinker allows fine-tuning through simple Python calls.
Unsloth Tutorials Aim to Make it Easier to Compare and Fine-tune LLMs
In a recent Reddit post, Unsloth published comprehensive tutorials for all of the open models it supports. The tutorials can be used to compare the models' strengths and weaknesses, as well as their performance benchmarks.
Nvidia's GB200 NVL72 Supercomputer Achieves 2.7× Faster Inference on DeepSeek V3
In collaboration with NVIDIA, researchers from SGLang have published early benchmarks of the GB200 (Grace Blackwell) NVL72 system, showing up to a 2.7× increase in LLM inference throughput compared to the H100 on the DeepSeek-V3 671B model.
instructlab.ai Uses Synthetic Data to Reduce Complexity of Fine-Tuning LLMs
InstructLab.ai implements the Large-scale Alignment for chatBots (LAB) concept, which aims to overcome the scalability challenges in the instruction-tuning phase of large language models (LLMs). Its approach leverages a synthetic-data-based alignment tuning method: hand-crafted taxonomies provide the seeds for synthesizing training data, reducing the need for human-annotated data.
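The taxonomy-seeded idea can be illustrated with a minimal sketch. Everything below (the taxonomy paths, the seed format, and the function name) is hypothetical for illustration only, not InstructLab's actual code or schema:

```python
# Hypothetical sketch of taxonomy-seeded synthetic data generation,
# loosely in the spirit of LAB. Not InstructLab's real implementation.

# A taxonomy maps a topic/skill path to a few human-written seed examples.
TAXONOMY = {
    "knowledge/science/astronomy": [
        {"question": "What is a light-year?",
         "answer": "The distance light travels in one year."},
    ],
    "skills/writing/summarization": [
        {"question": "Summarize: 'The cat sat on the mat.'",
         "answer": "A cat sat on a mat."},
    ],
}

def build_generation_prompts(taxonomy):
    """Turn each taxonomy leaf's seed examples into prompts that a
    teacher model could use to synthesize additional training pairs."""
    prompts = []
    for path, seeds in taxonomy.items():
        for seed in seeds:
            prompts.append(
                f"Topic: {path}\n"
                f"Seed Q: {seed['question']}\n"
                f"Seed A: {seed['answer']}\n"
                "Generate 5 new question-answer pairs in the same style."
            )
    return prompts

prompts = build_generation_prompts(TAXONOMY)
print(len(prompts))  # one generation prompt per seed example
```

The key point the sketch captures is that the human effort goes into a small, structured taxonomy of seeds, while a teacher model expands each seed into many synthetic instruction-tuning pairs.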