InfoQ Homepage AI, ML & Data Engineering Content on InfoQ

News

RSS Feed

Newer Older

AI, ML & Data Engineering

Gemini 2.5 Deep Think Parallelizes Creative Problem-Solving

As part of Google AI Ultra subscription, Gemini 2.5 Deep Think is a model designed for creative problem-solving through the use of parallel thinking techniques and extended inference time.

Sergio De Simone
on Aug 18, 2025
Cloud

Oracle Brings Database Services Directly to AWS Cloud

AWS recently announced the general availability of Oracle Database@AWS, a new option for Oracle Exadata workloads, including Oracle Real Application Clusters (RAC), within AWS. This new option allows enterprises to migrate Oracle Exadata workloads to dedicated infrastructure on AWS, simplifying cloud migration and unifying data across platforms.

Renato Losio
on Aug 16, 2025
AI, ML & Data Engineering

Roblox Open-Sources AI System to Detect Conversations Potentially Harmful to Kids

Roblox Sentinel is an AI system designed to detect early signs of potential child endangerment for further analysis and investigation. Implemented as a Python library, Sentinel uses contrastive learning to handle highly imbalanced datasets that often challenge traditional classifiers and can be applied to a wide range of use cases.

Sergio De Simone
on Aug 16, 2025
Architecture & Design

Unsloth Tutorials Aim to Make it Easier to Compare and Fine-tune LLMs

In a recent Reddit post, Unsloth published comprehensive tutorials of all of the open models they support. The tutorials can be used to compare the models’ strengths and weaknesses, as well as their performance benchmarks.

Patrick Farry
on Aug 16, 2025
AI, ML & Data Engineering

LangChain Launches Open SWE, an Open-Source Asynchronous Coding Agent

LangChain has released Open SWE, a fully open-source, asynchronous coding agent designed to operate in the cloud and handle complex software development tasks. The company says Open SWE represents a shift away from real-time “copilot” assistants toward more autonomous, long-running agents that integrate directly with a developer’s existing workflows.

Robert Krzaczyński
on Aug 13, 2025
AI, ML & Data Engineering

Google Launches Jules, an Asynchronous Coding Agent Powered by Gemini 2.5

Google has moved Jules, its asynchronous, agent-based coding assistant, out of beta and into general availability, positioning it as a tool for developers who want to offload routine programming tasks. Powered by the Gemini 2.5 Pro model, Jules is designed to handle a wide range of coding activities, from writing tests and building new features to fixing bugs or generating audio changelogs.

Robert Krzaczyński
on Aug 12, 2025
AI, ML & Data Engineering

Google DeepMind Open Sources Aeneas, an AI Model for Analyzing Ancient Texts

Google DeepMind open sourced Aeneas, a generative AI model for understanding ancient inscriptions. Aeneas can process both text and image input and outperforms other state-of-the-art models at restoring missing characters in damaged inscriptions.

Anthony Alford
on Aug 12, 2025
AI, ML & Data Engineering

Anthropic Investigates How Large Language Models Develop a Character

Recent research by Anthropic engineers explores identifiable patterns of activity that seems to give rise to an emerging personality. These traits, known as persona vectors, help explain how a model's personality shifts over its lifecycle and lay the groundwork for better controlling those changes.

Sergio De Simone
on Aug 12, 2025
AI, ML & Data Engineering

Vercel Releases AI Elements Library for React UI Integration

Vercel has released AI Elements, an open-source library of React UI primitives built atop shadcn/ui and designed to integrate with the Vercel AI SDK.

Daniel Dominguez
on Aug 12, 2025
AI, ML & Data Engineering

OpenAI’s GPT-5 Debuts with Commoditizing Costs and Higher Scrutiny

On August 7, 2025, OpenAI launched GPT-5, introducing advanced capabilities like a 400K token context and improved pricing for production use. Despite initial backlash over perceived performance issues, GPT-5 shows significant reductions in factual errors and enhances model integration through its new Agents SDK. As a result, user engagement has surged, solidifying its market influence.

Andrew Hoblitzell
on Aug 11, 2025
AI, ML & Data Engineering

New Rust Client Enables Building Safe, High-Performance Apps with Aerospike

Aerospike has officially released its Rust client to support high-throughput, low-latency applications interacting with its real-time NoSQL database.

Sergio De Simone
on Aug 09, 2025
Cloud

How Cloudflare Migrated Quicksilver to Multi-Level Caching While Serving Billions of Requests

The engineering team at Cloudflare recently shared how they transitioned Quicksilver, their internal global key-value store, to a tiered caching architecture. They described their incremental journey from storing everything everywhere to adopting a distributed caching system, improving storage efficiency while preserving consistency guarantees and low-latency reads at the edge.

Renato Losio
on Aug 09, 2025
AI, ML & Data Engineering

Google Launched LangExtract, a Python Library for Structured Data Extraction from Unstructured Text

Google has introduced LangExtract, an open-source Python library designed to help developers extract structured information from unstructured text using large language models such as the Gemini models.

Daniel Dominguez
on Aug 08, 2025
AI, ML & Data Engineering

OpenAI Releases gpt-oss-120b and gpt-oss-20b, Open-Weight Language Models for Local Deployment

OpenAI has released gpt-oss-120b and gpt-oss-20b, two open-weight language models designed for high-performance reasoning, tool use, and efficient deployment. These are the company’s first fully open-weight language models since GPT-2, and are available under the permissive Apache 2.0 license.

Robert Krzaczyński
on Aug 08, 2025
AI, ML & Data Engineering

GLM-4.5 Launches with Strong Reasoning, Coding, and Agentic Capabilities

Zhipu AI has released GLM-4.5 and GLM-4.5-Air, two new AI models designed to handle reasoning, coding, and agent tasks within a single architecture. They use a dual-mode system to switch between complex problem-solving and faster responses, aiming to improve both accuracy and speed.

Robert Krzaczyński
on Aug 07, 2025

Newer News

Older News

InfoQ Software Architects' Newsletter

News