InfoQ Homepage AI, ML & Data Engineering Content on InfoQ

News

RSS Feed

Newer Older

AI, ML & Data Engineering

Google DeepMind’s AlphaGeometry2 AI Achieves Gold-Medal Math Olympiad Performance

Google DeepMind's AlphaGeometry2 (AG2) AI model solved 84% of the geometry problems from the last 25 years of International Math Olympiads (IMO), outperforming the average human gold-medalist performance.

Anthony Alford
on Feb 25, 2025
AI, ML & Data Engineering

Perplexity Unveils Deep Research: AI-Powered Tool for Advanced Analysis

Perplexity has introduced Deep Research, an AI-powered tool designed for conducting in-depth analysis across various fields, including finance, marketing, and technology. The system automates the research process by performing multiple searches, analyzing extensive sources, and synthesizing findings into structured reports within minutes.

Robert Krzaczyński
on Feb 24, 2025
Cloud

AWS Reduces Latency and Costs for Key/Value Datastores with AZ Affinity Routing and GLIDE Valkey

AWS recently introduced Availability Zone (AZ) awareness in version 1.2 of the open source Valkey General Language Independent Driver for Enterprise (GLIDE) client library. By implementing AZ affinity routing in the open source key/value datastore, developers can reduce latency and costs by directing requests to replicas within the same AZ as the client.

Renato Losio
on Feb 22, 2025
AI, ML & Data Engineering

Google Gemini's Long-term Memory Vulnerable to a Kind of Phishing Attack

AI security hacker Johann Rehberger described a prompt injection attack against Google Gemini able to modify its long-term memories using a technique he calls delayed tool invocation. The researcher described the attack as a sort of social engineering/phishing attack triggered by the user interacting with a malicious document.

Sergio De Simone
on Feb 21, 2025
AI, ML & Data Engineering

OmniHuman-1: Advancing AI-Generated Human Animation

OmniHuman-1, an advanced AI-driven human video generation model, has been introduced, marking a significant leap in multimodal animation technology. OmniHuman-1 enables the creation of highly lifelike human videos using minimal input, such as a single image and motion cues like audio or video.

Robert Krzaczyński
on Feb 20, 2025
AI, ML & Data Engineering

Latin America Launches Latam-GPT to Improve AI Cultural Relevance

Latin America is advancing in the development of artificial intelligence with the creation of Latam-GPT, a language model designed to better represent the history, culture, and linguistic diversity of the region.

Daniel Dominguez
on Feb 20, 2025
AI, ML & Data Engineering

Meta Introduces LLM-Powered Tool for Software Testing

Meta has unveiled the Automated Compliance Hardening (ACH) tool, a mutation-guided, LLM-based test generation system. Designed to enhance software reliability and security, ACH generates faults in source code and subsequently creates tests to detect and address these issues.

Robert Krzaczyński
on Feb 19, 2025
AI, ML & Data Engineering

UC Berkeley's Sky Computing Lab Introduces Model to Reduce AI Language Model Inference Costs

UC Berkeley's Sky Computing Lab has released Sky-T1-32B-Flash, an updated reasoning language model that addresses the common issue of AI overthinking. The model, developed through the NovaSky (Next-generation Open Vision and AI) initiative, "slashes inference costs on challenging questions by up to 57%" while maintaining accuracy across mathematics, coding, science, and general knowledge domains.

Vinod Goje
on Feb 19, 2025
AI, ML & Data Engineering

OpenAI Cancels o3 Release and Announces Roadmap for GPT 4.5, 5

OpenAI is restructuring its AI strategy to focus solely on GPT-5, consolidating capabilities like reasoning, voice synthesis, and deep research into one unified model. This shift aims to simplify product offerings and enhance user experience, with tiered subscription levels for varying intelligence. As competition heats up, the success of GPT-5 will be pivotal for OpenAI’s future.

Andrew Hoblitzell
on Feb 18, 2025
AI, ML & Data Engineering

OpenAI Releases Operator, an AI Agent for Web-Based Tasks

OpenAI released a research preview of Operator, an AI agent that can use a web browser to perform tasks on a user's behalf. Operator achieves new state-of-the-art performance on the WebArena and WebVoyager benchmarks.

Anthony Alford
on Feb 18, 2025
AI, ML & Data Engineering

Distributed Multi-Modal Database Aerospike 8 Brings Support for Real-Time ACID Transactions

Aerospike has announced version 8.0 of its distributed multi-modal database, bringing support for distributed ACID transactions. This enables large-scale online transaction processing (OLTP) applications like banking, e-commerce, inventory management, health care, order processing, and more, says the company.

Sergio De Simone
on Feb 16, 2025
AI, ML & Data Engineering

Gemini 2.0 Family Expands with Cost-Efficient Flash-Lite and Pro-Experimental Models

Announced last December, the Gemini 2.0 family of models now has a new member, Gemini 2.0 Flash-Lite, which Google says is cost-optimized for large scale text output use cases and is now available in preview. Along with Flash-Lite, Google also announced Gemini 2.0 Pro.

Sergio De Simone
on Feb 13, 2025
AI, ML & Data Engineering

Microsoft Introduces CoRAG: Enhancing AI Retrieval with Iterative Reasoning

Microsoft AI has introduced Chain-of-Retrieval Augmented Generation (CoRAG), a new AI framework designed to enhance Retrieval-Augmented Generation (RAG) models. Unlike traditional RAG systems, which rely on a single retrieval step, CoRAG enables iterative search and reasoning, allowing AI models to refine their retrievals dynamically before generating answers.

Robert Krzaczyński
on Feb 11, 2025
AI, ML & Data Engineering

OpenAI Releases Reasoning Model o3-mini, Faster and More Accurate Than o1

OpenAI released OpenAI o3-mini, their latest reasoning LLM. o3-mini is optimized for STEM applications and outperforms the full o1 model on science, math, and coding benchmarks, with lower response latency than o1-mini.

Anthony Alford
on Feb 11, 2025
Cloud

OpenAI Features New o3-mini Model on Microsoft Azure OpenAI Service

OpenAI has launched the advanced o3-mini model via Microsoft Azure, enhancing AI applications with improved cost efficiency, faster performance, and adjustable reasoning capabilities. Designed for complex tasks, it supports structured outputs and backward compatibility. With widespread access, the o3-mini empowers developers to drive innovation across various industries.

Steef-Jan Wiggers
on Feb 11, 2025

Newer News

Older News

InfoQ Software Architects' Newsletter

News