AI, ML & Data Engineering Content on InfoQ
-
Distributed Multi-Modal Database Aerospike 8 Brings Support for Real-Time ACID Transactions
Aerospike has announced version 8.0 of its distributed multi-modal database, bringing support for distributed ACID transactions. This enables large-scale online transaction processing (OLTP) applications like banking, e-commerce, inventory management, health care, order processing, and more, says the company.
-
Gemini 2.0 Family Expands with Cost-Efficient Flash-Lite and Pro-Experimental Models
Announced last December, the Gemini 2.0 family of models now has a new member, Gemini 2.0 Flash-Lite, which Google says is cost-optimized for large-scale text output use cases and is now available in preview. Along with Flash-Lite, Google also announced Gemini 2.0 Pro.
-
Microsoft Introduces CoRAG: Enhancing AI Retrieval with Iterative Reasoning
Microsoft AI has introduced Chain-of-Retrieval Augmented Generation (CoRAG), a new AI framework designed to enhance Retrieval-Augmented Generation (RAG) models. Unlike traditional RAG systems, which rely on a single retrieval step, CoRAG enables iterative search and reasoning, allowing AI models to refine their retrievals dynamically before generating answers.
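The difference between a single retrieval step and iterative retrieval can be illustrated with a toy loop. This is a minimal sketch, not Microsoft's implementation: the corpus, the keyword-overlap `retrieve` function, and the hard-coded `next_query` function (a stand-in for the model deciding what to look up next) are all hypothetical.

```python
from typing import Callable, List, Optional

# Toy corpus (hypothetical data, for illustration only).
CORPUS = [
    "Alice manages the Payments team.",
    "The Payments team owns the ledger service.",
    "Bob manages the Search team.",
]


def tokenize(text: str) -> set:
    """Lowercase the text, drop basic punctuation, and split into a set of words."""
    return set(text.lower().replace(".", "").replace("?", "").split())


def retrieve(query: str) -> List[str]:
    """Return the single document sharing the most words with the query."""
    q = tokenize(query)
    ranked = sorted(CORPUS, key=lambda d: len(q & tokenize(d)), reverse=True)
    return ranked[:1]


def next_query(question: str, evidence: List[str]) -> Optional[str]:
    """Stand-in for a model proposing a follow-up query; None means stop."""
    if any("manages" in doc for doc in evidence):
        return None
    return "manages Payments team"


def iterative_retrieve(
    question: str,
    retrieve_fn: Callable[[str], List[str]],
    next_query_fn: Callable[[str, List[str]], Optional[str]],
    max_steps: int = 3,
) -> List[str]:
    """Gather evidence over several retrieval rounds instead of a single one."""
    evidence: List[str] = []
    query: Optional[str] = question
    for _ in range(max_steps):
        for doc in retrieve_fn(query):
            if doc not in evidence:
                evidence.append(doc)
        # Refine the query based on what has been found so far.
        query = next_query_fn(question, evidence)
        if query is None:
            break
    return evidence


question = "Who manages the team that owns the ledger service?"
for doc in iterative_retrieve(question, retrieve, next_query):
    print(doc)
```

In this toy setup a single retrieval with the original question surfaces only the ledger-service document; the follow-up query is what brings in the document naming the manager.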
-
OpenAI Releases Reasoning Model o3-mini, Faster and More Accurate Than o1
OpenAI released OpenAI o3-mini, their latest reasoning LLM. o3-mini is optimized for STEM applications and outperforms the full o1 model on science, math, and coding benchmarks, with lower response latency than o1-mini.
-
OpenAI Features New o3-mini Model on Microsoft Azure OpenAI Service
OpenAI has launched the advanced o3-mini model via Microsoft Azure, enhancing AI applications with improved cost efficiency, faster performance, and adjustable reasoning capabilities. Designed for complex tasks, it supports structured outputs and backward compatibility. With widespread access, the o3-mini empowers developers to drive innovation across various industries.
-
FerretDB, an Open-Source Alternative to MongoDB, Releases Version 2.0
FerretDB has announced the first release candidate of version 2.0. Now powered by the recently released DocumentDB, FerretDB serves as an open-source alternative to MongoDB, bringing significant performance improvements, enhanced feature compatibility, vector search capabilities, and replication support.
-
OpenEuroLLM: Europe’s New Initiative for Open-Source AI Development
A consortium of 20 European research institutions, companies, and EuroHPC centers has launched OpenEuroLLM, an initiative to develop open-source, multilingual large language models (LLMs). Coordinated by Jan Hajič and co-led by Peter Sarlin, the project aims to provide transparent and compliant AI models for commercial and public sector applications.
-
How Data Contracts Support Collaboration between Data Teams
Data contracts define the interface between data providers and consumers, specifying things like data models, quality guarantees, and ownership. They are essential for distributed data ownership in data mesh, ensuring data is discoverable, interoperable, and governed. Data contracts improve communication between teams and enhance the reliability and quality of data products.
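As a rough illustration, a data contract can be written down as a small, checkable structure. The sketch below is an assumption about how one might look, not a standard or any particular tool's format: the `DataContract` class, its fields, and the `check` function are hypothetical names chosen for this example.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class DataContract:
    """Hypothetical contract: data model, ownership, and one quality guarantee."""
    name: str
    owner: str                      # team accountable for the data product
    schema: Dict[str, type]         # column name -> expected Python type
    max_null_fraction: float = 0.0  # quality guarantee on missing values


def check(contract: DataContract, rows: List[dict]) -> List[str]:
    """Return human-readable violations of the contract found in the rows."""
    problems: List[str] = []
    nulls = 0
    cells = 0
    for index, row in enumerate(rows):
        for column, expected_type in contract.schema.items():
            cells += 1
            value = row.get(column)
            if value is None:
                nulls += 1
            elif not isinstance(value, expected_type):
                problems.append(
                    f"row {index}: {column} is not {expected_type.__name__}"
                )
    if cells and nulls / cells > contract.max_null_fraction:
        problems.append(
            f"null fraction {nulls / cells:.2f} exceeds "
            f"{contract.max_null_fraction:.2f}"
        )
    return problems


orders = DataContract(
    name="orders",
    owner="sales-data-team",
    schema={"order_id": int, "amount": float},
)

print(check(orders, [{"order_id": 1, "amount": 9.5}]))
print(check(orders, [{"order_id": "1", "amount": 9.5}]))
```

A provider could run a check like this before publishing, and a consumer could run the same check on ingestion, so both sides share one definition of what valid data means.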
-
OpenAI Launches Deep Research: Advancing AI-Assisted Investigation
OpenAI has launched Deep Research, a new agent within ChatGPT designed to conduct in-depth, multi-step investigations across the web. Initially available to Pro users, with plans to expand access to Plus and Team users, Deep Research automates time-consuming research by retrieving, analyzing, and synthesizing online information.
-
DeepSeek Database Leaking Sensitive Information Highlights AI Security Risks
Cloud security firm Wiz uncovered an unprotected DeepSeek database that gave full control over database operations and access to internal data, including millions of lines of chat logs. While the vulnerability was quickly fixed, the incident shows the need for the AI industry to enforce higher security standards, says the company.
-
DeepSeek Open-Sources DeepSeek-R1 LLM with Performance Comparable to OpenAI's o1 Model
DeepSeek open-sourced DeepSeek-R1, an LLM fine-tuned with reinforcement learning (RL) to improve reasoning capability. DeepSeek-R1 achieves results on par with OpenAI's o1 model on several benchmarks, including MATH-500 and SWE-bench.
-
Hugging Face Expands Serverless Inference Options with New Provider Integrations
Hugging Face has integrated four serverless inference providers, Fal, Replicate, SambaNova, and Together AI, directly into its model pages. These providers are also integrated into Hugging Face's client SDKs for JavaScript and Python, allowing users to run inference on various models with minimal setup.
-
Block Launches Open-Source AI Framework Codename Goose
Block’s Open Source Program Office has launched Codename Goose, an open-source, non-commercial AI agent framework designed to automate tasks and integrate seamlessly with existing tools. Goose provides users with a flexible, on-machine AI assistant that can be customized through extensions, enabling developers and other professionals to enhance their productivity.
-
OpenAI Introduces ChatGPT Gov for U.S. Government Agencies
OpenAI has launched ChatGPT Gov, a version of its AI-powered chatbot designed specifically for U.S. government agencies. This tailored deployment provides federal, state, and local agencies with access to OpenAI’s latest AI models while allowing them to maintain control over security, privacy, and compliance.
-
Google's Vertex AI in Firebase SDK Now Ready for Production Use
Three months after its launch in beta, the Vertex AI in Firebase SDK is now ready for production, says Google engineer Thomas Ezan, who further explores three dimensions that are essential for its successful deployment to production: abuse prevention, remote configuration, and responsible AI use.