InfoQ Homepage AI, ML & Data Engineering Content on InfoQ
-
Synthetic Data Generator Simplifies Dataset Creation with Large Language Models
Hugging Face has introduced the Synthetic Data Generator, a new tool leveraging Large Language Models (LLMs), that offers a streamlined, no-code approach to creating custom datasets. The tool facilitates the creation of text classification and chat datasets through a clear and accessible process, making it usable for both non-technical users and experienced AI practitioners.
-
OpenAI Presents Research on Inference-Time Compute to Better AI Security
OpenAI presented Trading Inference-Time Compute for Adversarial Robustness, a research paper that investigates the relationship between inference-time compute and the robustness of AI models against adversarial attacks.
-
Databases in 2024: Growth, Change and Controversy
Andrew Pavlo’s annual retrospective on the database world has recently been released, covering trends and innovations from the past year. The opinionated report, "Databases in 2024: A Year in Review," highlights that while we may indeed be in the "golden era of databases," last year brought significant license changes, the rapid growth of DuckDB, and some surprising new releases.
-
Microsoft Phi-4 is a Small Language Model Specialized for Complex Math Reasoning
Phi-4 is 14B parameter model from Microsoft Research that aims to improve the state of the art for math reasoning. Previously available on Azure AI Foundry, Phi-4 has recently become available on Hugging Face under the MIT license.
-
Amazon Bedrock Introduces Multi-Agent Systems (MAS) with Open Source Framework Integration
Amazon Web Services has released a multi-agent collaboration capability for Amazon Bedrock, introducing a framework for deploying and managing multiple AI agents that collaborate on complex tasks. The system enables specialized agents to work together under a supervisor agent's coordination, addressing challenges developers face with agent orchestration in distributed AI systems.
-
Microsoft Research Unveils rStar-Math: Advancing Mathematical Reasoning in Small Language Models
Microsoft Research unveiled rStar-Math, a framework that demonstrates the ability of small language models (SLMs) to achieve mathematical reasoning capabilities comparable to, and in some cases exceeding, larger models like OpenAI's o1-mini. This is accomplished without the need for more advanced models, representing a novel approach to enhancing the inference capabilities of AI.
-
Nvidia Ingest Aims to Make it Easier to Extract Structured Information from Documents
Nvidia Ingest is a new microservice aimed at processing document content and extracting metadata into a well-defined JSON schema. Ingest is able to process PDFs, Word, and PowerPoint documents and extract structured information from tables, charts, images, and text using optical character recognition.
-
Microsoft Research AI Frontiers Lab Launches AutoGen v0.4 Library
Microsoft Research’s AI Frontiers Lab has announced the release of AutoGen version 0.4, an open-source framework designed to build advanced AI agent systems. This latest version as stated marks the complete redesign of the AutoGen library, focusing on enhancing code quality, robustness, usability, and the scalability of agent workflows.
-
DeepSeek Open-Sources DeepSeek-V3, a 671B Parameter Mixture of Experts LLM
DeepSeek open-sourced DeepSeek-V3, a Mixture-of-Experts (MoE) LLM containing 671B parameters. It was pre-trained on 14.8T tokens using 2.788M GPU hours and outperforms other open-source models on a range of LLM benchmarks, including MMLU, MMLU-Pro, and GPQA.
-
Google Releases Experimental AI Reasoning Model
Google has introduced Gemini 2.0 Flash Thinking Experimental, an AI reasoning model available in its AI Studio platform.
-
Google Vertex AI Provides RAG Engine for Large Language Model Grounding
Vertex AI RAG Engine is a managed orchestration service aimed to make it easier to connect large language models (LLMs) to external data sources to be more up-to-date, generate more relevant responses, and hallucinate less.
-
Apache Hudi 1.0 Now Generally Available
The Apache Software Foundation has recently announced the general availability of Apache Hudi 1.0, the transactional data lake platform with support for near real-time analytics. Initially introduced in 2017, Apache Hudi provides an open table format optimized for efficient writes in incremental data pipelines and fast query performance.
-
Major LLMs Have the Capability to Pursue Hidden Goals, Researchers Find
Researchers at AI safety firm Apollo Research found that AI agents may covertly pursue misaligned goals and hide their true objectives. Known as in-context scheming, this behavior does not seem to be accidental as LLMs explicitly reason about deceptive strategies and consider them a viable strategy.
-
Microsoft Research Introduces AIOpsLab: a Framework for AI-Driven Cloud Operations
Microsoft Research unveiled AIOpsLab, an open-source framework designed to advance the development and evaluation of AI agents for cloud operations. The tool provides a standardized and scalable platform to address challenges in fault diagnosis, incident mitigation, and system reliability within complex cloud environments.
-
Shaping an Impactful Data Product Strategy
Lior Barak and Gaëlle Seret advocate proactive, business-focused strategies for data engineering. Barak proposes a 3-year roadmap using his Data Ecosystem Vision Board to align teams on strategic capabilities and measure ROI, cost, and impact. Seret promotes a "data as a product" approach, co-creating visions with stakeholders and evolving shared taxonomies to ensure long-term alignment.