InfoQ Homepage AI, ML & Data Engineering Content on InfoQ

News

RSS Feed

Newer Older

AI, ML & Data Engineering

OpenAI Presents Research on Inference-Time Compute to Better AI Security

OpenAI presented Trading Inference-Time Compute for Adversarial Robustness, a research paper that investigates the relationship between inference-time compute and the robustness of AI models against adversarial attacks.

Daniel Dominguez
on Jan 25, 2025
AI, ML & Data Engineering

Databases in 2024: Growth, Change and Controversy

Andrew Pavlo’s annual retrospective on the database world has recently been released, covering trends and innovations from the past year. The opinionated report, "Databases in 2024: A Year in Review," highlights that while we may indeed be in the "golden era of databases," last year brought significant license changes, the rapid growth of DuckDB, and some surprising new releases.

Renato Losio
on Jan 25, 2025
AI, ML & Data Engineering

Microsoft Phi-4 is a Small Language Model Specialized for Complex Math Reasoning

Phi-4 is 14B parameter model from Microsoft Research that aims to improve the state of the art for math reasoning. Previously available on Azure AI Foundry, Phi-4 has recently become available on Hugging Face under the MIT license.

Sergio De Simone
on Jan 24, 2025
AI, ML & Data Engineering

Amazon Bedrock Introduces Multi-Agent Systems (MAS) with Open Source Framework Integration

Amazon Web Services has released a multi-agent collaboration capability for Amazon Bedrock, introducing a framework for deploying and managing multiple AI agents that collaborate on complex tasks. The system enables specialized agents to work together under a supervisor agent's coordination, addressing challenges developers face with agent orchestration in distributed AI systems.

Vinod Goje
on Jan 23, 2025
AI, ML & Data Engineering

Microsoft Research Unveils rStar-Math: Advancing Mathematical Reasoning in Small Language Models

Microsoft Research unveiled rStar-Math, a framework that demonstrates the ability of small language models (SLMs) to achieve mathematical reasoning capabilities comparable to, and in some cases exceeding, larger models like OpenAI's o1-mini. This is accomplished without the need for more advanced models, representing a novel approach to enhancing the inference capabilities of AI.

Robert Krzaczyński
on Jan 23, 2025
AI, ML & Data Engineering

Nvidia Ingest Aims to Make it Easier to Extract Structured Information from Documents

Nvidia Ingest is a new microservice aimed at processing document content and extracting metadata into a well-defined JSON schema. Ingest is able to process PDFs, Word, and PowerPoint documents and extract structured information from tables, charts, images, and text using optical character recognition.

Sergio De Simone
on Jan 22, 2025
.NET

Microsoft Research AI Frontiers Lab Launches AutoGen v0.4 Library

Microsoft Research’s AI Frontiers Lab has announced the release of AutoGen version 0.4, an open-source framework designed to build advanced AI agent systems. This latest version as stated marks the complete redesign of the AutoGen library, focusing on enhancing code quality, robustness, usability, and the scalability of agent workflows.

Almir Vuk
on Jan 22, 2025
AI, ML & Data Engineering

DeepSeek Open-Sources DeepSeek-V3, a 671B Parameter Mixture of Experts LLM

DeepSeek open-sourced DeepSeek-V3, a Mixture-of-Experts (MoE) LLM containing 671B parameters. It was pre-trained on 14.8T tokens using 2.788M GPU hours and outperforms other open-source models on a range of LLM benchmarks, including MMLU, MMLU-Pro, and GPQA.

Anthony Alford
on Jan 21, 2025
AI, ML & Data Engineering

Google Releases Experimental AI Reasoning Model

Google has introduced Gemini 2.0 Flash Thinking Experimental, an AI reasoning model available in its AI Studio platform.

Daniel Dominguez
on Jan 21, 2025
AI, ML & Data Engineering

Google Vertex AI Provides RAG Engine for Large Language Model Grounding

Vertex AI RAG Engine is a managed orchestration service aimed to make it easier to connect large language models (LLMs) to external data sources to be more up-to-date, generate more relevant responses, and hallucinate less.

Sergio De Simone
on Jan 20, 2025
AI, ML & Data Engineering

Apache Hudi 1.0 Now Generally Available

The Apache Software Foundation has recently announced the general availability of Apache Hudi 1.0, the transactional data lake platform with support for near real-time analytics. Initially introduced in 2017, Apache Hudi provides an open table format optimized for efficient writes in incremental data pipelines and fast query performance.

Renato Losio
on Jan 18, 2025
AI, ML & Data Engineering

Major LLMs Have the Capability to Pursue Hidden Goals, Researchers Find

Researchers at AI safety firm Apollo Research found that AI agents may covertly pursue misaligned goals and hide their true objectives. Known as in-context scheming, this behavior does not seem to be accidental as LLMs explicitly reason about deceptive strategies and consider them a viable strategy.

Sergio De Simone
on Jan 17, 2025
AI, ML & Data Engineering

Microsoft Research Introduces AIOpsLab: a Framework for AI-Driven Cloud Operations

Microsoft Research unveiled AIOpsLab, an open-source framework designed to advance the development and evaluation of AI agents for cloud operations. The tool provides a standardized and scalable platform to address challenges in fault diagnosis, incident mitigation, and system reliability within complex cloud environments.

Robert Krzaczyński
on Jan 16, 2025
Culture & Methods

Shaping an Impactful Data Product Strategy

Lior Barak and Gaëlle Seret advocate proactive, business-focused strategies for data engineering. Barak proposes a 3-year roadmap using his Data Ecosystem Vision Board to align teams on strategic capabilities and measure ROI, cost, and impact. Seret promotes a "data as a product" approach, co-creating visions with stakeholders and evolving shared taxonomies to ensure long-term alignment.

Rafiq Gemmail
on Jan 15, 2025
AI, ML & Data Engineering

HuatuoGPT-o1: Advancing Complex Medical Reasoning with AI

Researchers from The Chinese University of Hong Kong, Shenzhen, and the Shenzhen Research Institute of Big Data have introduced HuatuoGPT-o1, a medical large language model (LLM) designed to improve reasoning in complex healthcare scenarios.

Robert Krzaczyński
on Jan 14, 2025

Newer News

Older News

InfoQ Software Architects' Newsletter

News