AI, ML & Data Engineering Content on InfoQ

News

AI, ML & Data Engineering

Meta Open Sources LlamaFirewall for AI Agent Combined Protection

LlamaFirewall is a security framework aimed at safeguarding AI agents against prompt injection, goal misalignment, and insecure code generation. It achieved over 90% efficacy in reducing attack success rates when evaluated on the AgentDojo benchmark. Additionally, developers can update its behavior by adding new security guardrails.

Sergio De Simone
on May 13, 2025
AI, ML & Data Engineering

Meta Announces API and Protection Tools at First LlamaCon Event

At its first-ever LlamaCon event, Meta announced several new tools for building with its Llama AI models: a limited preview of the Llama API that allows developers to experiment with different models, and new Llama Protection Tools for securing AI applications.

Anthony Alford
on May 13, 2025
AI, ML & Data Engineering

Google Introduces DolphinGemma to Support Dolphin Communication Research

Google has released a new AI model called DolphinGemma, which has been developed to assist researchers in analyzing and interpreting dolphin vocalizations. The project is part of an ongoing collaboration with the Wild Dolphin Project (WDP) and researchers at Georgia Tech, and it focuses on identifying patterns in the natural communication of Atlantic spotted dolphins.

Robert Krzaczyński
on May 13, 2025
AI, ML & Data Engineering

OpenAI Introduces GPT‑4.1 Family with Enhanced Performance and Long-Context Support

OpenAI has released a new family of language models—GPT‑4.1, GPT‑4.1 mini, and GPT‑4.1 nano—available via its API. The models improve on GPT‑4o and GPT‑4.5 across several technical benchmarks and introduce support for up to 1 million tokens of context.

Robert Krzaczyński
on May 12, 2025
AI, ML & Data Engineering

DeepSeek Launches Prover-V2 Open-Source LLM for Formal Math Proofs

DeepSeek has released DeepSeek-Prover-V2, a new open-source large language model specifically designed for formal theorem proving in Lean 4. The model builds on a recursive theorem proving pipeline powered by the company's DeepSeek-V3 foundation model.

Vinod Goje
on May 12, 2025
AI, ML & Data Engineering

Hugging Face to Democratize Robotics with Open-Source Reachy 2 Robot

Hugging Face has acquired Pollen Robotics, a French startup that developed the humanoid robot Reachy 2. The acquisition aims to make robotics more accessible by open-sourcing the robot’s design and allowing developers to modify and improve its code.

Daniel Dominguez
on May 10, 2025
Cloud

Google Cloud Announces Rapid Storage for Millisecond-Latency Workloads

At the recent Google Cloud Next 2025, the cloud provider announced Rapid Storage, a new Cloud Storage zonal bucket designed to deliver consistent single-digit millisecond data access for frequently accessed data and latency-sensitive applications. The new offering provides under 1 ms random read and write latency, 20x faster data access, and 6 TB/s of throughput.

Renato Losio
on May 10, 2025
Architecture & Design

InfoQ Dev Summit Boston 2025: AI, Platforms, and Developer Experience

Software development is shifting fast. Senior engineers need real-world insights on AI, platforms, and developer autonomy. InfoQ Dev Summit Boston (June 9-10) offers two days and more than 27 sessions of curated, technical talks delivered by engineers actively working at scale. The summit focuses on helping teams navigate the evolution of software, with the clarity and context needed to make better decisions.

Eder Ignatowicz
on May 09, 2025
AI, ML & Data Engineering

Meta Launches AutoPatchBench to Evaluate LLM Agents on Security Fixes

AutoPatchBench is a standardized benchmark designed to help researchers and developers evaluate and compare how effectively LLM agents can automatically patch security vulnerabilities in C/C++ native code.

Sergio De Simone
on May 07, 2025
Cloud

Akka Launches New Deployment Options for Agentic AI

Akka has unveiled new deployment options to enhance flexibility for developers building resilient, distributed AI systems. Now offering self-managed nodes and self-hosted regions, Akka empowers enterprises to deploy across various infrastructures. This innovation positions Akka as a leader in agentic AI development, enabling scalable, stateful applications that adapt to rapidly changing demands.

Steef-Jan Wiggers
on May 05, 2025
AI, ML & Data Engineering

From Architecture to Deployment: How AI-Powered Toolkits Are Unifying Developer Workflows

Developer tooling is undergoing a shift as AI moves beyond code completion to unify multiple stages of the software development workflow.

Matt Foster
on May 04, 2025
AI, ML & Data Engineering

OpenAI Launches BrowseComp to Benchmark AI Agents' Web Search and Deep Research Skills

OpenAI has released BrowseComp, a new benchmark designed to test AI agents' ability to locate difficult-to-find information on the web. The benchmark contains 1,266 challenging problems that require agents to persistently navigate through multiple websites to retrieve entangled information.

Vinod Goje
on May 04, 2025
Cloud

Cloudflare Upgrades D1 Database with Global Read Replication

During the recent Developer Week 2025, Cloudflare announced the beta of global read replication for its serverless SQL database D1, providing a globally distributed option without sacrificing consistency. With automatically provisioned replicas in every region, applications can now serve read queries faster while maintaining strong sequential consistency across requests.

Renato Losio
on May 03, 2025
Cloud

Google Unveils Ironwood TPU for AI Inference

Google's Ironwood TPU, its most advanced custom AI accelerator, powers the "age of inference" with unmatched performance and scalability. With up to 9,216 liquid-cooled chips, it outpaces competitors, delivering 42.5 exaflops. Engineered for high-efficiency, low-latency AI tasks, Ironwood redefines potential in AI hardware, leveraging AlphaChip to revolutionize chip design.

Steef-Jan Wiggers
on May 02, 2025
Cloud

Cloudflare AutoRAG Streamlines Retrieval-Augmented Generation

Cloudflare has launched a managed service for using retrieval-augmented generation in LLM-based systems. Now in beta, Cloudflare AutoRAG aims to make it easier for developers to build pipelines that integrate rich context data into LLMs.

Sergio De Simone
on Apr 30, 2025
