AI, ML & Data Engineering Content on InfoQ
-
Meta Open Sources LlamaFirewall for AI Agent Combined Protection
LlamaFirewall is a security framework aimed at safeguarding AI agents against prompt injection, goal misalignment, and insecure code generation. It achieved over 90% efficacy in reducing attack success rates when evaluated on the AgentDojo benchmark. Additionally, developers can update its behavior by adding new security guardrails.
-
Meta Announces API and Protection Tools at First LlamaCon Event
At Meta's first-ever LlamaCon event, the company announced several new tools for building with its Llama AI models: a limited preview of the Llama API that allows developers to experiment with different models, and new Llama Protection Tools for securing AI applications.
-
Google Introduces DolphinGemma to Support Dolphin Communication Research
Google has released a new AI model called DolphinGemma, which has been developed to assist researchers in analyzing and interpreting dolphin vocalizations. The project is part of an ongoing collaboration with the Wild Dolphin Project (WDP) and researchers at Georgia Tech, and it focuses on identifying patterns in the natural communication of Atlantic spotted dolphins.
-
OpenAI Introduces GPT‑4.1 Family with Enhanced Performance and Long-Context Support
OpenAI has released a new family of language models—GPT‑4.1, GPT‑4.1 mini, and GPT‑4.1 nano—available via its API. The models improve on GPT‑4o and GPT‑4.5 across several technical benchmarks and introduce support for up to 1 million tokens of context.
-
DeepSeek Launches Prover-V2 Open-Source LLM for Formal Math Proofs
DeepSeek has released DeepSeek-Prover-V2, a new open-source large language model specifically designed for formal theorem proving in Lean 4. The model builds on a recursive theorem proving pipeline powered by the company's DeepSeek-V3 foundation model.
-
Hugging Face to Democratize Robotics with Open-Source Reachy 2 Robot
Hugging Face has acquired Pollen Robotics, a French startup that developed the humanoid robot Reachy 2. The acquisition aims to make robotics more accessible by open-sourcing the robot’s design and allowing developers to modify and improve its code.
-
Google Cloud Announces Rapid Storage for Millisecond-Latency Workloads
At the recent Google Cloud Next 2025, the cloud provider announced Rapid Storage, a new Cloud Storage zonal bucket designed to deliver consistent single-digit millisecond data access for frequently accessed data and latency-sensitive applications. The new storage class provides under 1 ms random read and write latency, 20x faster data access, and 6 TB/s of throughput.
-
InfoQ Dev Summit Boston 2025: AI, Platforms, and Developer Experience
Software development is shifting fast. Senior engineers need real-world insights on AI, platforms, and developer autonomy. InfoQ Dev Summit Boston (June 9-10) offers 2 days with over 27 sessions of curated, technical talks delivered by engineers actively working at scale. We are focused on helping teams navigate the software evolution, with the clarity and context needed to make better decisions.
-
Meta Launches AutoPatchBench to Evaluate LLM Agents on Security Fixes
AutoPatchBench is a standardized benchmark designed to help researchers and developers evaluate and compare how effectively LLM agents can automatically patch security vulnerabilities in C/C++ native code.
-
Akka Launches New Deployment Options for Agentic AI
Akka has unveiled new deployment options to enhance flexibility for developers building resilient, distributed AI systems. Now offering self-managed nodes and self-hosted regions, Akka empowers enterprises to deploy across various infrastructures. This innovation positions Akka as a leader in agentic AI development, enabling scalable, stateful applications that adapt to rapidly changing demands.
-
From Architecture to Deployment: How AI-Powered Toolkits Are Unifying Developer Workflows
Developer tooling is undergoing a shift as AI moves beyond code completion to unify multiple stages of the software development workflow.
-
OpenAI Launches BrowseComp to Benchmark AI Agents' Web Search and Deep Research Skills
OpenAI has released BrowseComp, a new benchmark designed to test AI agents' ability to locate difficult-to-find information on the web. The benchmark contains 1,266 challenging problems that require agents to persistently navigate through multiple websites to retrieve entangled information.
-
Cloudflare Upgrades D1 Database with Global Read Replication
During the recent Developer Week 2025, Cloudflare announced the beta of global read replication for its serverless SQL database D1, providing a globally distributed option without sacrificing consistency. With automatically provisioned replicas in every region, applications can now serve read queries faster while maintaining strong sequential consistency across requests.
-
Google Unveils Ironwood TPU for AI Inference
Google's Ironwood TPU, its most advanced custom AI accelerator, powers the "age of inference" with unmatched performance and scalability. With up to 9,216 liquid-cooled chips, it outpaces competitors, delivering 42.5 exaflops. Engineered for high-efficiency, low-latency AI tasks, Ironwood redefines potential in AI hardware, leveraging AlphaChip to revolutionize chip design.
-
Cloudflare AutoRAG Streamlines Retrieval-Augmented Generation
Cloudflare has launched a managed service for using retrieval-augmented generation in LLM-based systems. Now in beta, Cloudflare AutoRAG aims to make it easier for developers to build pipelines that integrate rich context data into LLMs.