AI, ML & Data Engineering Content on InfoQ

News

AI, ML & Data Engineering

Windsurf Launches SWE-1 Family of Models for Software Engineering

Windsurf has introduced its first set of SWE-1 models, aimed at supporting the full range of software engineering tasks rather than code generation alone. The lineup consists of three models, SWE-1, SWE-1-lite, and SWE-1-mini, each designed for specific scenarios.

Daniel Dominguez
on May 19, 2025
AI, ML & Data Engineering

OpenAI’s Stargate Project Aims to Build AI Infrastructure in Partner Countries Worldwide

OpenAI has announced a new initiative called "OpenAI for Countries" as part of its Stargate project, aiming to help nations develop AI infrastructure based on democratic principles. This expansion follows the company's initial $500 billion investment plan for AI infrastructure in the United States.

Vinod Goje
on May 18, 2025
AI, ML & Data Engineering

Llama 4 Scout and Maverick Now Available on Amazon Bedrock and SageMaker JumpStart

AWS recently announced the availability of Meta's latest foundation models, Llama 4 Scout and Llama 4 Maverick, in Amazon Bedrock and Amazon SageMaker JumpStart. Both models provide multimodal capabilities and use a mixture-of-experts architecture.

Sergio De Simone
on May 18, 2025
AI, ML & Data Engineering

Mistral Unveils Medium 3: Enterprise-Ready Language Model

Mistral AI has unveiled Mistral Medium 3, a mid-sized language model aimed at enterprises seeking a balance between cost-efficiency, strong performance, and flexible deployment options. The model is now available through Mistral’s platform and Amazon SageMaker, with further releases planned for IBM WatsonX, Azure AI Foundry, Google Cloud Vertex AI, and NVIDIA NIM.

Robert Krzaczyński
on May 16, 2025
AI, ML & Data Engineering

CMU Researchers Introduce LegoGPT: Building Stable LEGO Structures from Text Prompts

Researchers at Carnegie Mellon University have introduced LegoGPT, a system that generates physically stable and buildable LEGO® structures from natural language descriptions. The project combines large language models with engineering constraints to produce designs that can be assembled manually or by robotic systems.

Robert Krzaczyński
on May 14, 2025
DevOps

Google Cloud Enhances AI/ML Workflows with Hierarchical Namespace in Cloud Storage

On March 17, 2025, Google Cloud introduced a hierarchical namespace (HNS) feature in Cloud Storage, aiming to optimize AI and machine learning (ML) workloads by improving data organization, performance, and reliability.

Craig Risi
on May 14, 2025
AI, ML & Data Engineering

Anthropic Introduces Web Search Functionality for Claude Models

Anthropic has announced the addition of web search capabilities to its Claude models, available via the Anthropic API. This update enables Claude to access current information from the web, allowing developers to create applications and AI agents that provide up-to-date insights.

Daniel Dominguez
on May 14, 2025
AI, ML & Data Engineering

Meta Open Sources LlamaFirewall for AI Agent Combined Protection

LlamaFirewall is a security framework aimed at safeguarding AI agents against prompt injection, goal misalignment, and insecure code generation. It achieved over 90% efficacy in reducing attack success rates when evaluated on the AgentDojo benchmark. Additionally, developers can update its behavior by adding new security guardrails.

Sergio De Simone
on May 13, 2025
AI, ML & Data Engineering

Meta Announces API and Protection Tools at First LlamaCon Event

At Meta's first-ever LlamaCon event, the company announced several new tools for building with its Llama AI models: a limited preview of the Llama API, which allows developers to experiment with different models, and new Llama Protection Tools for securing AI applications.

Anthony Alford
on May 13, 2025
AI, ML & Data Engineering

Google Introduces DolphinGemma to Support Dolphin Communication Research

Google has released a new AI model called DolphinGemma, which has been developed to assist researchers in analyzing and interpreting dolphin vocalizations. The project is part of an ongoing collaboration with the Wild Dolphin Project (WDP) and researchers at Georgia Tech, and it focuses on identifying patterns in the natural communication of Atlantic spotted dolphins.

Robert Krzaczyński
on May 13, 2025
AI, ML & Data Engineering

OpenAI Introduces GPT‑4.1 Family with Enhanced Performance and Long-Context Support

OpenAI has released a new family of language models—GPT‑4.1, GPT‑4.1 mini, and GPT‑4.1 nano—available via its API. The models improve on GPT‑4o and GPT‑4.5 across several technical benchmarks and introduce support for up to 1 million tokens of context.

Robert Krzaczyński
on May 12, 2025
AI, ML & Data Engineering

DeepSeek Launches Prover-V2 Open-Source LLM for Formal Math Proofs

DeepSeek has released DeepSeek-Prover-V2, a new open-source large language model specifically designed for formal theorem proving in Lean 4. The model builds on a recursive theorem proving pipeline powered by the company's DeepSeek-V3 foundation model.

Vinod Goje
on May 12, 2025
AI, ML & Data Engineering

Hugging Face to Democratize Robotics with Open-Source Reachy 2 Robot

Hugging Face has acquired Pollen Robotics, a French startup that developed the humanoid robot Reachy 2. The acquisition aims to make robotics more accessible by open-sourcing the robot’s design and allowing developers to modify and improve its code.

Daniel Dominguez
on May 10, 2025
Cloud

Google Cloud Announces Rapid Storage for Millisecond-Latency Workloads

At the recent Google Cloud Next 2025, the cloud provider announced Rapid Storage, a new Cloud Storage zonal bucket designed to deliver consistent single-digit millisecond data access for frequently accessed data and latency-sensitive applications. The new storage class provides under 1 ms random read and write latency, 20x faster data access, and 6 TB/s of throughput.

Renato Losio
on May 10, 2025
Architecture & Design

InfoQ Dev Summit Boston 2025: AI, Platforms, and Developer Experience

Software development is shifting fast. Senior engineers need real-world insights on AI, platforms, and developer autonomy. InfoQ Dev Summit Boston (June 9-10) offers 2 days with over 27 sessions of curated, technical talks delivered by engineers actively working at scale. We are focused on helping teams navigate the software evolution, with the clarity and context needed to make better decisions.

Eder Ignatowicz
on May 09, 2025
