Machine Learning Content on InfoQ
-
Optimize AI Workloads: Google Cloud’s Tips and Tricks
Google Cloud has announced a suite of new tools and features designed to help organizations reduce costs and improve the efficiency of AI workloads across their cloud infrastructure. The announcement comes as enterprises increasingly seek ways to optimize spending on AI initiatives while maintaining performance and scalability.
-
Announcing QCon AI: Focusing on Practical, Scalable AI Implementation for Engineering Teams
QCon AI focuses on practical, real-world AI for senior developers, architects, and engineering leaders. Join us Dec 16-17, 2025, in NYC to learn how teams are building and scaling AI in production—covering MLOps, system reliability, cost optimization, and more. No hype, just actionable insights from those doing the work.
-
How SREs and GenAI Work Together to Decrease eBay's Downtime: an Architect's Insights at KubeCon EU
During his KubeCon EU keynote, Vijay Samuel, Principal MTS Architect at eBay, shared his team’s experience of enhancing incident response capabilities by incorporating ML and LLM building blocks. They realised that GenAI is not a silver bullet, but that it can help engineers work through complex incident investigations by explaining logs, traces, and dashboards.
-
Recap of Cloudflare Security Week 2025: From Quantum Cryptography to AI Labyrinth
During the recent Cloudflare Security Week 2025, the cloud provider announced various improvements to its cybersecurity services, along with multiple reports analyzing trends and challenges in security threats. Additionally, it announced AI Labyrinth, a honeypot-style defense against unauthorized crawlers, and Cloudflare for AI, a suite of tools aimed at supporting the adoption of secure AI technologies.
-
Azure AI Foundry Supports NVIDIA NIM and AgentIQ for AI Agents
Microsoft and NVIDIA have teamed up to integrate NVIDIA NIM microservices and AgentIQ into Azure AI Foundry, streamlining AI agent application development. This partnership accelerates project lifecycles, optimizing performance and reducing costs. The toolkit enhances AI efficiency through real-time telemetry, enabling effortless deployment and advanced functionalities for developers.
-
Google DeepMind Unveils Gemini Robotics
Google DeepMind has introduced Gemini Robotics, an advanced AI model designed to enhance robotics by integrating vision, language, and action. This innovation, based on the Gemini 2.0 framework, aims to make robots smarter and more capable, particularly in real-world settings.
-
Meta Unifies Facebook’s Video Delivery System across Mobile and Web Apps
Meta finalized efforts to consolidate Facebook’s video delivery system by migrating video experiences from the older Watch product to the more recent Reels product, which became the basis of the unified system. The unification required changes across the mobile UI, server backend, and ranking systems while ensuring a seamless transition for billions of users.
-
instructlab.ai Uses Synthetic Data to Reduce Complexity of Fine-Tuning LLMs
InstructLab.ai implements the Large-scale Alignment for chatBots (LAB) concept, which aims to overcome scalability challenges in the instruction-tuning phase of large language models (LLMs). Its approach leverages a synthetic-data-based alignment tuning method: hand-crafted taxonomies provide the seeds for synthesizing training data, reducing the need for human-annotated data.
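As a rough illustration of how taxonomy seeds can drive synthetic data generation, the sketch below expands a handful of curated question/answer pairs into a generation prompt for a teacher model. It is written in the spirit of LAB, not InstructLab’s actual code; the skill name, prompt format, and function names are assumptions.

```python
# Illustrative sketch of taxonomy-seeded synthetic data generation
# (not InstructLab's actual code; names here are assumptions).
from dataclasses import dataclass

@dataclass
class SeedExample:
    """A human-curated question/answer pair from a taxonomy leaf node."""
    question: str
    answer: str

def build_generation_prompt(skill: str, seeds: list[SeedExample], n: int) -> str:
    """Ask a teacher LLM to produce n new Q/A pairs in the style of the seeds."""
    shots = "\n\n".join(f"Q: {s.question}\nA: {s.answer}" for s in seeds)
    return (
        f"You are generating training data for the skill: {skill}.\n"
        f"Here are curated examples:\n\n{shots}\n\n"
        f"Write {n} new, diverse question/answer pairs in the same format."
    )

# Example taxonomy leaf: a "unit conversion" skill with two seed examples.
seeds = [
    SeedExample("Convert 3 km to miles.", "3 km is about 1.86 miles."),
    SeedExample("How many grams are in 2.5 kg?", "2.5 kg is 2500 grams."),
]
prompt = build_generation_prompt("unit conversion", seeds, n=20)
print(prompt)  # This prompt would be sent to a teacher model; the returned
               # pairs would then be filtered before alignment tuning.
```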
-
OpenAI Features New o3-mini Model on Microsoft Azure OpenAI Service
OpenAI has launched the advanced o3-mini model via Microsoft Azure, enhancing AI applications with improved cost efficiency, faster performance, and adjustable reasoning capabilities. Designed for complex tasks, it supports structured outputs and backward compatibility. With widespread access, o3-mini empowers developers to drive innovation across various industries.
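For developers who want to try the adjustable reasoning feature, a minimal sketch of calling an o3-mini deployment through the openai Python SDK’s Azure client might look like the following; the endpoint, API version, and deployment name are placeholders, not values from the announcement.

```python
# Minimal sketch: calling an Azure OpenAI o3-mini deployment with the openai SDK.
# The endpoint, API version, and deployment name below are placeholders.
import os
from openai import AzureOpenAI

client = AzureOpenAI(
    azure_endpoint="https://<your-resource>.openai.azure.com",
    api_key=os.environ["AZURE_OPENAI_API_KEY"],
    api_version="2024-12-01-preview",  # assumed preview API version
)

response = client.chat.completions.create(
    model="o3-mini",                # name of the Azure deployment
    reasoning_effort="medium",      # adjustable reasoning: "low", "medium", or "high"
    messages=[
        {"role": "user", "content": "Outline a migration plan from o1-mini to o3-mini."}
    ],
)
print(response.choices[0].message.content)
```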
-
OpenEuroLLM: Europe’s New Initiative for Open-Source AI Development
A consortium of 20 European research institutions, companies, and EuroHPC centers has launched OpenEuroLLM, an initiative to develop open-source, multilingual large language models (LLMs). Coordinated by Jan Hajič and co-led by Peter Sarlin, the project aims to provide transparent and compliant AI models for commercial and public sector applications.
-
OpenAI Launches Deep Research: Advancing AI-Assisted Investigation
OpenAI has launched Deep Research, a new agent within ChatGPT designed to conduct in-depth, multi-step investigations across the web. Initially available to Pro users, with plans to expand access to Plus and Team users, Deep Research automates time-consuming research by retrieving, analyzing, and synthesizing online information.
-
Block Launches Open-Source AI Framework Codename Goose
Block’s Open Source Program Office has launched Codename Goose, an open-source, non-commercial AI agent framework designed to automate tasks and integrate seamlessly with existing tools. Goose provides users with a flexible, on-machine AI assistant that can be customized through extensions, enabling developers and other professionals to enhance their productivity.
-
AMD and Johns Hopkins Researchers Develop AI Agent Framework to Automate Scientific Research Process
Researchers from AMD and Johns Hopkins University have developed Agent Laboratory, an artificial intelligence framework that automates core aspects of the scientific research process. The system uses large language models to handle literature reviews, experimentation, and report writing, producing both code repositories and research documentation.
-
Synthetic Data Generator Simplifies Dataset Creation with Large Language Models
Hugging Face has introduced the Synthetic Data Generator, a new tool leveraging large language models (LLMs) that offers a streamlined, no-code approach to creating custom datasets. The tool facilitates the creation of text classification and chat datasets through a clear and accessible process, making it usable for both non-technical users and experienced AI practitioners.
-
Using Machine Learning on Microcontrollers: Decreasing Memory and CPU Usage to Save Power and Cost
According to Eirik Midttun, artificial intelligence (AI) and machine learning (ML) are useful tools for interpreting sensor data, especially when the input is complex, such as vibration, voice, and vision. The main challenges of using machine learning on microcontrollers are the limited computing power available and the cost requirements that come with microcontroller-based designs.
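The talk’s specific techniques are not reproduced here, but one common way to cut memory and CPU use on such devices is to convert a trained model to an 8-bit quantized TensorFlow Lite flatbuffer before deployment. The sketch below assumes a small stand-in Keras model and synthetic calibration data, not the models discussed in the talk.

```python
# Illustrative sketch: shrinking a trained Keras model for a microcontroller
# with post-training int8 quantization via TensorFlow Lite.
import numpy as np
import tensorflow as tf

# A tiny stand-in model; in practice this would be the trained sensor model.
model = tf.keras.Sequential([
    tf.keras.layers.Input(shape=(64,)),              # e.g. 64 features from a vibration window
    tf.keras.layers.Dense(16, activation="relu"),
    tf.keras.layers.Dense(3, activation="softmax"),  # e.g. 3 fault classes
])

def representative_data():
    # Representative samples calibrate the int8 quantization ranges.
    for _ in range(100):
        yield [np.random.rand(1, 64).astype(np.float32)]

converter = tf.lite.TFLiteConverter.from_keras_model(model)
converter.optimizations = [tf.lite.Optimize.DEFAULT]
converter.representative_dataset = representative_data
converter.target_spec.supported_ops = [tf.lite.OpsSet.TFLITE_BUILTINS_INT8]
converter.inference_input_type = tf.int8
converter.inference_output_type = tf.int8

tflite_model = converter.convert()
print(f"Quantized model size: {len(tflite_model)} bytes")
# The resulting flatbuffer can be embedded in firmware as a C array and run
# with an on-device runtime such as TensorFlow Lite for Microcontrollers.
```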