Platform Engineering Content on InfoQ
-
How Discord Scaled its ML Platform from Single-GPU Workflows to a Shared Ray Cluster
Discord has detailed how it rebuilt its machine learning platform after hitting the limits of single-GPU training. The changes enabled daily retrains for large models and contributed to a 200% uplift in a key ads ranking metric.
-
Enhancing Reliability Using Service-Level Prioritized Load Shedding: Netflix at QCon SF 2025
At QCon San Francisco, Netflix engineers unveiled their advanced Service-Level-Prioritized Load-Shedding strategy, enhancing reliability during traffic spikes. By prioritizing high-value requests and automating management across microservices, they safeguard user experience and system stability. Key insights stress prioritization, automation, and structured load shedding for optimal resilience.
-
Crossplane Reaches Production Maturity by Graduating CNCF
The Cloud Native Computing Foundation (CNCF) has graduated Crossplane, marking a major milestone for the open-source project that turns Kubernetes into a universal control plane for cloud infrastructure. For practitioners, it signals that Crossplane is no longer an experimental idea but a production-hardened foundation for building internal platforms.
-
QCon London 2026 Announces Tracks: AI Engineering, Building Teams, Tech of Finance, and More
The QCon London 2026 tracks are live: 15 practitioner-curated deep dives on AI adoption, resilient architectures, distributed systems, performance, modern languages, data, security, and Staff+ leadership, rooted in real production lessons.
-
Inside the Architectures Powering Modern AI Systems: QCon San Francisco 2025
Senior engineers face fast-moving AI adoption without clear patterns. QCon SF 2025 brings real-world lessons from teams at Netflix, Meta, Intuit, Anthropic & more, showing how to build reliable AI systems at scale. Early bird ends Nov 11.
-
HashiCorp Warns Traditional Secret Scanning Tools are Falling Behind
HashiCorp has issued a warning that traditional secret scanning tools are failing to keep up with the realities of modern software development. In a new blog post, the company argues that post-commit detection and brittle pattern matching leave dangerous gaps in coverage.
-
QCon AI New York 2025 Schedule Published, Highlights Practical Enterprise AI
The QCon AI New York 2025 schedule is now live for its Dec 16-17 event. Focused on moving AI from PoC to production, the program offers a practical roadmap for senior engineers & tech leaders. It addresses the real-world challenges of building, scaling, and deploying reliable, enterprise-grade AI systems, helping organizations overcome the hurdles of productionizing their AI initiatives.
-
11 Sessions Not to Miss at QCon San Francisco 2025
As QCon San Francisco (Nov 17-21, 2025) approaches, the conference's program committee and track hosts are sharing their top picks from this year's lineup. Their selections span a wide range of topics, from AI-accelerated development and platform engineering to resilience patterns and career growth, all with QCon's signature focus on real-world case studies and lessons learned.
-
Pulumi Launches Neo: an Agentic AI Platform Engineer for Multi-Cloud Infrastructure
Infrastructure automation company Pulumi has introduced what's claimed to be the first artificial intelligence-based platform engineering agent for the industry, named Neo. The tool works to resolve some of the infrastructure bottlenecks that develop as a side effect of AI tools speeding up software development.
-
Gitpod Rebrands to Ona, Aiming to Become the AI-Powered Center of Software Development
Gitpod, known for offering browser-based cloud development environments, has rebranded as Ona, signaling a major shift in its vision from IDE-centric workflows to AI-driven software engineering.
-
InfoQ Dev Summit Munich 2025: Master the 'How' with Deep-Dive, Practitioner-Led Guidance
At InfoQ Dev Summit Munich (Oct 15-16), learn directly from the senior engineers building complex systems. This practitioner-led conference offers deep dives on real-world implementation patterns from software leaders at Allianz, Skyscanner, Zalando, and Delivery Hero.
-
Google Cloud Launches 'Cloud Setup' to Streamline Foundational Infrastructure
Google Cloud has launched Google Cloud Setup, a streamlined service for creating secure, best-practice cloud environments. It offers guided workflows for proof-of-concept, production, and enhanced-security needs, reducing manual effort and enabling application deployment in minutes rather than days. Configuration comes with built-in best practices and cost-effective access.
-
Uber Unveils Multi-Cloud Secrets Management Platform to Secure 150,000+ Credentials
Uber has revealed details of its internally developed Multi-Cloud Secrets Management Platform, designed to address the security challenges of managing over 150,000 secrets across its massive distributed infrastructure. The platform represents a significant evolution in how large-scale technology companies approach credential security in multi-cloud environments.
-
Cisco Reveals JARVIS: an AI Assistant for Platform-Engineering Teams
Cisco has introduced JARVIS, an AI-powered assistant for platform-engineering workflows. Integrated with 40+ tools, JARVIS automates complex tasks, reducing project timelines from weeks to hours. It is powered by a hybrid AI architecture intended to ensure accuracy and reliability while enhancing productivity.
-
InfoQ Dev Summit Boston 2025: AI, Platforms, and Developer Experience
Software development is shifting fast. Senior engineers need real-world insights on AI, platforms, and developer autonomy. InfoQ Dev Summit Boston (June 9-10) offers 2 days with over 27 sessions of curated, technical talks delivered by engineers actively working at scale. We are focused on helping teams navigate the software evolution, with the clarity and context needed to make better decisions.