InfoQ Homepage DevOps Content on InfoQ
-
Google Go Module Mirror Served Backdoor for 3+ Years
In February 2025, researchers at Socket uncovered a significant supply chain attack within the Go programming ecosystem. A malicious package, named github.com/boltdb-go/bolt, was discovered impersonating the legitimate and widely-used BoltDB module.
-
AWS Introduces MCP Servers for AI-Assisted Cloud Development
AWS has launched the open-source Model Context Protocol (MCP) Servers, revolutionizing AI-powered code assistants. These servers enhance development speed and security, ensuring adherence to AWS best practices. With features like automated Infrastructure as Code and cost insights, MCP democratizes AWS expertise and empowers developers to optimize cloud solutions effortlessly.
-
Amazon VPC Route Server Generally Available, Providing Routing Flexibility and Fault Tolerance
AWS has recently announced the general availability of Amazon VPC Route Server. This new option simplifies dynamic routing in a VPC, allowing developers to advertise routing information via Border Gateway Protocol (BGP) from virtual appliances and dynamically update the VPC route tables associated with subnets and internet gateways.
-
QCon London 2025: Hybrid Cloud-Native Networking in Enterprise - Some Assembly Required
In an engaging talk at QCon London 2025, Louis Ryan, CTO of Solo.io and co-creator of Istio, addressed the complexities of hybrid cloud-native networking. He emphasized intentional assembly of network components, critical evaluation of tools, and treating networking as a primary focus to ensure reliability, observability, and security in today's intricate enterprise environments.
-
The Open-Source Version of InfluxDB 3 Reaches GA
Two years after releasing the GA version of InfluxData’s enterprise edition, their open-source version also reached that level of maturity. Conceptualised for real-time workloads and ease of running, the core version leaves aside features like long-term storage optimisations, compaction or high availability (HA), read replicas, or fine-grained access controls.
-
QCon London 2025: Applying Domain-Driven Design at Scale
At QCon London 2025, Vanderbijl unveiled how domain-driven design transformed a chaotic healthcare platform into a coherent business architecture. Through innovative strategies like "Take That" and "Robbie Williams," the team tackled architectural complexity, emphasizing adaptability and continuous improvement. This journey illustrates DDD as an evolving process essential for sustainable growth.
-
QCon London: In an Enterprise Ecosystem Your Platform Is Not an Island
In a talk at QCon London, Rachael Wonnacott explained the challenges in building a developer platform in an organisation with legacy processes and how a golden path leading to either a Kubernetes Hotel or a Public Cloud House might be necessary.
-
Google Cloud Introduces Multi-Cluster Orchestrator for Cross-Region Kubernetes Workloads
Google Cloud has announced the launch of Multi-Cluster Orchestrator (MCO), a new solution designed to simplify the deployment and management of Kubernetes workloads across multiple clusters spanning different regions. The tool aims to address challenges organizations face when operating applications across geographically distributed environments.
-
Red Hat Boosts AI across the Hybrid Cloud with Red Hat AI
Red Hat has recently announced enhancements to its Red Hat AI portfolio, aiming to accelerate the development and deployment of artificial intelligence (AI) solutions across hybrid cloud environments. This initiative focuses on integrating AI into enterprise operations, offering tools that support both predictive and generative AI models.
-
QCon London 2025: the Origin Story of AMQP - Advanced Message Queuing Politics
Join John O'Hara, creator of the Advanced Message Queuing Protocol (AMQP), as he shares the compelling journey of this groundbreaking technology at QCon London. Discover the intricate dynamics of collaboration, challenges faced, and the human element in open standards. O'Hara's insights illuminate the politics behind technology development, proving vision is as vital as innovation.
-
How Meta Uses Precision Time Protocol to Handle Leap Seconds
For systems that require strict synchronization—like distributed databases, telemetry pipelines, or event-driven architectures—handling leap seconds incorrectly can lead to data loss, duplication, or inconsistencies. As such, managing leap seconds accurately ensures system reliability and consistency across environments that depend on high-precision time.
-
QCon London 2025: Insights from 20+ Years in Mission-Critical Infrastructure
Matthew Liste, head of infrastructure at American Express, shared insights at QCon London 2025 on building robust cloud platforms in financial services. With 20+ years of experience, he emphasized stability, security, scalability, the value of interchangeable components, and long-term sustainability, urging professionals to maintain focus and foster a strong team culture for platform engineering.
-
Lessons on How to Get Timeouts, Retries and Idempotency Right from Sam Newman at QCon London
At QCon London, Sam Newman - the architect who has attributed the coining of the term microservices, went back to the basics to underline the three critical things to get right when working with distributed systems: timeouts, retries and idempotency. Through the talk, he provided mechanisms allowing distributed systems to be more robust.
-
QCon London 2025: Distributed Event-Driven Architectures across Multi-Cloud Boundaries
At QCon London 2025, Teena Idnani from Microsoft addressed the rise of multi-cloud adoption, revealing that 89% of organizations embrace this strategy. Using the fictional FinBank, she showcased practical strategies to overcome latency, resilience, event ordering, and duplication challenges, emphasizing the importance of security, observability, and continuous team education.
-
Optimize AI Workloads: Google Cloud’s Tips and Tricks
Google Cloud has announced a suite of new tools and features designed to help organizations reduce costs and improve efficiency of AI workloads across their cloud infrastructure. The announcement comes as enterprises increasingly seek ways to optimize spending on AI initiatives while maintaining performance and scalability.