InfoQ Homepage Microservices Content on InfoQ
-
Reddit Migrates Comment Backend from Python to Go Microservice to Halve Latency
Reddit has rebuilt its core backend, migrating Comments, Accounts, Posts, and Subreddits from a legacy Python monolith to Go microservices. The migration improves performance, halves critical write latency, and modernizes the platform for future scalability while preserving correctness across multiple datastores.
-
Monzo’s Real-Time Fraud Detection Architecture with BigQuery and Microservices
Monzo has redesigned its fraud prevention platform to detect scams in real time, handle growing payment volumes, and deploy new controls rapidly. Explore the bank’s modular control architecture, feature computation pipeline, and observability using BigQuery for accurate, low-latency fraud detection.
-
Introducing the MCP Registry
The Model Context Protocol (MCP) ecosystem is enhancing AI development with a public registry for server discovery and a secure gateway for agent interactions. This initiative, featuring the recently launched MCP Registry and the Linux Foundation's Agentgateway project, streamlines the management of AI tools, fostering collaboration and security for engineering teams.
-
Impulse, Airbnb’s New Framework for Context-Aware Load Testing
Airbnb has developed Impulse, an internal load testing framework to improve microservice reliability and performance. It enables distributed, large-scale testing and lets teams run self-service, context-aware load tests integrated with CI pipelines. By simulating production-like traffic, Impulse helps engineers identify bottlenecks and errors before changes reach production.
-
How Allegro Does Automated Code Migrations for over 2000 Microservices
Allegro shared the details of the process it uses to manage code migrations at scale. The company combined GitHub’s Dependabot and OpenRewrite projects into a custom solution that helps developers perform mundane code migration tasks automatically across numerous source code repositories. The company tackled many edge cases to ensure the process operates smoothly, relieving initial trust issues.
-
Scaling API Independence: Akehurst on Mocking, Contract Testing, and Observability
At QCon London 2025, Tom Akehurst spotlighted the path to developer autonomy in microservices through "Scaling API Independence." He emphasized advanced mocking, contract testing, and observability to combat API dependencies. Akehurst showcased how these strategies, enhanced by AI, streamline development, boost productivity, and ensure integration confidence amidst complexity.
-
Lessons on How to Get Timeouts, Retries and Idempotency Right from Sam Newman at QCon London
At QCon London, Sam Newman - the architect who has attributed the coining of the term microservices, went back to the basics to underline the three critical things to get right when working with distributed systems: timeouts, retries and idempotency. Through the talk, he provided mechanisms allowing distributed systems to be more robust.
-
Stripe Rearchitects Its Observability Platform with Managed Prometheus and Grafana on AWS
Stripe replaced its observability platform, which used a third-party vendor solution, with a new architecture utilizing managed services on AWS. The company made the move due to scalability limits, reliability issues, and increasing costs while transitioning to microservices. The migration involved dual-writing metrics, translating assets, validation, and user training.
-
Agoda’s Unconventional Client-First Transition from a GraphQL Monolith to Microservices
Agoda recently described their innovative approach to transitioning from a monolithic GraphQL API to a microservices architecture. Unlike traditional methods focusing on breaking down server-side components first, Agoda adopted a client-first strategy, preparing their client applications to handle both the monolith and the microservices in parallel using an in-house smart orchestrator library.
-
Netflix Rolls Out Service-Level Prioritized Load Shedding to Improve Resiliency
Netflix extended its prioritized load-shedding implementation to the individual service level to further improve system resilience. The approach uses cloud capacity more efficiently by shedding low-priority requests only when necessary instead of maintaining separate clusters for failure isolation.
-
Apache Tomcat 11.0 Delivers Support for Virtual Threads and Jakarta EE 11
Apache Tomcat 11 represents a pivotal advancement in web server technology, supporting Jakarta EE 11 and featuring virtual threads for efficient concurrency via Project Loom. Enhanced WebSocket performance, improved asynchronous processing, and robust security measures make it ideal for modern applications. Transitioning requires namespace adjustments, but offers tools for a seamless migration.
-
Planning, Automation and Monorepo: How Monzo Does Code Migrations Across 2800 Microservices
Monzo products are supported by an extensive microservice-based platform of over 2800 services. The company relies on planning and heavy automation to drive code migrations at scale and leverages config service to support gradual roll forwards and quick rollbacks in case of issues. Migrations are managed by a central team rather than service owner teams to avoid delays and inconsistencies.
-
Netflix’s Pushy: Evolution of Scalable WebSocket Platform That Handles 100Ms Concurrent Connections
Netflix shared details on the evolution of Pushy, a WebSocket messaging platform that supports push notifications and inter-device communication across many different devices for the company’s products. Netflix’s engineers implemented many improvements across the Pushy ecosystem to ensure the platform's scalability and reliability and support new capabilities.
-
Production Comes First - an Outside-In Approach to Building Microservices by Martin Thwaites
Martin Thwaites, an observability evangelist, developer, and developer advocate at honeycomb.io, presented on Production Comes First - an Outside-In Approach to Building Microservices. The session was part of the "Connecting Systems: APIs, Protocols, Observability" track.
-
QCon London: Scaling Microservices Architecture and Technology Organization at Trainline
During the recent QCon London conference, Trainline’s CTO spoke about the evolution of the company’s system architecture and organizational structure over the last five years. The company had to adapt to market changes and growing customer expectations by improving the performance and reliability of its technology platform.