InfoQ Homepage Reliability Content on InfoQ

News

RSS Feed

Newer Older

Architecture & Design

QCon London: Scaling Microservices Architecture and Technology Organization at Trainline

During the recent QCon London conference, Trainline’s CTO spoke about the evolution of the company’s system architecture and organizational structure over the last five years. The company had to adapt to market changes and growing customer expectations by improving the performance and reliability of its technology platform.

Rafal Gancarz
on Apr 17, 2024
Architecture & Design

Decathlon Adopts Backend for Frontend (BFF) Pattern to Empower FE Teams

Decathlon established the Backend For Frontend (BFF) architectural pattern as a company-wide recommendation and provided guidelines for its adoption among engineering teams. The four-part series introduces the pattern and explores its benefits and potential pitfalls. The company also shares available alternatives to using the BFF pattern and reviews architectural considerations.

Rafal Gancarz
on Mar 25, 2024
Development

Erlang-Runtime Statically-Typed Functional Language Gleam Reaches 1.0

Gleam, an actor-based highly-concurrent functional language running on the Erlang virtual machine (BEAM), has reached version 1.0, which means it is now ready to be used in production systems with a guarantee of backward compatibility based on semantic versioning.

Sergio De Simone
on Mar 16, 2024
Architecture & Design

Uber Builds Scalable Chat Using Microservices with GraphQL Subscriptions and Kafka

Uber replaced a legacy architecture built using the WAMP protocol with a new solution that takes advantage of GraphQL subscriptions. The main drivers for creating a new architecture were challenges around reliability, scalability, observability/debugibility, as well as technical debt impeding the team’s ability to maintain the existing solution.

Rafal Gancarz
on Mar 07, 2024
Architecture & Design

Grab Improves Kafka on Kubernetes Fault Tolerance with Strimzi, AWS AddOns and EBS

Grab updated its Kafka on Kubernetes setup to improve fault tolerance and completely eliminate human intervention in case of unexpected Kafka broker terminations. To address the shortcomings of the initial design, the team integrated with AWS Node Termination Handler (NTH), used the Load Balancer Controller for target group mapping, and switched to ELB volumes for storage.

Rafal Gancarz
on Feb 21, 2024
Architecture & Design

Uber Improves Resiliency of Microservices with Adaptive Load Shedding

Uber created a new load-shedding library for its microservice platform, serving over 130 million customers and handling aggregated peaks of millions of requests per second (RPSs). The company replaced the solution based on QALM with Cinnamon library, which, in addition to graceful degradation, can dynamically and continuously adjust the capacity of the service and the amount of load shedding.

Rafal Gancarz
on Feb 06, 2024
Cloud

Zonal Autoshift on AWS: Optimizing Infrastructure Reliability

Zonal autoshift, a new capability of Amazon Route 53 Application Recovery Controller, automatically shifts traffic away from an Availability Zone (AZ) when a potential failure is identified by the cloud provider. The service redirects the traffic back once the AZ failure is resolved.

Renato Losio
on Jan 30, 2024
Cloud

Microsoft Refreshes its Well-Architected Framework

Microsoft recently announced a comprehensive refresh of the Well-Architected Framework (WAF) for designing and running optimized workloads on Azure.

Steef-Jan Wiggers
on Nov 15, 2023
Architecture & Design

AWS Restructures and Consolidates Its Well-Architected Framework

AWS published a new set of updates to its Well-Architected Framework, with changes across all six pillars of the framework. The performance efficiency and operational excellence pillars have been restructured and consolidated to reduce the number of best practices. Other pillars received improved implementation guidance, including recommendations and steps on reusable architecture patterns.

Rafal Gancarz
on Nov 08, 2023
Cloud

Google Delivers Comprehensive Cloud Infrastructure Reliability Guide

Google recently delivered a cloud infrastructure reliability guide combining best practices and expertise from its engineers for its customers.

Steef-Jan Wiggers
on Jan 24, 2023
Cloud

Azure Cosmos DB: Low Latency and High Availability at Planet Scale

Mei-Chin Sei and Vinod Sridharan spoke at QCon San Francisco on Azure Cosmos DB: Low Latency and High Availability at Planet Scale. The talk was part of the "Architectures You've Always Wondered About" track.

Steef-Jan Wiggers
on Oct 30, 2022
DevOps

Adopting Continuous Deployment: Tom Wanielista at QCon San Francisco 2022

At QCon San Francisco 2022, Tom Wanielista, a staff engineer on infrastructure at Lyft, presented on Adopting Continuous Deployment at his company. The talk is part of one of the editorial tracks called "Architecting Change at Scale."

Steef-Jan Wiggers
on Oct 25, 2022
Architecture & Design

Filibuster: Automated Fault Injection Tool to Improve DoorDash's Reliability

DoorDash recently revealed how they are using Filibuster, an automated fault injection tool, to identify resilience issues in microservice applications early on and improve platform reliability.

Tanmay Deshpande
on Sep 26, 2022
Cloud

Google Introduces Cloud Backup and Disaster Recovery

Google recently introduced Cloud Backup and Disaster Recovery (DR), allowing customers to enable centralized backup management directly from the Google Cloud console. The new backup and recovery service is designed to work with cloud storage repositories, databases, and applications.

Steef-Jan Wiggers
on Sep 18, 2022
Culture & Methods

Developing and Evolving SaaS Infrastructures for Enterprises

SaaS companies that are focused on the enterprise market need to evolve their infrastructure to meet the security, reliability, and other IT requirements of their customers. IT admins and large customers are two important sources of requirements to drive development.

Ben Linders
on Aug 04, 2022

Newer News

Older News

InfoQ Software Architects' Newsletter

News