Performance & Scalability Content on InfoQ

News

Architecture & Design

Allegro Reduces Kafka Producer Latency Outliers by 82% after Switching to XFS

Allegro experimented with different performance optimization options to improve Apache Kafka producer tail latency and eventually switched all its clusters to the XFS filesystem. The company used Kafka protocol sniffing, JVM profiling, and eBPF, which proved instrumental in identifying and eliminating performance bottlenecks.

Rafal Gancarz
on Apr 26, 2024
Architecture & Design

QCon London: Scaling Microservices Architecture and Technology Organization at Trainline

During the recent QCon London conference, Trainline’s CTO spoke about the evolution of the company’s system architecture and organizational structure over the last five years. The company had to adapt to market changes and growing customer expectations by improving the performance and reliability of its technology platform.

Rafal Gancarz
on Apr 17, 2024
Architecture & Design

QCon London: Lessons Learned from Building LinkedIn’s AI/ML Data Platform

At the QCon London 2024 conference, Félix GV from LinkedIn discussed the AI/ML platform powering the company’s products. He specifically delved into Venice DB, the NoSQL data store used for feature persistence. The presenter shared the lessons learned from evolving and operating the platform, including cluster management and library versioning.

Rafal Gancarz
on Apr 15, 2024
Architecture & Design

QCon London: How Duolingo Sent 4 Million Push Notifications in 6 Seconds During the Super Bowl Break

As part of the Super Bowl marketing campaign, Duolingo sent out 4 million mobile push notifications when the company’s five-second ad aired during the commercial break. At QCon London, Duolingo’s engineers presented the asynchronous AWS architecture responsible for broadcasting messages to millions of users across seven US cities.

Rafal Gancarz
on Apr 10, 2024
Architecture & Design

QCon London: Meta Used Monolithic Architecture to Ship Threads in Only Five Months

Zahan Malkani talked during QCon London 2024 about Meta’s journey from identifying the opportunity in the market to shipping the Threads application only five months later. The company leveraged Instagram's existing monolithic architecture and quickly iterated to create a new text-first microblogging service in record time.

Rafal Gancarz
on Apr 09, 2024
Architecture & Design

Expedia Speeds up Flights Search with Micro Frontends and GraphQL Optimizations

Expedia made flight search faster by up to 52% (page usable time) by applying a range of optimizations to its web and mobile applications. To support these improvements, the company improved the observability of its applications. The Expedia Flights web application has been migrated to Micro Frontend Architecture (MFA) to allow flexibility, reusability, and better optimization.

Rafal Gancarz
on Mar 20, 2024
Architecture & Design

Hashnode Creates Scalable Feed Architecture on AWS with Step Functions, EventBridge and Redis

Hashnode created a scalable event-driven architecture (EDA) for composing feed data for thousands of users. The company used serverless services on AWS, including Lambda, Step Functions, EventBridge, and Redis Cache. The solution leverages Step Functions' distributed maps feature that enables high-concurrency processing.

Rafal Gancarz
on Mar 15, 2024
Architecture & Design

Uber Builds Scalable Chat Using Microservices with GraphQL Subscriptions and Kafka

Uber replaced a legacy architecture built using the WAMP protocol with a new solution that takes advantage of GraphQL subscriptions. The main drivers for creating a new architecture were challenges around reliability, scalability, and observability/debuggability, as well as technical debt impeding the team’s ability to maintain the existing solution.

Rafal Gancarz
on Mar 07, 2024
Architecture & Design

Pinterest Open-Sources a Production-Ready PubSub Java Client for Kafka, Flink, and MemQ

Pinterest open-sourced its generic PubSub client library, PSC, which has been heavily used in production for a year and a half. The library has increased developer velocity and improved the scalability and stability of the services using it. Over 90% of Java applications have migrated to PSC with minimal changes.

Rafal Gancarz
on Feb 11, 2024
Architecture & Design

Uber Improves Resiliency of Microservices with Adaptive Load Shedding

Uber created a new load-shedding library for its microservice platform, which serves over 130 million customers and handles aggregated peaks of millions of requests per second (RPS). The company replaced the solution based on QALM with the Cinnamon library, which, in addition to graceful degradation, can dynamically and continuously adjust the capacity of the service and the amount of load shedding.

Rafal Gancarz
on Feb 06, 2024
Architecture & Design

How RevenueCat Manages Caching for Handling over 1.2 Billion Daily API Requests

RevenueCat extensively uses caching to improve the availability and performance of its product API while ensuring consistency. The company shared its techniques to deliver the platform, which can handle over 1.2 billion daily API requests. The team at RevenueCat created an open-source memcache client that provides several advanced features.

Rafal Gancarz
on Jan 29, 2024
Architecture & Design

Discord Scales to 1 Million+ Online MidJourney Users in a Single Server

Discord optimized its platform to serve over one million online users in a single server while maintaining a responsive user experience. The company evolved the guild component, which is responsible for fanning out billions of message notifications, in a series of performance and scalability improvements supported by system observability and performance tuning.

Rafal Gancarz
on Jan 26, 2024
Architecture & Design

lastminute.com Improves Search Scalability Using Microservices with RabbitMQ and Redis

The team at lastminute.com rearchitected the search result aggregation process by breaking up the single service into multiple ones and introducing asynchronous integration. Developers used RabbitMQ for messaging and Redis for storing results from data suppliers. The revised architecture improved scalability and deployability and reduced resource utilization.

Rafal Gancarz
on Jan 12, 2024
Architecture & Design

Zendesk Moves from DynamoDB to MySQL and S3 to Save over 80% in Costs

Zendesk reduced its data storage costs by over 80% by migrating from DynamoDB to a tiered storage solution using MySQL and S3. The company considered different storage technologies and decided to combine the relational database and the object store to strike a balance between queryability and scalability while keeping costs down.

Rafal Gancarz
on Dec 29, 2023
Architecture & Design

Why LinkedIn chose gRPC+Protobuf over REST+JSON: Q&A with Karthik Ramgopal and Min Chen

LinkedIn announced that it would be moving to gRPC with Protocol Buffers for inter-service communication in its microservices platform, which previously used the open-source Rest.li framework with JSON as the primary serialization format. InfoQ contacted Karthik Ramgopal and Min Chen to learn more about the decision and the company's motivations behind it.

Rafal Gancarz
on Dec 27, 2023
