InfoQ Homepage Performance & Scalability Content on InfoQ

News

RSS Feed

Newer Older

Architecture & Design

Uber Eats Scales Catalog Management from Restaurants to Retail with INCA Framework

Uber Eats introduced INCA (Inventory and Catalog), a scalable system to handle vast product catalogs from supermarkets, pharmacies, and retail partners. Unlike the earlier restaurant-focused setup built for low SKUs and simple pass-through data, INCA supports large-scale inventories, rich metadata, and compliance needs essential for retail operations.

Leela Kumili
on Aug 29, 2025
Cloud

AWS Lambda Response Streaming Increases Payload Limit to 200 MB

AWS has revolutionized Lambda with an increased response streaming payload limit from 20 MB to 200 MB. This enhancement allows developers to stream larger data sets, improving Time to First Byte performance. By simplifying response handling and eliminating complex workarounds, AWS empowers developers to deliver rich content seamlessly, transforming serverless applications.

Steef-Jan Wiggers
on Aug 27, 2025
Cloud

Amazon DocumentDB Serverless: Auto-Scaling Database Solution for Variable Workloads

AWS has launched Amazon DocumentDB Serverless, an auto-scaling database solution compatible with MongoDB, tailored for variable workloads. While marketed as "serverless," it functions more like auto-scaling, charging from $30/month. Ideal for enterprises and SaaS vendors, it adeptly handles spikes in demand, particularly for AI-driven applications.

Steef-Jan Wiggers
on Aug 07, 2025
Architecture & Design

Grab Switches from SQS and Redis to Temporal for Its Subscription Platform

Grab based the new architecture for GrabUnlimited on Temporal. The company enhanced user experience and reduced production incidents by 80% for its subscription platform, which serves millions of users. The new architecture significantly improved robustness and scalability, addressing a range of issues with the previous solution.

Rafal Gancarz
on Jul 21, 2025
Development

Apple Completes Migration of Key Ecosystem Service to Swift, Gains 40% Performance Uplift

Apple has migrated its global Password Monitoring service from Java to Swift, achieving a 40% increase in throughput and significantly reducing memory usage—freeing up nearly 50% of previously allocated Kubernetes capacity.

Matt Foster
on Jun 12, 2025
Architecture & Design

AWS Promotes Responsible AI in the Well-Architected Generative AI Lens

AWS announced the availability of the new Well-Architected Generative AI Lens, focused on providing best practices for designing and operating generative AI workloads. The lens is aimed at organizations delivering robust and cost-effective generative AI solutions on AWS. The document offers cloud-agnostic best practices, implementation guidance and links to additional resources.

Rafal Gancarz
on Apr 27, 2025
Architecture & Design

QCon London 2025: Applying Domain-Driven Design at Scale

At QCon London 2025, Vanderbijl unveiled how domain-driven design transformed a chaotic healthcare platform into a coherent business architecture. Through innovative strategies like "Take That" and "Robbie Williams," the team tackled architectural complexity, emphasizing adaptability and continuous improvement. This journey illustrates DDD as an evolving process essential for sustainable growth.

Steef-Jan Wiggers
on Apr 16, 2025
Cloud

QCon London 2025: Insights from 20+ Years in Mission-Critical Infrastructure

Matthew Liste, head of infrastructure at American Express, shared insights at QCon London 2025 on building robust cloud platforms in financial services. With 20+ years of experience, he emphasized stability, security, scalability, the value of interchangeable components, and long-term sustainability, urging professionals to maintain focus and foster a strong team culture for platform engineering.

Steef-Jan Wiggers
on Apr 10, 2025
AI, ML & Data Engineering

How Uber Sped up SQL-based Data Analytics with Presto and Express Queries

Uber uses Presto, an open-source distributed SQL query engine, to provide analytics across several data sources, including Apache Hive, Apache Pinot, MySQL, and Apache Kafka. To improve its performance, Uber engineers explored the advantages of dealing with quick queries, a.k.a. express queries, in a specific way and found they could improve both Presto utilization and response times.

Sergio De Simone
on Nov 18, 2024
DevOps

Improving the Efficiency of Goku Time-Series Database at Pinterest

Pinterest has modernized and enhanced its Goku time-series database. The recent updates focus on optimizing storage and resource usage without compromising service quality.

Mohit Palriwal
on Nov 06, 2024
Architecture & Design

Software Architecture Tracks at QCon San Francisco 2024 – Navigating Current Challenges and Trends

At QCon San Francisco 2024, software architecture is front and center, with two tracks dedicated to exploring some of the largest and most complex architectures today. Join senior software practitioners as they provide inspiration and practical lessons for architects seeking to tackle issues at a massive scale.

Artenisa Chatziou
on Nov 01, 2024
Architecture & Design

Netflix’s Pushy: Evolution of Scalable WebSocket Platform That Handles 100Ms Concurrent Connections

Netflix shared details on the evolution of Pushy, a WebSocket messaging platform that supports push notifications and inter-device communication across many different devices for the company’s products. Netflix’s engineers implemented many improvements across the Pushy ecosystem to ensure the platform's scalability and reliability and support new capabilities.

Rafal Gancarz
on Sep 23, 2024
Architecture & Design

Canva Opts for Amazon KDS over SNS+SQS to Save 85% with 25 Billion Events per Day

Canva evaluated different data massaging solutions for its Product Analytics Platform, including the combination of AWS SNS and SQS, MKS, and Amazon KDS, and eventually chose the latter, primarily based on its much lower costs. The company compared many aspects of these solutions, like performance, maintenance effort, and cost.

Rafal Gancarz
on Aug 07, 2024
Cloud

High HTTP Scaling with Azure Functions Flex Consumption: Q&A with Thiago Almeida and Paul Batum

Microsoft has introduced a significant enhancement to its Azure Functions platform with the Flex Consumption plan, designed to handle high HTTP scale efficiently. This new plan supports customizable per-instance concurrency, allowing users to achieve high throughput while managing costs effectively. In practical tests, Azure Functions Flex demonstrated the ability to scale from zero to 32,000 RPS.

Steef-Jan Wiggers
on Jun 26, 2024
Cloud

Microsoft Introduces the Public Preview of Flex Consumption Plan for Azure Functions at Build

At the annual Build conference, Microsoft announced the flex consumption plan for Azure Functions, which brings users fast and large elastic scale, instance size selection, private networking, availability zones, and higher concurrency control.

Steef-Jan Wiggers
on May 25, 2024

Newer News

Older News

InfoQ Software Architects' Newsletter

News