InfoQ Homepage Performance & Scalability Content on InfoQ

News

RSS Feed

Newer Older

Cloud

Microsoft Refreshes its Well-Architected Framework

Microsoft recently announced a comprehensive refresh of the Well-Architected Framework (WAF) for designing and running optimized workloads on Azure.

Steef-Jan Wiggers
on Nov 15, 2023
Architecture & Design

AWS Restructures and Consolidates Its Well-Architected Framework

AWS published a new set of updates to its Well-Architected Framework, with changes across all six pillars of the framework. The performance efficiency and operational excellence pillars have been restructured and consolidated to reduce the number of best practices. Other pillars received improved implementation guidance, including recommendations and steps on reusable architecture patterns.

Rafal Gancarz
on Nov 08, 2023
Architecture & Design

How DoorDash Rearchitected its Cache to Improve Scalability and Performance

DoorDash rearchitected the heterogeneous caching system they were using across all of their microservices and created a common, multi-layered cache providing a generic mechanism and solving a number of issues coming from the adoption of a fragmented cache.

Sergio De Simone
on Oct 28, 2023
Architecture & Design

Monzo Employs Targeted Traffic Shedding against Stampeding Herd Effect from the Mobile App

Monzo developed a solution for shedding traffic in case its platform comes under intense and unexpected load that could lead to an outage. Traffic spikes can be generated by the mobile app and triggered by push notifications or other bursts in user activity. The solution can reduce the read traffic by almost 50% with 90% overall accuracy without noticeable customer impact.

Rafal Gancarz
on Oct 23, 2023
Architecture & Design

Contentsquare Uses Microservices and Apache Kafka for Notification Delivery

Contentsquare needed notification functionality for many use cases within its platform. The company created a generic solution spanning multiple services as part of its microservice architecture. During the implementation, the developers had to improve observability and overcome some scalability challenges.

Rafal Gancarz
on Oct 20, 2023
Cloud

Google Improves Cloud Spanner: More Compute and Storage without Price Increase

Google recently announced various improvements to Cloud Spanner, its distributed, decoupled relational database service with a “50% increase in throughput and 2.5 times the storage per node than before” without a price change.

Steef-Jan Wiggers
on Oct 14, 2023
Development

Eating One's Own Dogfood: GitHub Using Actions and Runners for GitHub.com

To improve how they ship software in a scalable and effective way, GitHub has adopted GitHub Actions for a part of their continuous integration system. In particular, they leveraged the new Actions larger runners to get to run 15,000 CI jobs across 150,000 cores. In the process they also extended larger runners capabilities for all their users.

Sergio De Simone
on Oct 07, 2023
DevOps

Effective Performance Engineering at Twitter-Scale: Yao Yue at QCon San Francisco

During the second day of QCon San Francisco 2023, Yao Yue, the founder of IOP Systems, presented on performance engineering. In her session Yue discussed the evolving performance engineering in the modern era. For decades, hardware advancements have kept many performance engineers on the sidelines, but now, in a pivotal moment, their skills are more crucial than ever.

Steef-Jan Wiggers
on Oct 04, 2023
AI, ML & Data Engineering

Hugging Face's Guide to Optimizing LLMs in Production

When it comes to deploying Large Language Models (LLMs) in production, the two major challenges originate from the huge amount of parameters they require and the necessity of handling very long input sequences to represent contextual information. Hugging Face has documented a list of techniques to tackle those hurdles based on their experience serving such models.

Sergio De Simone
on Sep 25, 2023
Architecture & Design

LinkedIn's Open-Source "iris-message-processor" Achieves 86.6x Faster Escalation Management Speeds

LinkedIn developed a new open-source service called "iris-message-processor" to enhance the performance and reliability of its existing Iris escalation management system. "iris-message-processor" significantly improves processing speeds, being ~4.6x faster under average loads and ~86.6x faster under high loads than its predecessor.

Eran Stiller
on Sep 11, 2023
Cloud

Golem Unveils a Resilient Computing Platform for Serverless Workers with WebAssembly Component Model

Recently Golem released its flagship product Golem Cloud, a durable computing platform allowing developers to build and deploy long-running, stateful serverless workers that are resistant to failures, upgrades, and updates. The product is currently in developer preview.

Steef-Jan Wiggers
on Aug 22, 2023
Architecture & Design

Cadence 1.0: Uber Releases Its Scalable Workflow Orchestration Platform

Uber released a major version of its workflow orchestration platform named Cadence after six years in development. Uber and other companies use Cadence to build stateful services at scale using native programming languages.

Rafal Gancarz
on Aug 07, 2023
Cloud

AWS Launches General Availability of Amazon EC2 P5 Instances for AI/ML and HPC Workloads

AWS recently announced the general availability (GA) of Amazon EC2 P5 instances powered by the latest NVIDIA H100 Tensor Core GPUs suitable for users that require high performance and scalability in AI/ML and HPC workloads. The GA is a follow-up to the earlier announcement of the development of the infrastructure.

Steef-Jan Wiggers
on Aug 03, 2023
Cloud

Microsoft Azure Managed Lustre for HPC and AI Workloads Now Generally Available

Microsoft recently announced the general availability (GA) of Azure Managed Lustre, a managed file system for high-performance computing (HPC) and AI workloads.

Steef-Jan Wiggers
on Jul 20, 2023
Architecture & Design

How LinkedIn Serves over 4.8 Million Member Profiles per Second

LinkedIn introduced Couchbase as a centralized caching tier for scaling member profile reads to handle increasing traffic that has outgrown their existing database cluster. The new solution achieved over 99% hit rate, helped reduce tail latencies by more than 60% and costs by 10% annually.

Rafal Gancarz
on Jul 03, 2023

Newer News

Older News

InfoQ Software Architects' Newsletter

News