Performance Content on InfoQ

News

Architecture & Design

Uber Improves Resiliency of Microservices with Adaptive Load Shedding

Uber created a new load-shedding library for its microservice platform, which serves over 130 million customers and handles aggregated peaks of millions of requests per second (RPS). The company replaced its QALM-based solution with the Cinnamon library, which, in addition to graceful degradation, can dynamically and continuously adjust the capacity of the service and the amount of load shedding.

Rafal Gancarz
on Feb 06, 2024
Architecture & Design

How RevenueCat Manages Caching for Handling over 1.2 Billion Daily API Requests

RevenueCat extensively uses caching to improve the availability and performance of its product API while ensuring consistency. The company shared its techniques to deliver the platform, which can handle over 1.2 billion daily API requests. The team at RevenueCat created an open-source memcache client that provides several advanced features.

Rafal Gancarz
on Jan 29, 2024
Java

The One Billion Row Challenge Shows That Java Can Process a One Billion Rows File in Two Seconds

On the first day of 2024, Gunnar Morling, Senior Staff Software Engineer at Decodable, launched The One Billion Row Challenge (1BRC) to the Java community. This ongoing challenge will run until the end of January and aims to find the Java code that processes one billion rows in the fastest time. So far, the entries on the podium finish the processing in under 2.5 seconds.

Olimpiu Pop
on Jan 29, 2024
Architecture & Design

Discord Scales to 1 Million+ Online MidJourney Users in a Single Server

Discord optimized its platform to serve over one million online users in a single server while maintaining a responsive user experience. The company evolved the guild component, which is responsible for fanning out billions of message notifications, in a series of performance and scalability improvements supported by system observability and performance tuning.

Rafal Gancarz
on Jan 26, 2024
Architecture & Design

Why LinkedIn chose gRPC+Protobuf over REST+JSON: Q&A with Karthik Ramgopal and Min Chen

LinkedIn announced that it would be moving to gRPC with Protocol Buffers for inter-service communication in its microservices platform, where it previously used the open-source Rest.li framework with JSON as the primary serialization format. InfoQ contacted Karthik Ramgopal and Min Chen to learn more about the decision and the company's motivations behind it.

Rafal Gancarz
on Dec 27, 2023
Culture & Methods

How to Become a High-Performing Software Team

The four major elements that enable high-performing software teams are purpose, decentralized decision-making, high trust with psychological safety, and embracing uncertainty. Teams can improve their performance by experimenting with their ways of working.

Ben Linders
on Nov 23, 2023
Architecture & Design

How DoorDash Rearchitected its Cache to Improve Scalability and Performance

DoorDash rearchitected the heterogeneous caching system they were using across all of their microservices and created a common, multi-layered cache providing a generic mechanism and solving a number of issues coming from the adoption of a fragmented cache.

Sergio De Simone
on Oct 28, 2023
Development

Python-Like Numerical Computation Library MatX Brings Transforms as Operators and Other Features

Developed by Nvidia for its own GPUs, MatX is a C++ library that aims to bring near-native performance to numerical computing using a high-level syntax not far from that of Python's SciPy or MATLAB. Its latest release brings a number of new features, including the ability to use transforms as operators and new operators such as upsample, downsample, and pwelch.

Sergio De Simone
on Oct 23, 2023
DevOps

Effective Performance Engineering at Twitter-Scale: Yao Yue at QCon San Francisco

During the second day of QCon San Francisco 2023, Yao Yue, the founder of IOP Systems, presented on performance engineering. In her session, Yue discussed how performance engineering is evolving in the modern era. For decades, hardware advancements have kept many performance engineers on the sidelines, but now, in a pivotal moment, their skills are more crucial than ever.

Steef-Jan Wiggers
on Oct 04, 2023
Java

GraalVM for JDK 21 Delivers Performance Enhancements and Improved Developer Experience

Oracle has recently announced the release of GraalVM for JDK 21. GraalVM is a JDK that uses an alternative just-in-time (JIT) compiler. It also includes a Native Image module, a technology that allows Java applications to run as native executables, without the need for a JVM. This can improve the performance of Java applications in terms of speed, memory, and size.

Andrea Messetti
on Sep 29, 2023
AI, ML & Data Engineering

Hugging Face's Guide to Optimizing LLMs in Production

When it comes to deploying Large Language Models (LLMs) in production, the two major challenges originate from the huge number of parameters they require and the necessity of handling very long input sequences to represent contextual information. Hugging Face has documented a list of techniques to tackle those hurdles based on its experience serving such models.

Sergio De Simone
on Sep 25, 2023
Cloud

Faster Standard Retrievals from S3 Glacier Flexible Retrieval and S3 Batch Operations

AWS recently announced the general availability of faster standard retrievals from S3 Glacier Flexible Retrieval. According to the company, retrievals can be up to 85% faster; the improvement applies to the Standard retrieval tier when using S3 Batch Operations.

Steef-Jan Wiggers
on Aug 18, 2023
Cloud

New Google Cloud H3 Virtual Machine Series for High-Performance Computing Workloads in Preview

Google recently launched a new H3 Virtual Machine (VM) series designed for High-Performance Computing (HPC) workloads. The series is available in public preview for Compute Engine and Google Kubernetes Engine (GKE) users and offers 88 cores (with simultaneous multi-threading disabled) and 352 GB of memory.

Steef-Jan Wiggers
on Aug 15, 2023
Cloud

AWS Launches General Availability of Amazon EC2 P5 Instances for AI/ML and HPC Workloads

AWS recently announced the general availability (GA) of Amazon EC2 P5 instances powered by the latest NVIDIA H100 Tensor Core GPUs, aimed at users who require high performance and scalability in AI/ML and HPC workloads. The GA is a follow-up to the earlier announcement of the development of the infrastructure.

Steef-Jan Wiggers
on Aug 03, 2023
Cloud

Microsoft Previews Azure Boost to Improve Remote Storage Throughput and IOPS Performance

During the recent Inspire 2023 conference, Microsoft announced the preview of Azure Boost to improve remote storage throughput and IOPS performance. By separating the hypervisor and host OS functions from the host infrastructure, the new option allows up to 10 Gbps throughput and 400K IOPS.

Renato Losio
on Jul 30, 2023
