InfoQ Homepage Performance & Scalability Content on InfoQ
-
Why a Hedge Fund Built Its Own Database
James Munro discusses ArcticDB and the practicalities of building a performant time-series datastore and why transactions, particularly the Isolation in ACID, is just not worth it.
-
Delivering Millions of Notifications within Seconds During the Super Bowl
Zhen Zhou discusses how they built/test an on-demand notification system, what it takes to manage cloud resources/site-reliability at the same time, and how to mitigate reliability issues.
-
LIquid: a Large-Scale Relational Graph Database
Scott Meyer discusses LIquid, the graph database built to host LinkedIn, serving a ~15Tb graph at ~2M QPS.
-
From Mainframes to Microservices - the Journey of Building and Running Software
Suhail Patel discusses the platforms and software patterns that made microservices popular, and how virtual machines and containers have influenced how software is built and run at scale today.
-
Modern Compute Stack for Scaling Large AI/ML/LLM Workloads
Jules Damji discusses which infrastructure should be used for distributed fine-tuning and training, how to scale ML workloads, how to accommodate large models, and how CPUs and GPUs can be utilized.
-
Sleeping at Scale - Delivering 10k Timers per Second per Node with Rust, Tokio, Kafka, and Scylla
Lily Mara and Hunter Laine walk through the design of a system, its performance characteristics, and how they scaled it.
-
Several Components are Rendering: Client Performance at Slack-Scale
Jenna Zeigen discusses front-end performance issues encountered by Slack as they continue to grow and evolve the desktop app.
-
Effective Performance Engineering at Twitter-Scale
Yao Yue recapitulates scaling a project at Twitter while summarizing some key lessons learned about effective performance engineering.
-
Scaling Organizations with Platform Engineering
Lesley Cordero focuses on how Platform Engineering can drive sustainability for growing organizations through DevOps principles, centralization, and scalable technical practices.
-
The Journey to a Million Ops / Sec / Node in Venice
Alex Dubrouski, andGaojie Liu discuss some of the tricks used in their pursuit to lower read latency and to reach 1M operations per second per node.
-
Sigstore: Secure and Scalable Infrastructure for Signing and Verifying Software
Billy Lynch and Zack Newman discuss the architecture and internals of Sigstore and keyless signing, along with the security considerations that drove the design.
-
Managing 238M Memberships at Netflix
Surabhi Diwan discusses how the Netflix’ membership team outgrew many of its technology and architectural choices as memberships went from a few hundred thousand to 200 million.