Performance Content on InfoQ
-
How GitHub Copilot Serves 400 Million Completion Requests a Day
David Cheney explains the architecture powering GitHub Copilot, detailing how they achieve sub-200ms response times for millions of daily requests.
-
The Harsh Reality of Building a Real-Time ML Feature Platform
Ivan Burmistrov shares how ShareChat built their own Real-Time Feature Platform serving more than 1 billion features per second, and how they managed to make it cost efficient.
-
A Walk along the Complexity-Performance Curve
Richard Startin explores the relationship between complexity and performance through the lens of recent improvements to the Java String class, visiting continuous profiling and some assembly code.
-
Practical Benchmarking: How to Detect Performance Changes in Noisy Results
Matt Fleming provides tips for understanding noise, where it comes from, and techniques for fighting it, plus some anecdotes and real-life examples from the world of open source.
-
Lessons Learned in the Financial Market about Performance and Observability in Front-End Projects
Jessica Felix discusses how to navigate the intricate balance between performance and observability, and the challenges of maintaining equilibrium.
-
Empirical Observations on the Future of Scalable UI Architecture
Willian Martins discusses a concise course of action regarding the patterns to introduce into a UI system and how to build UI architectures for scale.
-
Dataflow-Based Query Caching with Readyset
Alana Marzoev discusses the fundamentals of streaming dataflow and the architecture of ReadySet, a streaming dataflow system designed for operational workloads.
-
Production Comes First - an Outside-In Approach to Building Microservices
Martin Thwaites introduces outside-in testing and shows how to use observability techniques in local development to build applications that are easier to debug locally and that run as first-class citizens.
-
Optimizing Java Applications on Kubernetes: beyond the Basics
Bruno Borges discusses strategies for enhancing Java application performance on Kubernetes, focusing on leveraging JVM ergonomics and managing garbage collection.
-
Evolving Trainline Architecture for Scale, Reliability and Productivity
Milena Nikolic discusses how Trainline's systems architecture has evolved over the past 5 years to accommodate change, as well as what's coming next.
-
Turbocharged Development: the Speed and Efficiency of WebAssembly
Danielle Lancashire discusses why Wasm is the most cross-platform unit of compute for serverless applications, and how that translates to efficiency at scale.
-
Pitfalls of Unified Memory Models in GPUs
Joe Rowell explores the use of unified memory on modern GPUs, the low-level details of how unified memory is realized on an x86-64 system, and some of the tools for understanding what's happening on a GPU.