InfoQ Homepage Performance Content on InfoQ
-
The AI Gateway: Scaling Centralized Inference Across Decentralized Teams
Meryem Arik explains how AI model gateways resolve the chaos of decentralized engineering teams by centralizing inference. Learn to optimize costs, enforce governance, and maximize GPU utilization.
-
Using AI as a Thinking Partner for Large-Scale Engineering Systems
Google senior staff engineer Julie Qiu shares how she uses AI as a thinking partner to navigate large-scale systems, moving beyond code generation to architecting complex, multi-language ecosystems.
-
The Human Scalability Problem: Why Your Teams Don’t Scale Like Your Code
Charlotte de Jong Schouwenburg explains why scaling engineering teams often slows down delivery. She shares how to solve human latency by building trust and psychological safety across silos.
-
Speed at Scale: Optimizing the Largest CX Platform out There
Matheus Albuquerque explains how to modernize legacy React codebases using jscodeshift, code splitting, and Preact to achieve a 37% bundle reduction while maintaining support for older browsers.
-
Building Embedding Models for Large-Scale Real-World Applications
Sahil Dua explains the architecture and training of embedding models. He shares practical tips for distilling large models and scaling RAG applications for real-time production environments.
-
Scaling to 100+ as a Director: Lessons from Growing Engineering Organizations
Thiago Ghisi discusses his journey scaling engineering orgs to 100+ people. He shares a 3-level impact framework and explains how to evolve from a "solver" to a "driver" of organizational growth.
-
Beyond the Warehouse: Why BigQuery Alone Won’t Solve Your Data Problems
Sarah Usher explains why relying solely on a data warehouse fails at scale. She shares a 3-layer data lifecycle (Raw, Curated, Use Case) to help engineering leaders build flexible, decoupled systems.
-
Theme Systems at Scale: How to Build Highly Customizable Software
Guilherme Carreiro explains how Shopify scales a customizable theme system to 60M requests per minute. Learn about Liquid DSL constraints, native extensions, and bridging technical/non-technical gaps.
-
From Confusion to Clarity: Advanced Observability Strategies for Media Workflows at Netflix
Naveen Mareddy and Sujana Sooreddy explain how Netflix monitors massive media encoding workflows. They discuss scaling to 1M+ trace spans and using Flink to unlock real-time business insights.
-
Reliable Data Flows and Scalable Platforms: Tackling Key Data Challenges
Matthias Niehoff discusses bridging the gap between application and data engineering. Learn to apply software engineering best practices, embrace boring technologies, and simplify architecture.
-
From Dashboard Soup to Observability Lasagna: Building Better Layers
Martha Lambert shares the "Observability Lasagna" framework, explaining how her team replaced "dashboard soup" with a user-centric stack to achieve high system reliability for their on-call product.
-
Scaling API Independence: Mocking, Contract Testing & Observability in Large Microservices Environments
Tom Akehurst discusses how API mocking and simulation can solve microservice decoupling and productivity problems at scale, using observability, contract testing, and GenAI as guardrails.