InfoQ Homepage Scaling Content on InfoQ
-
Deezer Optimizes Kubernetes Autoscaling with Custom Metrics
Popular music streaming service Deezer has written about using custom metrics to enable auto-scaling in its Kubernetes infrastructure. Server utilisation and performance issues made scaling applications to an appropriate size and number of replicas challenging, and Kuberenetes' HPA scaling alone didn't solve these issues. So Deezer turned to custom metrics.
-
Kubernetes Autoscaler Karpenter Reaches 1.0 Milestone
Amazon Web Services (AWS) has released version 1.0 of Karpenter, an open-source Kubernetes cluster auto-scaling tool. This release marks Karpenter's graduation from beta status and introduces stable APIs and several new features. Karpenter, initially launched in November 2021, has evolved into a comprehensive Kubernetes-native node lifecycle manager.
-
How Tech-Enabled Networks of Software Teams Work
To maintain agility at scale, software teams can use technological and organizational solutions to reduce dependencies and work autonomously. According to Fabrice Bernhard, collaboration technology can be leveraged to create a distributed network of teams. To empower their teams, leaders can support them with a systematic problem-solving culture aimed at delivering good products to customers.
-
How to Build Large Scale Cyber-Physical Systems
To build large-scale safety-critical systems, we need to decompose the system into smaller solvable problems, resolve what is known, and resolve unknowns through experiments, Robin Yeman argued. She suggested investing in test environments for both software and hardware early to enable being test-driven early to increase the safety, security, reliability, and availability of the systems.
-
Expedia Open-Sources Container-Startup-Autoscaler (CSA) for Scaling Kubernetes Workloads
Expedia's Performance and Reliability team has recently open-sourced its container-startup-autoscaler (CSA). It is a Kubernetes controller leveraging the In-Place Update of Pod Resources feature to dynamically adjust CPU and/or memory resources of containers during startup based on user-defined startup/post-startup configurations.
-
DigitalOcean Introduces CPU-Based Autoscaling for its App Plaform
DigitalOcean has launched automatic horizontal scaling for its App Platform PaaS, aiming to free developers from the burden of scaling services up or down based on CPU load all by themselves.
-
How to Create a UI That's Both Robust and User Friendly
The key challenge in building UIs is balancing ease of use and maintainability, with scale and complexity. It requires thoughtful component design and an understanding of common usage paths to create a UI that's both robust and user-friendly. Automation can be a game-changer when it comes to improving efficiency and consistency in your codebase.
-
Developing Software to Manage Distributed Energy Systems at Scale
Functional programming techniques can make software more composable, reliable, and testable. For systems at scale, trade-offs in edge vs. cloud computing can impact speed and security.
-
Microsoft Announces the Preview of Serverless for Hyperscale in Azure SQL Database
Recently, Microsoft announced the preview of serverless for Hyperscale in the Azure SQL Database, which brings together the benefits of serverless and Hyperscale into a single database solution.
-
Scalable Automation Frameworks for Functional and Non-Functional Testing
Separating the capabilities of a testing framework from the actual tests can enable scaling automated testing for complex enterprise products. According to Alexander Velinov, we should agree on the types of tests to execute automatically during release and what should be kept as manually triggered tests.
-
Slack Implements Circuit Breakers to Improve CI/CD Pipeline Availability
Slack recently published how it implemented the Circuit Breaker pattern to improve its CI/CD pipeline availability. Before this project, engineers at Slack saw challenges as peak request volumes in internal tooling caused cascade failures in dependent systems. Since completion, engineers saw increased service availability and fewer bad developer experiences like flakiness from failing services.
-
Fitting Presto to Large-Scale Apache Kafka at Uber
The need for ad-hoc real-time data analysis has been growing at Uber. They run a large Apache Kafka deployment and need to analyse data going through the many workflows it supports. Solutions like stream processing and OLAP datastores were deemed unsuitable. An article was published recently detailing why Uber chose Presto for this purpose and what it had to do to make it performant at scale.
-
AWS Releases the Second Version of Amazon Aurora Serverless with Independent Scaling
Recently, AWS announced the general availability of the second version of Amazon Aurora Serverless, an on-demand, auto-scaling configuration for Amazon Aurora. The second version is generally available for both Aurora PostgreSQL and MySQL, featuring the independent scaling of compute and storage.
-
Using Team-Set Salaries for Company-Wide Compensation
Team-set salaries (TSS) can be scaled up by doing appraisals across teams where results are automatically calibrated. The scores indicate where conversations are needed. TSS encourages people to learn new skills and adapt.
-
Scaling Video Quality Measurements at Netflix with Cosmos
Netflix relies heavily on measuring perceptual video quality for different business purposes. As metrics evolve and become part of more workflows, their measurement tool needs to scale too. Netflix recently described how a new video quality measurement workflow was implemented using Cosmos microservices to foster innovation in quality metrics, with good scalability and loose data coupling.