InfoQ Homepage DevOps Content on InfoQ
-
Traefik v3.3 Release: Enhanced Observability and Documentation
TraefikLabs recently announced the latest release of Traefik Proxy v3.3 (codenamed "saint-nectaire” after a French cheese). This release focuses primarily on two critical areas: observability capabilities and improved documentation structure. These enhancements aim to make the popular open-source reverse proxy even more powerful for platform engineers working in complex cloud-native environments.
-
Dropbox’s Asynchronous Platform Evolution: from Challenges to a Unified Messaging System Model
Dropbox, a file sharing cloud platform, recently discussed their Messaging System Model (MSM) that supports diverse use cases and handles over 30 million tasks per minute.
-
Most Companies Experience Weekly Outages: The State of Resilience 2025 Report
According to The State of Resilience 2025 Report, published by Cockroach Labs, outages are commonplace in most organizations, with 55% of companies reporting weekly and 14% reporting daily outages. Staggering 100% of survey participants experienced revenue losses due to outages, with some companies (8%) reporting losses of USD $1 million or higher over the last 12 months.
-
ClickHouse Reports 25% Performance Gain after Migrating Cloud Workloads to AWS ARM Instances
ClickHouse recently shared its migration strategy to AWS Graviton over the past six months, reporting a 25% performance improvement for end users. The engineering team outlines the steps taken to establish a performance baseline and transition the managed ClickHouse Cloud service to the new ARM deployment while handling large-scale production workloads.
-
Stack Refactoring for Enhanced Infrastructure Management in AWS CloudFormation Service
AWS CloudFormation's new stack refactoring feature transforms resource management, enabling seamless movement of resources between stacks. This enhances modularity and alignment with business needs, reduces misconfiguration risks, and boosts efficiency. Developers can optimize costs and improve clarity, making cloud architecture more manageable and adaptable.
-
AWS CodeBuild Now Supports Fastlane to Simplify iOS Deployments
CodeBuild, AWS continuous integration service, now provides Fastlane support to manage tasks such as code signing, screenshot generation, beta distribution, and app store submissions for iOS, iPadOS, watchOS, tvOS, and macOS apps.
-
Prezi's Journey from Prometheus to VictoriaMetrics
Prezi’s engineering team recently discussed their transition from a Prometheus-based monitoring system to VictoriaMetrics, focusing on cost optimization, performance improvements, and architectural simplicity. This transition reduced the costs by approximately 30%, and speed of completion for heavy queries reduced to 3-7 seconds from 30+ seconds.
-
OpenAI Features New o3-mini Model on Microsoft Azure OpenAI Service
OpenAI has launched the advanced o3-mini model via Microsoft Azure, enhancing AI applications with improved cost efficiency, faster performance, and adjustable reasoning capabilities. Designed for complex tasks, it supports structured outputs and backward compatibility. With widespread access, the o3-mini empowers developers to drive innovation across various industries.
-
Slack Enhances Accessibility Testing through Automation
Slack recently integrated automated accessibility testing into its software development lifecycle to improve user experience for individuals with disabilities. Slack has internal standards and the company further collaborates with external accessibility testers as well. These standards align with WCAG, an internationally recognized benchmark for web accessibility.
-
Recent Generations of Apple CPUs Affected by New Side-Channel Vulnerabilities
Researchers at the Georgia Institute of Technology and the Ruhr University Bochum demonstrated two new side channel attacks on recent M and A CPUs from Apple that leak sensitive information when using Chrome and Safari to visit popular sites.
-
How Locking, Saturation and CDN Network Issues Brought down Canva
The Canva engineering team recently published their post-mortem on the outage they experienced last November, detailing the API Gateway failure and the lessons learned during the incident.
-
Amazon EventBridge Event Bus Cross-Account Event Delivery
AWS enhances Amazon EventBridge, enabling direct event delivery across accounts. This feature simplifies architecture, boosts security, and reduces latency, allowing seamless routing of events to AWS services like SQS and Lambda. Leverage fine-grained IAM control for streamlined event sourcing. Discover more about this transformative update on AWS documentation and GitHub.
-
DeepSeek Database Leaking Sensitive Information Highlights AI Security Risks
Cloud security firm Wiz uncovered unprotected DeepSeek database giving full control over database operations and access to internal data including millions of lines of chat logs. While the vulnerability has been quickly fixed, the incident shows the need for the AI industry to enforce higher security standards, says the company.
-
Dragonfly v2.2.0 Released: Rust Client, P2P Upgrades, and Enhanced Image Acceleration
In January, the Cloud Native Computing Foundation (CNCF) announced the release of Dragonfly version 2.2.0, introducing several significant enhancements to the open-source, peer-to-peer (P2P) file distribution and image acceleration system.
-
Git 2.48: Modernized with Meson Build System, Memory Leaks Fixed
Git 2.48 now supports the Meson build system, a modern alternative to the existing Makefile, Autoconf, and CMake-based systems. This change addresses limitations in IDE support and maintenance issues with older systems.