InfoQ Homepage Cloud Content on InfoQ
-
AWS Disruption Exposes Fragility in Critical Cloud Infrastructure
On October 20, 2025, Amazon Web Services (AWS) experienced a major outage that disrupted global internet services, affecting millions of users and thousands of companies across more than 60 countries. The incident originated in the US-EAST-1 region and was traced to a DNS resolution failure affecting the DynamoDB endpoint, which cascaded into outages across multiple dependent services.
-
Parting the Clouds: the Rise of Disaggregated Systems by Murat Demirbas at QCon SF 2025
Cloud computing is evolving through disaggregation, addressing inefficiencies of traditional architectures by decoupling compute and storage. This shift enhances scalability, fault isolation, and operational simplicity, driven by advancements in networking. As seen in cloud databases such as Amazon Aurora, embracing these principles enables true economic optimization and innovative design.
-
Cloudflare Workflows Adds Python Support for Durable AI Pipelines
Innovative Cloudflare Workflows now supports both TypeScript and Python, enabling developers to orchestrate complex applications seamlessly. With durable execution and state persistence, it simplifies the development of robust data pipelines and AI/ML models. Experience enhanced concurrency and intuitive design, making orchestration effortless for Python enthusiasts.
-
Race Condition in DynamoDB DNS System: Analyzing the AWS US-EAST-1 Outage
On October 19th and 20th, AWS experienced an extended outage triggered by a failure in Amazon DynamoDB that affected most services in its most popular region, Northern Virginia. The cloud provider released an analysis of the incident, sparking discussions in the community about redundancy on AWS, moving out of public cloud, and multi-region approaches.
-
Microsoft Addresses Data Residency with Private Cloud Expansion
Microsoft has strengthened its Sovereign Cloud offering to meet stringent global data-residency and control regulations, particularly in Europe. New capabilities include a commitment to EU Data Boundary, expanded in-country data processing, and enhanced Sovereign Private Cloud features.
-
Go's New Green Tea Garbage Collector May Improve Performance up to 40%
Go 1.25 introduces a new experimental garbage collector that delivers up to 40% faster than the current implementation, bringing a significant performance improvement for GC-heavy workloads.
-
Google Cloud Introduces Chaos Engineering Framework and Recipes for Distributed Systems
Google Cloud's Expert Services Team has released a detailed guide on chaos engineering for cloud-based distributed systems. It highlights that the intentional creation of failures is essential for developing resilient architectures. The initiative provides open-source recipes and helpful guidance for applying controlled disruption testing in Google Cloud environments.
-
AWS Launches Capabilities by Region Tool
AWS has launched "AWS Capabilities by Region," a powerful tool that streamlines service visibility for architects and developers. No more manual checks—now you can compare AWS services across regions interactively and plan deployments efficiently. With enhanced transparency and automated capability checks, streamline global projects and minimize delays.
-
Azure APIM Simplifies Event-Driven Architecture with Native Service Bus Policy
Microsoft's new feature in API Management (APIM) enables seamless messaging to Azure Service Bus, simplifying API connections in event-driven architectures. By using the send-service-bus-message policy, developers can easily route HTTP requests to Service Bus for asynchronous processing, enhancing integration, security, and control without additional components.
-
Azure Front Door Outage: How a Single Control-Plane Defect Exposed Architectural Fragility
A recent 9-hour Azure Front Door (AFD) outage was triggered by a faulty control-plane configuration change that bypassed safety checks due to a software defect, leading to a massive blast radius and affecting M365 and Entra ID via Identity Coupling, exposing a critical architectural anti-pattern in centralized edge fabrics.
-
EU's Cloud Sovereignty SEAL Ranking Forces Governance and Resilience Trade-offs
The EU's new Cloud Sovereignty Framework establishes a standardized assessment for cloud services, enhancing digital autonomy and reducing dependence on non-EU giants. It introduces a scorecard system based on eight Sovereignty Objectives that influences public sector procurement decisions.
-
Cloudflare Introduces Data Platform with Zero Egress Fees
Cloudflare has recently announced the open beta of Cloudflare Data Platform, a managed solution for ingesting, storing, and querying analytical data tables using open standards such as Apache Iceberg.
-
AWS Launches M8a General-Purpose EC2 Instances with 5th Gen AMD EPYC (Turin) Processors
AWS has launched M8a instances, powered by 5th Gen AMD EPYC processors, delivering 30% higher performance than M7a. With enhanced I/O capabilities and up to 75 Gbps networking, they cater to high-performance applications like gaming and ML. M8a instances offer diverse deployment options, reinforcing AWS's commitment to cutting-edge cloud solutions.
-
AWS Launches C8i and C8i-flex EC2 Instances Powered by Custom Intel Xeon 6
AWS has launched compute-optimized C8i and C8i-flex instances featuring custom Intel Xeon 6 processors. Designed for compute-intensive tasks, these instances offer up to 384 vCPUs and 768 GiB memory, delivering up to 60% faster performance. The C8i-flex variant boosts cost efficiency, enhancing resource management for dynamic workloads.
-
Amazon Timestream for InfluxDB Adds Support for InfluxDB 3 Core and Enterprise
InfluxData has launched InfluxDB 3 Core and Enterprise on Amazon Timestream, offering a high-speed, open-source time-series database for real-time applications. With enhanced security, scalability, and performance, developers can seamlessly integrate with AWS services. InfluxDB 3 redefines data management for AI-driven environments, enabling rapid analytics and decision-making.