InfoQ Homepage Cloud Computing Content on InfoQ
-
Uber's Cloud Journey: Embracing ARM in an x86 World
Uber embarked on a strategic migration from on-premise data centers to Oracle Cloud Infrastructure (OCI) and Google Cloud Platform in February 2023. A key component of this migration was integrating ARM-based computers into their predominantly x86 fleet to reduce costs, improve price performance, and ensure hardware flexibility amid supply chain uncertainties.
-
UC Berkeley's Sky Computing Lab Introduces Model to Reduce AI Language Model Inference Costs
UC Berkeley's Sky Computing Lab has released Sky-T1-32B-Flash, an updated reasoning language model that addresses the common issue of AI overthinking. The model, developed through the NovaSky (Next-generation Open Vision and AI) initiative, "slashes inference costs on challenging questions by up to 57%" while maintaining accuracy across mathematics, coding, science, and general knowledge domains.
-
Google Cloud Launches Sixth Generation Trillium TPUs: More Performance, Scalability and Efficiency
Google Cloud's Trillium, its sixth-generation TPU, is now available. It enhances AI workloads with unmatched performance and 67% better energy efficiency. Integral to the AI Hypercomputer, Trillium boasts training speeds over 4x faster and triples inference throughput. This leap positions Google as a contender against Nvidia in the AI data center market.
-
KubeCon + CloudNativeCon NA 2024: Yahoo’s Kubernetes Journey On-Prem to Multi-Cloud
At KubeCon+CloudNativeCon North America 2024, Nandhakumar Venkatachalam and Payal Patel shared Yahoo's Kubernetes journey from on-premises to multi-cloud at scale, underscoring challenges faced and lessons learned during this transition.
-
Anna Berenberg Talks at QCon San Francisco on Google's One Network
Anna Berenberg, an Engineering Fellow at Google Cloud, unveiled One Network, a cloud-agnostic architecture that simplifies complex interconnected systems. Unifying disparate environments and leveraging open-source technologies enhances operational efficiency and consistency in security policies, empowering developers to focus on service endpoints while ensuring seamless platform integration.
-
Vercel Expands AI Toolkit with AI SDK 4.0 Update
Vercel has announced version 4.0 of its open-source AI SDK toolkit designed for building AI applications in JavaScript and TypeScript. The update introduces key features like PDF support, computer use integration, and a new xAI Grok API.
-
First Google Axion Processor Now Available: Claims Best Performance in Cloud Market
Google has announced the general availability of its C4A virtual machines, marking the debut of Axion-based instances. The cloud provider claims these instances deliver up to 10% better price-performance compared to the latest Arm-based alternatives from competitors, including Amazon Graviton4.
-
How Allegro Reduced the Cost of Running a GCP Dataflow Pipeline by 60%
Allegro achieved significant savings for one of the Dataflow Pipelines running on GCP Big Data. The company continues working on improving the cost-effectiveness of its data workflows by evaluating resource utilization, enhancing pipeline configurations, optimizing input and output datasets, and improving storage strategies.
-
Microsoft Unveils Azure Cobalt 100-Based Virtual Machines: Enhanced Performance and Sustainability
Microsoft's Azure Cobalt 100 VMs are now generally available. They deliver up to 50% improved price performance with energy-efficient Arm architecture. Tailored for diverse workloads, these VMs offer various configurations, including general-purpose and memory-optimized options. Their release supports sustainable computing, aligning with Microsoft's commitment to lower carbon footprints.
-
System Initiative Launches DevOps Platform to Address Cloud Stack Drift
System Initiative aims to tackle some long-standing problems in conventional industry practices for deploying and managing cloud infrastructure. Specifically, existing solutions requiring static configurations for cloud infrastructure cannot reliably reflect fast-changing and dynamic cloud services, leading to "stack drift": unpredictability when engineers apply changes to infrastructure.
-
AWS CodeBuild Now Supports Mac Builds
Amazon has recently announced that AWS CodeBuild, its managed build service, now supports building applications on macOS. However, due to Apple's licensing requirements, developers must still reserve a dedicated macOS fleet to utilize this new option.
-
How AWS Well-Architected Framework Supports Frugal Architecture
AWS shared how to deliver frugal architecture using Well-Architectured Framework guidance. The authors outline areas of alignment between the WAF and the Frugal Architect laws Dr. Werner Vogels introduced in his re:Invent 2023 keynote. By implementing the frugal approach, organizations can achieve sustainability and cost optimization goals for their cloud architectures.
-
Google Cloud Launches C4 Machine Series: High-Performance Computing and Data Analytics
Google Cloud recently announced the general availability of its new C4 machine series, powered by 4th Gen Intel Xeon Scalable Processors (Sapphire Rapids). The series offers a range of configurations tailored to meet the needs of demanding applications such as high-performance computing (HPC), large-scale simulations, and data analytics.
-
How Amazon Aurora Serverless Manages Resources and Scaling for Fleets of 10K+ Instances
AWS engineers published a paper describing the evolution and latest design of resource management and scaling for the Amazon Aurora Serverless platform. Aurora Serverless uses a combination of components at different levels to create a holistic approach for dynamically scaling and adjusting resources to satisfy the needs of customer workloads.
-
Increased Popularity of Artist Platform Cara Led to Substantial Vercel Functions Expenses
Cara's web and mobile app, a platform for artists, combines the experience of Instagram and X. It provides a space for users to showcase their portfolios and share updates on their feeds. The app's serverless backend, powered by Vercel Functions, recently incurred a significant cost of $ 98,280 due to a sudden surge in usage.