Infrastructure Content on InfoQ

News

Cloud

Google Unveils Ironwood TPU for AI Inference

Google's Ironwood TPU, its most advanced custom AI accelerator, is built for what the company calls the "age of inference." Scaling up to 9,216 liquid-cooled chips, it delivers 42.5 exaflops, a figure Google says outpaces competitors. Ironwood is engineered for high-efficiency, low-latency AI tasks, and its design leverages AlphaChip.

Steef-Jan Wiggers
on May 02, 2025
DevOps

Optimize AI Workloads: Google Cloud’s Tips and Tricks

Google Cloud has announced a suite of new tools and features designed to help organizations reduce the cost and improve the efficiency of AI workloads across their cloud infrastructure. The announcement comes as enterprises increasingly look for ways to optimize spending on AI initiatives while maintaining performance and scalability.

Claudio Masolo
on Apr 09, 2025
Cloud

Microsoft Enhances Azure Elastic SAN with Auto Scale, Snapshot Support, and CRC Protection

Microsoft's Azure Elastic SAN, launched in early 2024, revolutionizes cloud block storage with unique autoscale capabilities, snapshot support, and CRC protection for enhanced data integrity. This fully managed solution simplifies storage management and optimizes costs, making it ideal for businesses seeking efficient, high-availability solutions in the cloud.

Steef-Jan Wiggers
on Mar 13, 2025
Cloud

Stack Refactoring for Enhanced Infrastructure Management in AWS CloudFormation Service

AWS CloudFormation's new stack refactoring feature transforms resource management, enabling seamless movement of resources between stacks. This enhances modularity and alignment with business needs, reduces misconfiguration risks, and boosts efficiency. Developers can optimize costs and improve clarity, making cloud architecture more manageable and adaptable.

Steef-Jan Wiggers
on Feb 13, 2025
Cloud

Amazon Launches High Memory U7inh EC2 Instance for Enhanced SAP HANA Workloads

AWS has unveiled the Amazon EC2 High Memory U7inh instance, a game-changer for mission-critical in-memory databases like SAP HANA, offering 32 TB of memory and 1,920 vCPUs. Designed with HPE, it doubles the performance of previous models, ensuring seamless integration in AWS. Maximize your SAP workloads in the cloud with enhanced speed and scalability.

Steef-Jan Wiggers
on Jan 09, 2025
Cloud

Google Cloud Launches Sixth Generation Trillium TPUs: More Performance, Scalability and Efficiency

Google Cloud's Trillium, its sixth-generation TPU, is now available. It enhances AI workloads with unmatched performance and 67% better energy efficiency. Integral to the AI Hypercomputer, Trillium boasts training speeds over 4x faster and triples inference throughput. This leap positions Google as a contender against Nvidia in the AI data center market.

Steef-Jan Wiggers
on Dec 28, 2024
Cloud

Azure Boost DPU: Microsoft's New Silicon Solution for Enhanced Cloud Performance

At Ignite 2024, Microsoft unveiled the Azure Boost DPU, its first in-house solution for low-power, data-centric workloads. This innovative chip optimizes cloud performance and security, offering triple the efficiency of CPUs. With a robust hardware-software design, Microsoft’s advancements position it to redefine AI and cloud infrastructure.

Steef-Jan Wiggers
on Dec 25, 2024
Cloud

Amazon EC2 R8g Instances with AWS Graviton4 Processors Generally Available

AWS has announced the general availability of Amazon EC2 R8g instances, which use AWS Graviton4 processors. These instances have been available in preview since November 2023 and are designed for memory-intensive workloads such as databases, in-memory caches, and real-time big data analytics.

Steef-Jan Wiggers
on Jul 26, 2024
DevOps

Ahrefs Joins Others in Suggesting That On-Premises Hosting Can Be More Cost Effective than Cloud

A recent article claims that Ahrefs, an SEO software suite company, avoided $400 million in expenditures over three years by not using cloud resources. Similarly, 37signals, the maker of Basecamp, has begun a cloud exodus with the stated goal of saving seven million dollars in infrastructure costs over five years.

Matt Campbell
on May 19, 2024
Culture & Methods

Making Software Development Boring to Deliver Business Value

Given there’s a limit to our cognitive abilities and our comprehension of complex systems, Corstian Boerman argues that software development should become boring. He suggests moving infrastructure out of the way so that it does not burden the day-to-day development process, and focusing on delivering business value in a predictable and repeatable way.

Ben Linders
on Mar 07, 2024
DevOps

CNCF Survey: Half of Organizations Spend More with Kubernetes, Mostly Due to Overprovisioning

CNCF published the results of its latest microsurvey report on cloud-native FinOps and cloud financial management (CFM). Kubernetes has driven cloud spending up for 49% of respondents, while 28% said their costs remained unchanged and 24% reported savings after migrating to Kubernetes. Respondents listed overprovisioning, lack of awareness and responsibility, and sprawl as the main factors behind overspending.

Rafal Gancarz
on Mar 04, 2024
DevOps

Roblox Builds New Cellular Infrastructure to Improve Gaming Experience

The online game platform and creation system Roblox has detailed how it has made its infrastructure more efficient and resilient to support the demands of more than 70 million daily active users engaged in immersive 3D experiences.

Matt Saunders
on Jan 03, 2024
Cloud

Canonical Releases a Low-Touch, Open Source Cloud Solution with MicroCloud

Canonical recently announced the general availability of MicroCloud, a low-touch, open source cloud solution designed for scalable clusters and edge deployments. It is aimed at edge computing as well as at customers who need a small-scale private cloud.

Steef-Jan Wiggers
on Nov 22, 2023
DevOps

System Initiative Software Goes Open Source; Aims to Model and Automate Infrastructure Management

System Initiative, a customizable power tool, recently open-sourced all of its software under the Apache License 2.0. The release aims to improve the DevOps landscape, with a specific emphasis on simulating the user’s infrastructure and using that simulation to manage real-world systems.

Aditya Kulkarni
on Oct 17, 2023
Java

QCon San Francisco 2023 Day 1: Architectures, Data Engineering, Infra Languages, Staff+ Skills

The 17th annual QCon San Francisco conference was held at the Hyatt Regency San Francisco in San Francisco, California. The five-day event, organized by C4Media, consisted of three days of presentations and two days of workshops. Day One, held on October 2nd, 2023, included a keynote address by Suhail Patel and presentations from four conference tracks and two sponsored tracks.

Michael Redlich
on Oct 03, 2023
