DevOps Content on InfoQ
-
Periskop: SoundCloud's Exception Monitoring Service
SoundCloud's engineering team wrote about Periskop, their exception monitoring service, which collects and aggregates exceptions across servers and reports them to a central server for analysis.
-
TLS Improvements Backported to Java 8
Application-Layer Protocol Negotiation (ALPN) is now available in Java 8, so applications can communicate over HTTP/2 without upgrading to a later Java version.
-
Microsoft Updates Azure Dedicated Hosts with Reservations, Maintenance Control and More
In a recent blog post, Microsoft announced updates to its Dedicated Host service in Azure, including cost-saving reservations, maintenance control, additional SKU options, and Resource Health alerts.
-
Grafana Labs Announces GA of Cortex v1.0 and Discusses Architectural Changes
Grafana Labs, the company behind popular open-source monitoring projects Grafana and Loki, announced the General Availability of Cortex v1.0. Cortex is a clustered Prometheus implementation that includes features such as horizontal scalability, multi-tenancy, durability, and long-term storage.
-
GitHub Was down Multiple Times Last February: Here's Why
GitHub completed its internal investigation into the multiple service interruptions that affected its service for over eight hours last February. The root cause was a combination of unexpected database load variation and database configuration issues.
-
What's New in MicroProfile 3.3
The Eclipse Foundation released MicroProfile 3.3 featuring updates to five APIs - Rest Client, Config, Fault Tolerance, Metrics and Health. Other improvements include clarifications and enhancements to specifications and documentation, improved integration among all the MicroProfile APIs, interoperability across different MicroProfile implementations, and a complete set of artifacts for each API.
-
Spectro Cloud Launches a Kubernetes-Based Hybrid Cloud Platform
Spectro Cloud, an enterprise cloud-native infrastructure company, launched a platform for managing multiple distributions of Kubernetes. The platform, which bears the company's name, gives customers fine-grained control, flexibility and multi-cloud capabilities for their Kubernetes stack, along with the ease of use and scalability of a managed SaaS platform.
-
Reimagining CI/CD Pipelines as Composable Blocks with Bryan Liles
Bryan Liles, senior staff engineer at VMware, spoke at DeliveryConf about patterns and recommendations for building CI/CD pipelines. Liles recommends thinking about CI/CD in terms of patterns rather than implementations, such as merely using Jenkins or Spinnaker. It should be possible to build a platform from composable blocks with replaceable components that is agnostic to the technology stack.
-
DNSSEC Signing Potentially Interrupted by Coronavirus
The DNSSEC signing process, which has taken place every three months for the last ten years, is unlikely to go ahead because of travel restrictions caused by the coronavirus. Read on to find out what the problems are, and what the plan is for keeping DNSSEC running after summer 2020.
-
Google Announces Cloud AI Platform Pipelines to Simplify Machine Learning Development
In a recent blog post, Google announced the beta of Cloud AI Platform Pipelines, which provides users with a way to deploy robust, repeatable machine learning pipelines along with monitoring, auditing, version tracking, and reproducibility.
-
WebDriverIO Version 6 Release Adds Native Chrome DevTools Automation Protocol Support
The recent release of WebDriverIO version 6, a browser test automation framework for Node.js, adds Chrome DevTools protocol testing to its existing support for WebDriver and makes it easier to leverage tools like Puppeteer and Cypress.io.
-
OVHcloud's Harbor Kubernetes Operator Becomes Part of CNCF’s goharbor Project
OVHcloud released their Kubernetes operator for the Harbor container registry as open source under the CNCF's goharbor project.
-
Amazon Introduces a New Feature for ElastiCache for Redis: Global Datastore
Amazon recently announced Global Datastore, a new feature of Amazon ElastiCache for Redis that provides fully managed, fast, reliable and secure cross-region replication.
-
Improving Incident Management through Role Assignments and Game Days
John Arundel, principal consultant at Bitfield Consulting, shared his thoughts on how to ensure incidents are handled smoothly and quickly. He suggests assigning specific roles to each team member responding to the incident. Red team versus blue team exercises can also be leveraged to ensure the team is prepared to respond accurately and quickly.
-
Amazon Introduces Bottlerocket, a Linux-Based OS for Container Hosting
Recently, Amazon announced a new Linux-based open-source operating system (OS) called Bottlerocket, which is purpose-built to run containers. Bottlerocket is currently in public preview as an Amazon Machine Image (AMI) for Amazon Elastic Compute Cloud (EC2) for customers to try out.