BT
DevOps Follow 972 Followers

Q&A with the Creator of Checkless, a Low-Cost, Simple Site Monitoring Tool

by Matthew Campbell Follow 0 Followers on  Sep 19, 2018

Steve Elliott wanted a simple, cheap way to monitor uptime for his websites. He found most off-the-shelf tooling to either be too complex or too costly. This lead him to build Checkless, a serverless tool that can monitor sites for uptime via ping-based checks and depending on your usage, can potentially be free to use.

DevOps Follow 972 Followers

Auth0's Move to a Single-Cloud Architecture on AWS

by Hrishikesh Barua Follow 15 Followers on  Aug 25, 2018

Auth0, a provider of authentication, authorization and single sign on services, moved their infrastructure from multiple cloud providers (AWS, Azure and Google Cloud) to just AWS. An increasing dependency on AWS services necessitated this, and today their systems are spread across four AWS regions, with services replicated across zones.

DevOps Follow 972 Followers

Prometheus Monitoring Platform "Graduates" from the Cloud Native Computing Foundation (CNCF)

by Kent Weare Follow 11 Followers on  Aug 19, 2018

On August 9th, the Cloud Native Computing Foundation (CNCF) announced open source monitoring toolkit, Prometheus, has graduated from its incubation status. In order to achieve this rating, projects must demonstrate growth, documentation, organized governance processes, commitment to community sustainability and inclusivity.

DevOps Follow 972 Followers

Uber Open Sources Its Large Scale Metrics Platform M3

by Hrishikesh Barua Follow 15 Followers on  Aug 18, 2018 1

Uber’s engineering team released its metrics platform M3 as open source which it has been using internally for some years. The platform was built to replace its Graphite based system, and provides cluster management, aggregation, collection, storage management, a distributed time series database (TSDB) and a query engine with its own query language M3QL.

DevOps Follow 972 Followers

How Coinbase Handled Scaling Challenges on Their Cryptocurrency Trading Platform

by Hrishikesh Barua Follow 15 Followers on  Aug 12, 2018

Coinbase, a digital currency exchange, faced scaling challenges on their platform during the 2017 cryptocurrency boom. The engineering team focused on upgrading and optimizing MongoDB, traffic segregation for hotspots to resolve them, and building capture and replay tools to prepare for future surges.

Architecture & Design Follow 2423 Followers

O11ycon Discusses Benefits and Challenges of Observability

by Dylan Schiemann Follow 8 Followers on  Aug 09, 2018

The first o11ycon provides a comprehensive look at the emerging concept of observability in software and systems which allow people to understand if things are working as expected, and to diagnose problems and identify solutions.

DevOps Follow 972 Followers

Plaid.com’s Monitoring System for 9600+ Integrations

by Hrishikesh Barua Follow 15 Followers on  Aug 01, 2018

Plaid.com has integrations with over 9600 financial institutions, and their monitoring challenges arise from the heterogeneous nature of these integrations and as well as their large number. They rebuilt their monitoring system on Kinesis, Prometheus, Alertmanager and Grafana to solve the challenges of scalability and low latency.

DevOps Follow 972 Followers

How SendGrid Scales Its Email Delivery Systems

by Hrishikesh Barua Follow 15 Followers on  Jul 28, 2018 2

SendGrid, a cloud based email service, has seen its backend architecture evolve from a small Postfix installation to a system hosted on their own data-centers as well as on the public cloud. Rewriting of services in Go, a gradual move to AWS, and a distributed Ceph-based queue allows the team to hand over 40 billion emails per month.

DevOps Follow 972 Followers

Instana Releases Sample Microservice Application

by Helen Beal Follow 4 Followers on  Jul 26, 2018

Instana, provider of AI powered monitoring solutions for dynamic containerised microservice applications, announced at QCon New York the release of Stan’s Robot Shop, a sample microservice application that can be used as a sandbox to test and learn about microservice architecture, containerised application orchestration and automatic monitoring techniques.

DevOps Follow 972 Followers

Bloomberg’s Standardization and Scaling of Its Monitoring Systems

by Hrishikesh Barua Follow 15 Followers on  Jul 21, 2018

One of the outcomes of Bloomberg’s adoption of SRE practices across its development teams is the monitoring system, backed by the Cassandra-based Metrictank time-series database, that they put in place.

Cloud Follow 332 Followers

AWS Config Gains Cross-Account, Cross-Region Data Aggregation

by Steffen Opel Follow 4 Followers on  Jun 30, 2018

Amazon Web Services (AWS) recently added the capability to aggregate compliance data produced by AWS Config rules across multiple accounts and/or regions to enable centralized auditing and governance of AWS resources. A new aggregated dashboard view displays non-compliant rules across the organization. Users can then drill down to view details about resources that are violating any rules.

Architecture & Design Follow 2423 Followers

Observability and Microservices: The Need for Effective Tracing and Metrics

by Mark Little Follow 14 Followers on  Jun 17, 2018

Zach Jory has written an article discussing how microservices and service mesh implementations need observability to ensure that developers can build cloud-native applications which scale and can be more easily managed. This ties into a number of articles and interviews we have spoken about over recent months too.

DevOps Follow 972 Followers

AppDynamics Launches New European Software-as-a-Service Offering

by Helen Beal Follow 4 Followers on  Jun 15, 2018

Application intelligence vendor, AppDynamics, has launched a new European Software-as-a-Service (SaaS) offering, built on the Amazon Web Services (AWS) EU (Frankfurt) Region.

DevOps Follow 972 Followers

Understanding Production with DevOps Archeology

by Manuel Pais Follow 9 Followers on  Jun 14, 2018

Lee Fox spoke at Continuous Lifecycle London about tools and methods to help make sense of today’s complex systems and infrastructure; he calls it DevOps archeology.

DevOps Follow 972 Followers

Building Observable Distributed Systems

by Ben Linders Follow 28 Followers on  Jun 12, 2018

Today's systems are more and more complex; microservices distributed over the network and scaling dynamically, resulting in many more ways of failure, ways we can't always predict. Investing in observability gives us the ability to ask questions to systems, things we never thought about before. Some of the tools that can be used for this are metrics, tracing, structured and correlated logging.

Login to InfoQ to interact with what matters most to you.


Recover your password...

Follow

Follow your favorite topics and editors

Quick overview of most important highlights in the industry and on the site.

Like

More signal, less noise

Build your own feed by choosing topics you want to read about and editors you want to hear from.

Notifications

Stay up-to-date

Set up your notifications and don't miss out on content that matters to you

BT