InfoQ Homepage Monitoring Content on InfoQ
-
Gain insight into the performance of your apps with Google Cloud Monitoring
Google Cloud Monitoring is now available for free whilst in beta to all Google Cloud Platform customers. The service provides dashboards and alerts for cloud-powered applications, giving developers and operations staff insight and metrics to their services.
-
Monitoring as a Service
James Turnbull, VP of engineering at Kickstarter and author of The Docker Book, presented at both FOSDEM and Config Management Camp about monitoring, sharing his views on modern, scalable, business oriented monitoring, provided as a service with self service APIs, and integrated in the project development.
-
Amazon CloudWatch Supports JSON Logs and Integrates AWS CloudTrail
Shortly after releasing the AWS CloudTrail Processing Library (CPL), Amazon Web Services has also integrated AWS CloudTrail with Amazon CloudWatch Logs to enable alarms and respective "notifications from CloudWatch, triggered by specific API activity captured by CloudTrail". The implied support for monitoring JSON-formatted logs has recently been officially released as well.
-
Atlas: Netflix's Primary Telemetry Platform
Netflix has open sourced Atlas, part of their next-generation monitoring platform they have been working on since early 2012. The company developed Atlas to store time series data in order to provide near real-time operational insight to teams.
-
State of On-Call Survey
VictorOps published the results of its survey on the state of on-call activities, which it claims to be the first of its kind. The survey includes data about the challenges of being on-call, the way those who are on-call get notified, the tools they use to support incident resolution, the prevalence of false alarms, the average time of each incident resolution and more.
-
Amazon CloudWatch Gains Log Monitoring and Storage
Amazon CloudWatch recently gained log file monitoring and storage for application, operating system and custom logs and meanwhile enhanced support for Microsoft Windows Server to cover a wider variety of log sources.
-
5 years of metrics and monitoring
Lindsay Holmwood made a retrospective about metrics and monitoring in his DevOps Days Belgium talk, listed his typical metrics and monitoring pipeline, exposed some flaws in monitoring systems, and his view of what the future may bring in the field.
-
Lessons Learned Building Distributed Systems at Bitly
At the Bacon Conference last May, bitly Lead Application Developer Sean O'Connor explained the most relevant lessons bitly developers learned while building a distributed system that handles 6 billions clicks per month.
-
How Etsy Deploys More Than 50 Times a Day
Daniel Schauenberg described at QCon London how Etsy, renowned for its DevOps and Continuous Delivery practices, does 50 deploys/day. A fully automated deployment pipeline, thorough application monitoring and IRC-based collaboration are all important to achieve this rate of change while keeping risk to a minimum. Etsy has about 60 million monthly visits and 1.5 billion page views per month.
-
Discussion on Nagios Fitness for Purpose
At a recent London DevOps meetup, Andy Sykes launched a debate on whether Nagios, a well-known application that offers monitoring and alerting services, should be replaced with a better solution. Laurie Denness, from Etsy, argued in a reply that Nagios and its ecosystem still are a great solution in the monitoring and alerting arena.
-
LINQ To Logs And Traces
Microsoft Open Technologies recently announced the release of Tx, an open source project that can help debugging using Logs/Traces, and building of real-time monitoring and alerting systems.
-
Forecasting at Twitter
Arun Kejariwal, from Twitter, talked at Velocity Conf London last month about forecasting algorithms used at Twitter to proactively predict system resource needs as well as business metrics such as number of users or tweets. Given the dynamic nature of their data stream, they found that a refined ARIMA model works well once data is cleansed, including removal of outliers.
-
Mission Control and Flight Recorder on HotSpot JVM
Since the Java 7 Update 40 release, Mission Control and Flight Recorder are shipped with the JDK. Mission Control is the starting place for monitoring, management and troubleshooting, while Flight Recorder is the facility to collect and evaluate profiling data. Both tools have been available for JRockit and are now finally ported to HotSpot.
-
Community-Driven Research: Real World Ruby on Rails Usage RFP
As part of InfoQ's ongoing Community Driven Research project, we want to find out how developers are using Ruby on Rails in practice. In this first step, we want to know what you use so that we can collect suggestions for the voting.
-
CRaSH: An Extensible Command Line Shell For Monitoring A Running JVM
The Common ReusAble SHell (CRaSH) is an interactive shell (with history support and autocompletion) that attaches to a running JVM and can execute several commands for retrieving JVM statistics or changing JVM internals on the fly. It can be used for remote monitoring and administration of existing Java applications and it is fully extensible via custom Groovy scripts.