InfoQ

InfoQ

Topic/Tag specific view

Reliability Content on InfoQ


Latest featured content about Reliability

Reliability Engineering Matters, Except When It Doesn't

Topics
QCon San Francisco 2011,
Scalability,
QCon,
Operations,
Conferences,
Infrastructure,
Performance & Scalability,
Reliability

Michael Nygard shares essential Reliability Engineering techniques that can keep systems from falling apart, but the discipline has some limitations to be considered.

News about Reliability

X-Mas Showcase: High Scalability and Usability Rule

Topics
Software Craftsmanship,
Scalability,
Useability,
Architecture Management,
Agile,
Pragmatic Thinking,
Big Data,
Performance & Scalability,
Reliability

Who ever has wondered what kind of software is used by Santa Claus & Co, got a hint recently in youtube. This might irritate some software engineers who have assumed, Santa Claus would only use Open Source Software.

Cooperation between European Space Agency and Lero

Topics
Quality,
Software Craftsmanship,
Architecture,
Research,
Reliability,
Model Driven Engineering,
Safety,
Architecture Analysis,
Product Line Engineering,
Verification,
Agile

As announced on 18th August 2011, the Irish Software Engineering Research Center (Lero) has signed a €300.000 contract for a research project with the European Space Agency (ESA). Goal of the research activities is to provide a solution framework for future space missions.

Footsteps: Deterministic Logging and Replay for JavaScript

Topics
Ruby,
Javascript,
Java,
Dynamic Languages,
Web Development,
Languages,
.NET,
Programming,
Reliability,
Research,
Microsoft Research

Debugging event driven applications has always been notoriously difficult. The research project Footsteps project seeks to address the problems of reproducibility by offering a logging and replay framework that records non-deterministic events such as mouse clicks and random number generation. No plugins or special browsers are needed, this done entirely with JavaScript.

MongoDB 1.8 Improves Reliability with Journaling

Topics
Ruby,
MongoDB,
Java,
Dynamic Languages,
Distributed Document Oriented Database,
NoSQL,
Languages,
.NET,
Reliability,
Programming,
Database,
Architecture

MongoDB's new journaling feature improves reliability with write-ahead redo logs. Log entries are written before permanent storage is updated. When a server restarts after a crash outstanding journal files will be replayed before the server goes online. Other changes include sharding performance boosts, shell tab completion, and the addition of covering and sparse indexes.

Articles about Reliability

QoS for Applications: A Resource Management Framework for Runtimes

Topics
Ruby,
Runtimes,
Dynamic Languages,
Quality,
Java,
Software Craftsmanship,
Languages,
Architecture,
Services,
Agile,
Programming,
Reliability

This article draws an analogy between QoS for networks and for applications, resulting in a mapping guide between the two and introducing a production solution for Java, (J)Ruby, and (J)Python apps.

Presentations about Reliability

On Distributed Failures (and handling them with Doozer)

Topics
Strange Loop 2011,
Strange Loop,
Distributed Systems,
Reliability,
Failure,
Architecture,
Conferences

Blake Mizerany presents various ways that can lead to system failure in distributed systems and how to recover using Doozer, a highly available, consistent data store.

Let It Crash ... Except When You Shouldn't

Topics
QCon London 2011,
Erlang,
QCon,
Functional Programming,
Reliability,
Conferences,
Programming

Steve Vinoski explains how to avoid some of the Erlang errors that can bring down a system starting from the premise that not all the crashes are welcome as the “Let It Crash” philosophy might suggest.

Building Reliable Systems from Unreliable Components

Topics
QCon London 2011,
QCon,
SOA,
Architecture,
Enterprise Architecture,
Reliability,
Services,
Conferences

Arnon Rotem-Gal-Oz discusses creating a SOA implementation that maintains a good overall reliability in spite of using smaller and a larger number of components.

Interviews about Reliability

Jonas Bonér and Kresten Krab Thorup on Bringing Erlang's Fault Tolerance and Distribution to Java with Akka and Erjang

Topics
Scala,
Akka,
OOP,
Dynamic Languages,
Erlang,
JVM Languages,
Functional Programming,
Fault Tolerance,
Languages,
Java,
Scalability,
Methodologies,
Programming,
Reliability,
Infrastructure,
Distributed Programming,
Erlang Factory 2011,
Erjang,
Actors,
Performance & Scalability

Jonas Bonér and Kresten Krab Thorup discuss some key aspects of Erlang like fault tolerance and reliability and how the Akka and Erjang projects try to bring them to the JVM.

Jez Humble on Continuous Delivery

Topics
Devops,
Customers & Requirements,
Lean Startup,
Domain-Driven Design,
Feedback,
Continuous Integration,
Automated testing,
Project Management,
Infrastructure,
IT Service Management,
Agile in the Enterprise,
Business,
Design,
Lean,
Methodologies,
Agile Techniques,
Continuous Improvement,
Automation,
TDD,
Agile Test,
Software Craftsmanship,
Agile,
Cloud Computing,
Programming,
Testing,
Reliability,
BDD,
DVCS,
Agile2011

In this interview at Agile 2011, Jez Humble discusses continuous delivery and the deployment pipeline, emphasizing the importance of feedback and automating tests at every level to validate deployments. Gone are the days of massive acceptance test scripts. He also talks about the evils of feature branching, and speaks on the DevOps practices to collaborate all the way through the delivery cycle.