Cloud Foundry: Design and Architecture
Derek Collison discusses the goals, the design premises and patterns employed in creating the architecture of Cloud Foundry, VMware’s open source PaaS, unveiling internal architectural details.
The content has been bookmarked!
There was an error bookmarking this content! Please retry.
Posted by Kostis Kapelonis on Oct 12, 2011
Chronon Systems has recently released version 2 of their recording JVM debugger which allows for post execution logging.
Chronon is a recording JVM debugger that monitors systems in production, recording their internal state while active. Afterwards it can replay the application flow pin-pointing the exact cause of a bug. The Chronon time travelling debugger was already covered by InfoQ in a previous post.
Version 2 of the debugger supports post execution logging, meaning that programmers now can "insert" logging statements into code after the application has already run. Because Chronon does not actually run the application, but simply shifts it from one state into another, post execution logging has an extra advantage compared to traditional logging. The results are instant. Even if your program ran for 5 hours and you put a logging statement on a line that is normally hit after 4 hours, Chronon will instantly show you the results. There is no waiting for the issue to happen so that the log is triggered.
Logging levels in a running Java application are pre-defined and binding. While they can change on the fly even when a system is already running in production, any change only affects future logging statements. If a bug appeared when the log level was too high then logs have incomplete information. This forces programmers to include excessive logging into an application in order to catch most program states. The application is usually deployed with the maximum possible level (e.g. TRACE) and as it matures and programmers feel more confident, they decrease the verbosity of logging into a higher level (e.g. INFO).
This however has a heavy toll on production performance since logging in big enterprise applications can have noticeable impact. In the worst case scenario a serious error can manifest when the logging level has already been raised (e.g. to WARN) leaving developers with little or no clues with what happened. In this scenario they have to change the logging level again and attempt to reproduce the issue in the production system in order to capture meaningful log information.
However with post execution logging a programmer does not have to constrain the program to a specific level. This concept can be taken to the extreme by removing all logging statements altogether allowing for maximum performance.
InfoQ contacted Prashant Deva (CTO, Founder) for an explanation on how this works:
Chronon knows the entire state of your application at any point in time since it has recorded it in production. Using this information Chronon can look at any method/line of your code and tell you the exact time at which it was hit. When you put Post Execution Logging statements on a line, Chronon sees all the times at which that line was hit and gets the state of the program at that point in time. This way it can calculate the value of any variables or expressions you put in your logging statement. Since you can put logging statements on different lines of code, once it gets the results of the logging statements on each line, all it has to do is sort the results by time and you get a log output that is exactly 100% like if the log statements actually were present in your program when it was executing.
So essentially Chronon prints out what would happen if you had placed logging statements in the code for this particular recorded time period. This gives great flexibility to programmers because now they can insert logging statements anywhere into code, knowing that if they need additional information (lower level or more statements) after an issue has hit the production system, they can do so without redeploying/reproducing it again.
Prashant clarifies:
There are no gimmicks. The logging statements you put in are, in a sense 'executed', exactly as if they were actually present in code. For example, they know the thread on which they are being executed; if you put them on a line inside say an 'if' block and that line is hit only 5 times out of the 10 calls of that method, then the log output will only show 5 log statements.
More details on Chronon can be found in the official FAQ.
Tutorial: Integrating SQLFire with tc Server and Spring Data
Banking Case Study: Scaling with Low Latency using NewSQL
App Server Evolution: REST, Cloud, and DevOps Support in Resin 4
Introducing SQLFire: a memory-optimized, high performance SQL database
VMware vFabric SQLFire - Test drive the data management system with memory speed, horizontal scalability and a familiar SQL interface
This is amazing !
I worked in support a few years ago, I dreamed of something like that.
Thank you for making it real.
Just thought I would put this link to our blog post explaining more details and motivations behind Post Execution Logging:
eblog.chrononsystems.com/chronon-20-fixes-every...
Thank you.
Glad you like it :)
It would be nice, to have netbeans integration.
No current plans for Netbeans. You can still use Chronon though by using the Recording Server for recording, and Eclipse with Chronon, for playback.
Also if you want, you can vote for Netbeans support here:
community.chrononsystems.com/chronon_systems/to...
Derek Collison discusses the goals, the design premises and patterns employed in creating the architecture of Cloud Foundry, VMware’s open source PaaS, unveiling internal architectural details.
Andrew Watson talks about the work of the OMG, where CORBA is alive and well (hint: in your car), UML and UML Profiles vs. custom Modeling languages, DDS and other middleware, and much more.
Sohil Shah discusses creating iPhone and Android enterprise mobile applications based on cloud services using the open source platform OpenMobster.
Paul Sanford presents the transformations supported by data throughout its life cycle, and how that can be better done with Splunk, an engine for monitoring and analyzing machine-generated data.
A common “best practice” for unit tests is to only write a one assertion in each test. I intend to question this advice by showing that multiple assertions per test are both necessary and beneficial.
John Rauser presents the architectural and technological evolution of Amazon retail websites starting with 1994 and ending with adopting Amazon Web Services.
Michael Stal discusses system architecture quality, how to avoid architectural erosion, how to deal with refactoring, and design principles for architecture evolution.
Every developer has had to integrate with another system, API or component. Tis article provides strategies to handle the change and for he separating system boundaries.
5 comments
Watch Thread Reply