Cloud Foundry: Design and Architecture
Derek Collison discusses the goals, the design premises and patterns employed in creating the architecture of Cloud Foundry, VMware’s open source PaaS, unveiling internal architectural details.
Posted by Scott Delap on Jul 27, 2006
Nuescheler explained how a content repository can fit into a typical web application. A JCR offers transactionality, scalability, and query support for structured data similar to those of an RDBMS, but adds support for unstructured content, built-in versioning, and "node-based" access control.
The Apache Jackrabbit project is an open source implementation of JSR 170. InfoQ asked Nuescheler about the differences between Jackrabbit and CRX:
CRX can be considered as the commercialized Jackrabbit with a number of optimizations, added features, and easy-to-use packaging. CRX also features commercial-grade support, release management and product management. Since, feature-wise, Jackrabbit is the only complete open source JCR implementation, one can also expect the complete JCR feature set in CRX.
To increase performance for demanding environments, CRX 1.2 leverages the Apache Derby Persistence Manager. When asked why CRX is using Apache Derby instead of Berkeley DB, Nuescheler responded:
a number of projects including Jackrabbit and also JDK 1.6 are using Derby. As a result it is a well-tested and future-proof default persistence layer...
The new release of CRX also adds Active / Passive Clustering. This features a central, high-availability database server for permanent storage, and two CRX servers for non-permanent storage. Nuescheler commented:
active/passive clustering allows hot failover from the active cluster node to the passive cluster node, which runs on the same persistence layer, for example a shared RDBMS. This allows us to avoid a single point of failure on the repository layer. This new clustering implementation improves cache management and performance in environments with large volumes of data and transaction loads.
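The active/passive arrangement described above can be illustrated with a minimal Java sketch. This is not CRX or Jackrabbit code: every class and method name here (SharedStore, RepositoryNode, Cluster, and so on) is hypothetical, and an in-memory map stands in for the shared RDBMS. It only shows the idea that both nodes write through to one persistence layer, so the passive node can take over without losing saved content.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical names throughout; none of these are CRX or Jackrabbit APIs.
class FailoverSketch {

    /** Stands in for the shared persistence layer (for example a shared RDBMS). */
    static class SharedStore {
        private final Map<String, String> rows = new HashMap<>();

        synchronized void write(String path, String value) { rows.put(path, value); }

        synchronized String read(String path) { return rows.get(path); }
    }

    /** A repository server that keeps only a non-permanent cache locally. */
    static class RepositoryNode {
        final String name;
        private final SharedStore store;
        private final Map<String, String> cache = new HashMap<>();
        private boolean alive = true;

        RepositoryNode(String name, SharedStore store) {
            this.name = name;
            this.store = store;
        }

        void save(String path, String value) {
            if (!alive) throw new IllegalStateException(name + " is down");
            store.write(path, value); // the permanent copy goes to the shared store
            cache.put(path, value);   // the local copy is only a cache
        }

        String load(String path) {
            if (!alive) throw new IllegalStateException(name + " is down");
            // On a cache miss, fall back to the shared store.
            return cache.computeIfAbsent(path, store::read);
        }

        void crash() {
            alive = false;
            cache.clear(); // non-permanent state is lost with the node
        }

        boolean isAlive() { return alive; }
    }

    /** Routes requests to the active node and swaps roles when it is down. */
    static class Cluster {
        private RepositoryNode active;
        private RepositoryNode passive;

        Cluster(RepositoryNode active, RepositoryNode passive) {
            this.active = active;
            this.passive = passive;
        }

        void save(String path, String value) { current().save(path, value); }

        String load(String path) { return current().load(path); }

        String activeName() { return current().name; }

        private RepositoryNode current() {
            if (!active.isAlive()) { // hot failover: promote the passive node
                RepositoryNode failed = active;
                active = passive;
                passive = failed;
            }
            return active;
        }
    }

    public static void main(String[] args) {
        SharedStore store = new SharedStore();
        RepositoryNode a = new RepositoryNode("node-a", store);
        RepositoryNode b = new RepositoryNode("node-b", store);
        Cluster cluster = new Cluster(a, b);

        cluster.save("/content/home", "Welcome");
        a.crash(); // the active node fails
        // prints "node-b serves Welcome"
        System.out.println(cluster.activeName() + " serves " + cluster.load("/content/home"));
    }
}
```

In this sketch the shared store is itself a single component, which matches the article's description of a central, high-availability database server sitting behind the two CRX servers.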
We are working at the forefront of making the Java content repository market viable for commercial-grade applications. This is similar to what Oracle did to the DB market in the early days of SQL. We are responding to the CRX community's needs and optimizing for those cases. This means, for example, that we support smart batch reads and batch writes (bundles) that enhance scalability drastically.
a number of projects including Jackrabbit and also JDK 1.6 are using Derby. As a result it is a well-tested and future-proof default persistence layer...
There are so many voices speaking about Derby and the decision to include it in the JDK that I really wonder about the technical details. Also, I would say that the two cannot be compared from a licensing point of view, and as far as I know this was the reason for Jackrabbit to work by default on Derby.
I would add that from my knowledge and our tests (indeed very custom), Berkeley always performed better than Derby.
./alex
--
.w( the_mindstorm )p.
I would add that from my knowledge and our tests (indeed very custom), Berkeley always performed better than Derby.
I don't know about Berkeley. But Derby is very slow compared to other databases for simple queries. This 'performance problem' has been there for a long time. I published performance results around 1998 (when the product and company were called Cloudscape), but then somebody told me I 'must remove them'.
See also my current performance test results (with source code) at H2 Database Engine / Performance.
I don't know about Berkeley.
Berkeley is somewhat different from a normal RDBMS, so I guess it wouldn't make much sense to include it in your reports. However, for JCR, and I guess I can refer here to both Jackrabbit and CRX, Berkeley should be included in performance tests. At least this is my opinion.
But there may be other details behind this decision that are not about performance.
./alex
--
.w( the_mindstorm )p.