Cloud Foundry: Design and Architecture
Derek Collison discusses the goals, the design premises and patterns employed in creating the architecture of Cloud Foundry, VMware’s open source PaaS, unveiling internal architectural details.
The content has been bookmarked!
There was an error bookmarking this content! Please retry.
Posted by Abel Avram on Oct 21, 2011
Apache has announced the release of Cassandra 1.0.0, the first major milestone of the distributed column-based data store coming with data compression and several performance improvements and optimizations.
Cassandra, a top open source Apache project, is a scalable database management system built on the distributed nature underlying Amazon’s Dynamo and Google’s BigTable column-based data model. Cassandra was initially developed by Facebook who open sourced it in 2008 and it has been under development ever since. Apache recently announced their first major milestone, Cassandra 1.0.0, with a number of new features, improvements and bug fixes.
One of the main features requested was compression. Cassandra 1.0 supports data compression per ColumnFamily which is a container for columns similar to a table in a traditional relational DB. The compacting process runs in the background, having no negative impact on the node’s response time. Compression has reduced the space occupied by data by 2 to 4 times, resulting in indirect 25-35% performance improvement on reads and 5-10% performance improvement on writes, according to Pavel Yaskevich.
Performance has seen a major improvement from Cassandra 0.6 to 1.0, according to Jonathan Ellis, Apache VP of Apache Cassandra Project: “Compared to the current release this time in 2010, we’ve increased our write performance a respectable 40%. But the real area we wanted to focus on improving was read performance, which we succeeded in increasing a phenomenal 400%!”
Other improvements include:
Improvements to memory management through off-heap caches, arena allocation and automatic self-tuning, for less GC pauses and more predictable performances. (Details)
Better disk-space management: better control of the space taken by commit logs and immediate deletion of obsolete data files.
New optional leveled compaction strategy with more predictable performance and fixed SStable size (Details).
Improved hinted handoffs, leading to less need for read repair for better read performances.
Lots of improvements to performance (details), CQL, repair, easier operation, etc.
Cassandra is currently in use at Cisco, Digg, Facebook, Rackspace, Reddit, Twitter, and other companies, the largest production cluster having over 100 TB of data in over 150 machines.
Derek Collison discusses the goals, the design premises and patterns employed in creating the architecture of Cloud Foundry, VMware’s open source PaaS, unveiling internal architectural details.
Andrew Watson talks about the work of the OMG, where CORBA is alive and well (hint: in your car), UML and UML Profiles vs. custom Modeling languages, DDS and other middleware, and much more.
Sohil Shah discusses creating iPhone and Android enterprise mobile applications based on cloud services using the open source platform OpenMobster.
Paul Sanford presents the transformations supported by data throughout its life cycle, and how that can be better done with Splunk, an engine for monitoring and analyzing machine-generated data.
A common “best practice” for unit tests is to only write a one assertion in each test. I intend to question this advice by showing that multiple assertions per test are both necessary and beneficial.
John Rauser presents the architectural and technological evolution of Amazon retail websites starting with 1994 and ending with adopting Amazon Web Services.
Michael Stal discusses system architecture quality, how to avoid architectural erosion, how to deal with refactoring, and design principles for architecture evolution.
Every developer has had to integrate with another system, API or component. Tis article provides strategies to handle the change and for he separating system boundaries.
No comments
Watch Thread Reply