Hive co-creator Ashish Thusoo describes the Big Data challenges Facebook faced and presents solutions in two areas: reducing the data footprint and reducing CPU utilization. With 300 to 400 terabytes generated per day, Facebook stores RC files as blocks, but stores the data as columns within each block to get better compression. He also talks about the current Big Data ecosystem and trends for companies going forward.
Gil Tene talks to Charles Humble about different garbage collection techniques, and specific collectors including Azul's C4, IBM's Balanced GC, and Oracle's Garbage First, before moving on to discuss both the JCP and OpenJDK.
Two of ThoughtWorks’ finest, Martin Fowler and Jez Humble, talk about the notion of Continuous Delivery, which enables organizations to build software that is production-ready at all times. To do this, enterprises automate the build, deployment, and testing processes, and improve collaboration between developers, testers, and operations. The duo discusses a variety of related issues.
Cliff Click discusses the Pauseless GC algorithm and how Azul's Zing implements it on plain x86 CPUs. Also: what keeps dynamic languages slow on the JVM, invokedynamic, concurrency and much more.
Hilary Mason, interviewed by Ryan Slobojan, discusses the engineering behind bit.ly and its use of machine learning in its system architecture. Hilary also talks about bit.ly's use of MySQL and MongoDB to manage terabytes of information about users and clicks, and what that means for performing real-time anthropological analysis of the human condition.
Jon Brisbin discusses his experience with virtualization and the reasons why companies would use private clouds, e.g. regulatory compliance. Also: the future role of operations, monitoring, and more.
In this interview, Gil Tene dives deep into the history of Azul Systems and its commitment to delivering robust, scalable Java systems. He recounts the origins of the company and its early Vega hardware. Tene also talks about the new Zing elastic runtime platform for Java apps, the Managed Runtime Initiative that Azul launched, Pauseless GC, and elastic memory.
Adrian Cole discusses his jclouds project, an open source library that helps Java developers get started in the cloud and reuse their Java development skills. Cole also talks about some of the challenges of creating a cloud-agnostic library, such as the use of different hypervisors and the fact that various cloud implementations are written in different languages, including VB, Python, and Ruby.
John Leach explains how Brightbox uses virtualization in the data center and whether virtualization causes performance problems. Also: a look at a few Unix tools and Linux features that Ruby developers might not know about.
Karl Rumelhart explains VMware's virtualization products and how they relate to cloud solutions, Open Source at VMware, the SpringSource acquisition, and much more.
In this interview from SpringOne 2009, Javier Soltero discusses the challenges of managing a complex Java application, the Hyperic toolset, out-of-the-box versus developer-built application management and monitoring hooks, the effect of both the SpringSource and the VMware acquisitions on Hyperic development, and the result of combining SpringSource's and VMware's offerings.
Adam Wiggins describes the experience of building Heroku on top of Amazon EC2, the pros and cons of virtualization, and the importance of automation.