Cloud Foundry: Design and Architecture
Derek Collison discusses the goals, the design premises and patterns employed in creating the architecture of Cloud Foundry, VMware’s open source PaaS, unveiling internal architectural details.
The content has been bookmarked!
There was an error bookmarking this content! Please retry.
Posted by Ron Bodkin on Jan 28, 2011
JasperSoft this week announced support for reporting against big data systems, including support for a variety of modes for reporting on Hadoop, several popular NoSQL databases, as well as three MPP analytic relational databases. They noted they are supporting:
InfoQ interviewed Andrew Lampitt, Senior Director of Technology Alliances at JasperSoft about the release.
Q: How is this announcement different than what other BI vendors are providing?
A: JasperSoft has always allowed reporting against obscure data formats.
In general, the industry has taken a ho-hum approach to reporting on Hadoop, using Hive to execute SQL queries against Hadoop. JasperSoft has added support for reporting against files in HDFS or directly against HBase, and also against various No SQL flavors..
Q: Have you benchmarked the performance?
A: These are first or second generation connectors, and not meant to be production quality. JasperSoft has collaborated with the vendors, if any, else project owners to produce a first cut of reporting. JasperSoft have talked to some prospects or existing customers, to get 2nd or 3rd level of feedback.
Q: What's the level of adoption or evaluation of these connectors?
A: We either have existing customers using it, or it's new stuff for which we're seeking feedback. Partly this announcement was to generate awareness. We're working with customers and vendors, to learn what are the most demanding corporate reporting requirements
Q: What new capabilities does the release include?
A: The JasperSoft connectors provide
JasperSoft allows bringing the file into memory and manipulating it there. However, analysis against nodes in a graph database [ed: such as Neo4J] is quite different than a key-value store.
Q: Do you support reporting against summaries or star schemas in non-traditional formats?
A: I'm not sure it does. Reporting against an operational system is very different than against a warehouse.
For MongoDB or Riak - you can manipulate data at GUI-level e.g., summarization, but it's not a traditional analysis-situation.
We look at NoSQL as new options for OLTP.
If I'm a developer using Hadoop and want to look at a bit of data, it will let me run some reports against the file system.
A: Yes... (the) limitation is the memory? It's not necessarily loading all this data into the client browser, but it always loads the whole thing on the server (JasperReports Server).
Q: Is there any way to apply a filter or to minimize the size of the data set brought back in files?
A: Anything is possible... but it's not something we'll probably solve soon. It's analogous to issues with a local CSV file. In general you always bring it all into memory. It's not obvious what a good technique for filtering a file will be.
Ron Bodkin is the Founder of Think Big Analytics, which builds big data solutions using Hadoop and NoSQL.
Introducing SQLFire: a memory-optimized, high performance SQL database
Improving Software Delivery Cycles: Pre-requisites and Inhibitors
VMware vFabric SQLFire - Test drive the data management system with memory speed, horizontal scalability and a familiar SQL interface
Derek Collison discusses the goals, the design premises and patterns employed in creating the architecture of Cloud Foundry, VMware’s open source PaaS, unveiling internal architectural details.
Andrew Watson talks about the work of the OMG, where CORBA is alive and well (hint: in your car), UML and UML Profiles vs. custom Modeling languages, DDS and other middleware, and much more.
Sohil Shah discusses creating iPhone and Android enterprise mobile applications based on cloud services using the open source platform OpenMobster.
Paul Sanford presents the transformations supported by data throughout its life cycle, and how that can be better done with Splunk, an engine for monitoring and analyzing machine-generated data.
A common “best practice” for unit tests is to only write a one assertion in each test. I intend to question this advice by showing that multiple assertions per test are both necessary and beneficial.
John Rauser presents the architectural and technological evolution of Amazon retail websites starting with 1994 and ending with adopting Amazon Web Services.
Michael Stal discusses system architecture quality, how to avoid architectural erosion, how to deal with refactoring, and design principles for architecture evolution.
Every developer has had to integrate with another system, API or component. Tis article provides strategies to handle the change and for he separating system boundaries.
No comments
Watch Thread Reply