At its heart, SQL is a domain specific language designed to allow non-professional programmers to query databases and write ad hock reports. When a company moves from a relational database to a NoSQL offering the need for ad hock reporting doesn’t go away, it just becomes harder. Simba’s ODBC drivers shift the power back into the hands of the users.
Few months back, Microsoft announced HDInsight, Microsoft’s Hadoop distribution for managing, analysing and making sense out of large volumes of data. InfoQ connected with Val Fontama, Senior Product Marketing Manager for SQL Server, to know more about how the Enterprise Big Data @ Microsoft story is panning out.
Netflix has released Hystrix, a library designed to control points of access to remote systems, services and 3rd party libraries, providing greater tolerance of latency and failure. Hystrix features thread and semaphore isolation with fallbacks and circuit breakers, request caching and request collapsing, and monitoring and configuration.
In his new blog post Hortonworks Vice President of Corporate Strategy Shaun Connolly discusses the importance of Apache Ambari incubation project and the main milestones achieved by the project in 2012: simplified cluster provisioning, pre-configured key operational metrics, job execution visualization, a RESTful API and an intuitive UI.
Last week, 6,000 attendees from around the globe were in Las Vegas, NV for the first-ever Amazon Web Services (AWS) re:Invent conference. InfoQ was there to interview thought leaders and identify the key messages of the conference.
Facebook has open sourced Corona, an in-house developed improvement to Hadoop MapReduce scheduling software.
Several new Hadoop-based frameworks where announced during this year O’Reilly Strata Conference + Hadoop World 2012 in New York last week.
Scalability vs distributed transactional semantics,is no longer a compromise as per Google's research work on Spanner. Spanner's features include non-blocking reads, lock-free read only transactions and atomic schema changes across a globally replicated relational database. The central idea that tackles the latency issues with distributed transactions is the exposure of clock uncertainty.
Apache new project Drill is aimed to support real-time interactive analysis of large-scale (terabytes size) data sets.
Recently Cap Gemini's Steve Jones has written an article on how he believes that thinking about solutions to problems is less important these days than jumping on the latest hype bandwagon. Although he uses REST and Big Data as examples, he believes it goes beyond any single technology and that eventually IT will no longer belong to IT people.
In their presentation posted at InfoQ systems and data architects Ben Stopford, Farzad Pezeshkpour and Mark Atwell show how RBS leveraged new technologies in their architectures while facing difficult challenges such as regulation, competition and tighter budgets. They also need to cope with stringent technical challenges, for instance with efficiency and scalability.
Want to try out Hadoop with the Microsoft Stack and figure out what capabilities this brings to you? We point to some resources that can help.
VMware have announced the availability of Spring Hadoop, which integrates the Spring Framework and the Apache Hadoop platform.
In his new article “MapReduce Patterns, Algorithms, and Use Cases”, Ilya Katsov gives a systematic view of the different MapReduce patterns, algorithms and techniques that can be found on the web or in scientific articles along with several practical use case studies.
After six years of gestation, Big data framework Apache Hadoop 1.0.0 was recently released. Core features in the release include Kerberos Authentication, support for Apache HBase and RESTful API to HDFS. InfoQ spoke with Arun Murthy, VP of Apache Hadoop, about the new release.