In this article, author Oleksii Tymchenko discusses a bio-informatic software as a service (SaaS) product called Chorus, which was built as a public data warehousing and analytical platform for mass spectrometry data. Other features of the product include real-time visualization of raw mass-spec data.
Oozie Eclipse plugin is a new tool for editing Apache Oozie workflows graphically inside Eclipse. Usage of this plugin allows to skip hard to develop and maintain process definition in HPDL. Instead a process graph is defined graphically by placing process actions on pallet and connecting them. An article introduces Eclipse Oozie plugin and provides an example of its usage.
Columnar data storage can offer significant performance improvements over the way database tables are traditionally stored, but they aren’t always faster. Aleksandr Shavlyuga explores the power, and limitations of SQL Server’s ColumnStore Indexes.
In this article, author Carlos Bueno discusses the strategies for estimating the server capacity for big data projects and initiatives, with the help of two case studies.
This article covers machine learning and cognitive computing, and how they are related to artificial intelligence (AI). Panelists discuss how this technology is applied in digital marketing space.
Author discusses how to design an Internet-connected garage door opener ("IoT opener") to be secure and the security infrastructure for IoT devices, like authentication & access policy enforcement.
An interview with Google's William Vambenepe, who's lead product manager for Big Data services, to ask him about the shift from products to services when working with Big Data.
In this article, author describes the role of data analytics in healthcare space with emerging wearable technologies like biophysical interfaces, physiological sensors and embedded diagnostic tools.
In this article, based on a research study, author presents big data as service-oriented and evolutionary case of disruptive IT-enabled services (IESs) rather than as datasets.
In this article, author discusses evolving technologies like Machine Learning and IoT, how organizations can benefit from these new sources of information and intelligence.
In this article, Carlos Bueno describes a method for analyzing constraints on the shape and flow of data in systems. He also discusses details of two use cases, movie streaming and face recognition.
Spark SQL, part of Apache Spark, is used for structured data processing by running SQL queries on Spark data. Srini Penchikala discusses Spark SQL module & how it simplifies data analytics using SQL. 1