Big Data Analytics with Spark book, authored by Mohammed Guller, provides a practical guide for learning Apache Spark framework for different types of big-data analytics projects, including batch, interactive, graph, and stream data analysis as well as machine learning. InfoQ spoke with author about the book & development tools for big data applications.
Coding in Java, buying SAP, deploying OpenStack, and using Amazon Web Services: each one introduces a type of lock-in. However, it makes no difference how hard you try- some form of lock-in is unavoidable. What matters most is understanding the layers of lock-in, and how to assess and reduce your switching costs.
DataStax recently announced a new product called DataStax Graph to store graph data models. It's based on open source Titan graph database and uses Apache Tinkerpop framework's Gremlin query language. InfoQ spoke with Martin Van Ryswyk about the new product.
In this fourth installment of Apache Spark article series, author Srini Penchikala discusses machine learning concept & Spark MLlib library for running predictive analytics using a sample application.
Reveno is a powerful new, performant, JVM based lock-free transaction processing framework based on CQRS and event-sourcing patterns. In this article we develop a simple trading system using Reveno. 2
Data Science has been getting lot of attention as organizations are starting to use data analytics to gain insights into their data. This article takes a closer look at Data Scientist role in 2016.
Current enterprise data architectures include NoSQL databases co-existing with RDBMS. In this article, author discusses a solution for managing NoSQL & relational data using unified data modeling. 5
Lana Gibson gave a talk at the AgileNZ conference on using analytics data to design web content, based on her experiences as Content Performance Lead working on the GOV.UK whole of government website
Java performance issues are often attributable to bad database access patterns. In this article a top performance field engineer demonstrates his patterns for diagnosing database related issues.
Our physical world is about to become digitally enabled and according to various predictions, there will be many billions of IoT devices going online and collecting data in the coming years. 2
In this article, third installment of Apache Spark series, author discusses Apache Spark Streaming framework for processing real-time streaming data using a log analytics sample application. 7
In this article, Dr. Josiah Carlson, author of the book “Redis in Action”, explains how to use Redis and sorted sets with hashes for time series analysis. 3