InfoQ Homepage Database Content on InfoQ

Articles

RSS Feed

Newer Older

AI, ML & Data Engineering

Using Redis as a Time Series Database: Why and How

In this article, Dr. Josiah Carlson, author of the book “Redis in Action”, explains how to use Redis and sorted sets with hashes for time series analysis.

Josiah Carlson
on Jan 02, 2016
AI, ML & Data Engineering

Health Informatics and Survival Prediction of Cancer with Apache Spark Machine Learning Library

In this article, author discusses the survival prediction of colorectal cancer as a multi-class classification problem and how to solve that problem using the Apache Spark's MLlib Java API.

Konur Unyelioglu
on Dec 22, 2015
AI, ML & Data Engineering

Data Lake-as-a-Service: Big Data Processing and Analytics in the Cloud

Data Lake-as-a-Service solutions provide big data processing in the cloud for faster business outcomes in a very cost effective way. InfoQ spoke with Lovan Chetty and Hannah Smalltree from Cazena team about how Data Lake as a Service works.

Srini Penchikala
on Dec 10, 2015
AI, ML & Data Engineering

Philip Rathle on Neo4j 2.3 Graph Database Features and openCypher Initiative

Neo Technology, the company behind the graph NoSQL database Neo4j, recently released version 2.3 of the database. They also announced openCypher initiative to help with creating a standard graph query language. InfoQ spoke with Philip Rathle, VP of Products at Neo Technology, about the new features in the latest release of Neo4j and openCypher announcement.

Srini Penchikala
on Nov 26, 2015
AI, ML & Data Engineering

Key Lessons Learned from Transition to NoSQL at an Online Gambling Website

In this article, author Dan Macklin discusses the transition to Riak NoSQL and Erlang based architecture coupled with Convergent Replicated Data Types (CRDTs) and lessons learned with the transition.

Dan Macklin
on Nov 21, 2015
AI, ML & Data Engineering

Real-time Data Processing in AWS Cloud

In this article, author Oleksii Tymchenko discusses a bio-informatic software as a service (SaaS) product called Chorus, which was built as a public data warehousing and analytical platform for mass spectrometry data. Other features of the product include real-time visualization of raw mass-spec data.

Oleksii Tymchenko
on Nov 11, 2015
AI, ML & Data Engineering

Oozie Plugin for Eclipse

Oozie Eclipse plugin is a new tool for editing Apache Oozie workflows graphically inside Eclipse. Usage of this plugin allows to skip hard to develop and maintain process definition in HPDL. Instead a process graph is defined graphically by placing process actions on pallet and connecting them. An article introduces Eclipse Oozie plugin and provides an example of its usage.

Ahmed Mahran
on Oct 30, 2015
AI, ML & Data Engineering

Big Data Solutions with MS SQL ColumnStore Index

Columnar data storage can offer significant performance improvements over the way database tables are traditionally stored, but they aren’t always faster. Aleksandr Shavlyuga explores the power, and limitations of SQL Server’s ColumnStore Indexes.

Aleksandr Shavlyuga
on Oct 11, 2015
AI, ML & Data Engineering

The Estimation Game - Techniques for Informed Guessing

In this article, author Carlos Bueno discusses the strategies for estimating the server capacity for big data projects and initiatives, with the help of two case studies.

Carlos Bueno
on Sep 26, 2015
AI, ML & Data Engineering

Machine Learning and Cognitive Computing

Based on a webinar on analytics, this article covers the topics of machine learning and cognitive computing, and how these fields are related to artificial intelligence (AI). Panelists discuss how this technology is being applied in digital marketing space and what concerns organizations have in providing machine learning services.

Seth Earley
on Sep 19, 2015
AI, ML & Data Engineering

Garage Door Openers: An Internet of Things Case Study

In this article, author discusses how to design an Internet-connected garage door opener ("IoT opener") to be secure. He talks about cloud service authentication and security improvements offered by networked openers, like two-factor authentication (2FA). He also discusses security infrastructure for IoT devices, which includes user authentication, access policy creation & enforcement.

Jonathan Margulies
on Sep 13, 2015
AI, ML & Data Engineering

Breaking Down Data Silos with Foreign Data Wrappers

Author Lenley Hensarling discusses the Foreign Data Wrapper (FDW) feature in Postgres database. FDW provides a SQL interface for accessing data objects in remote data stores to integrate data from disparate sources like NoSQL databases and bring them into a common model.

Lenley Hensarling
on Sep 09, 2015

Newer Articles

Older Articles

InfoQ Software Architects' Newsletter

Articles