AI, ML & Data Engineering Content on InfoQ
-
Jawbone's Head of Data Science and Analytics on the Future of Wearables and Analytics Insights
Brian Wilt, Head of Data Science and Analytics at Jawbone, recently gave a presentation at QCon SF about machine learning applications at Jawbone. Here we ask him about current and future directions of research and development in sleep research, getting actionable insights, getting wearables to play a significant role in healthcare, and cool projects currently in their early stages at Jawbone.
-
Machine Learning with Spark: Book Review and Interview
Machine learning is about making data-driven decisions or predictions based on existing data. Apache Spark and its machine learning library MLlib offer several algorithms useful for developing scalable machine learning applications. InfoQ spoke with Nick Pentreath, author of the book Machine Learning with Spark, about data science and machine learning topics.
-
OCP Oracle Certified Professional Java SE 8 Programmer Study Guide II - Review and Author Conversation
The Oracle Java Certification exams are very difficult tests covering every feature of Java, and obtaining certification gives hiring managers a strong indication that you have a thorough understanding of the language. This handbook is a clear and complete exam preparation guide, and a fast, intensive way to learn Java 8 even for those who are not planning to become certified.
-
Big Data Processing with Apache Spark - Part 3: Spark Streaming
In this article, the third installment of the Apache Spark series, author Srini Penchikala discusses the Apache Spark Streaming framework for processing real-time streaming data, using a log analytics sample application.
-
Using Redis as a Time Series Database: Why and How
In this article, Dr. Josiah Carlson, author of the book “Redis in Action”, explains how to use Redis and sorted sets with hashes for time series analysis.
-
Health Informatics and Survival Prediction of Cancer with Apache Spark Machine Learning Library
In this article, the author discusses the survival prediction of colorectal cancer as a multi-class classification problem and how to solve that problem using Apache Spark's MLlib Java API.
-
Key Takeaway Points and Lessons Learned from QCon San Francisco 2015
This article summarizes the key takeaways and highlights from QCon San Francisco 2015 as blogged and tweeted by QCon's 1,300 attendees. Over the course of the next 4 months, InfoQ will be publishing most of the conference sessions online, including 10 video interviews that were recorded by the InfoQ editorial team.
-
Data Lake-as-a-Service: Big Data Processing and Analytics in the Cloud
Data Lake-as-a-Service solutions provide big data processing in the cloud for faster business outcomes in a cost-effective way. InfoQ spoke with Lovan Chetty and Hannah Smalltree from the Cazena team about how Data Lake-as-a-Service works.
-
Philip Rathle on Neo4j 2.3 Graph Database Features and openCypher Initiative
Neo Technology, the company behind the graph NoSQL database Neo4j, recently released version 2.3 of the database. They also announced the openCypher initiative to help create a standard graph query language. InfoQ spoke with Philip Rathle, VP of Products at Neo Technology, about the new features in the latest release of Neo4j and the openCypher announcement.
-
Key Lessons Learned from Transition to NoSQL at an Online Gambling Website
In this article, author Dan Macklin discusses the transition to a Riak NoSQL and Erlang-based architecture coupled with Convergent Replicated Data Types (CRDTs), and the lessons learned from the transition.
-
Real-time Data Processing in AWS Cloud
In this article, author Oleksii Tymchenko discusses a bioinformatics software-as-a-service (SaaS) product called Chorus, which was built as a public data warehousing and analytical platform for mass spectrometry data. Other features of the product include real-time visualization of raw mass-spec data.
-
Oozie Plugin for Eclipse
The Oozie Eclipse plugin is a new tool for editing Apache Oozie workflows graphically inside Eclipse. Using the plugin avoids writing process definitions in HPDL, which are hard to develop and maintain. Instead, a process graph is defined graphically by placing process actions from the palette and connecting them. This article introduces the Oozie Eclipse plugin and provides an example of its usage.