InfoQ Homepage AI, ML & Data Engineering Content on InfoQ
-
Invokedynamic - Java’s Secret Weapon
invokedynamic was the first new Java bytecode since Java 1.0 and was crucial in implementing the "headline" features of Java 8 (such as lambdas and default methods). In this article, we take a deep dive into invokedynamic and explain why it is such a powerful tool for the Java platform and for JVM languages such as JRuby and Nashorn.
-
Spark in Action Book Review & Interview
In the "Spark in Action" book, authors Petar Zecevic and Marko Bonaci discuss the Apache Spark framework for data processing (batch and streaming data use cases). They introduce the architecture of Spark and core concepts such as Resilient Distributed Datasets (RDDs). InfoQ spoke with them about Apache Spark, developer tools, and the upcoming features and enhancements in the future releases.
-
Beyond Page Objects: Next Generation Test Automation with Serenity and the Screenplay Pattern
Automated acceptance testing reduces time wasted in manual testing and bug fixing, and when combined with Behaviour-Driven Development, can guide development effort. But it requires skill, practice and discipline. The Screenplay Pattern helps teams address these difficulties and is where you may end up by mercilessly refactoring Page Objects using SOLID design principles.
-
Unified Data Modeling for Relational and NoSQL Databases
Current enterprise data architectures include NoSQL databases co-existing with relational databases. However, NoSQL data management currently lacks mature methods and tools to manage NoSQL data. In this article, author discusses a solution for managing both NoSQL and relational databases using Unified Data Modeling techniques.
-
IAP: Fast, Versatile Alternative to HTTP
Jakob Jenkov's organization has analyzed the modern application stack, including high level architectures, concrete technologies like databases, query languages, messaging, distributed computing models, & network protocols, and constructed the next gen alternative to HTTP. IAP is the resulting emerging standard protocol, and ION the high speed alternative to JSON and Protocol Buffers.
-
Jawbone's Head of Data Science and Analytics on the Future of Wearables and Analytics Insights
Brian Wilt, Head of Data Science and Analytics at Jawbone, recently gave a presentation at QCon SF about Machine Learning applications at Jawbone. Here we ask more about current and future directions of research and development around sleep research, getting actionable insights, getting wearables to play a significant role in healthcare, and cool projects currently in their early stages at Jawbone
-
Machine Learning with Spark: Book Review and Interview
Machine learning is about making data-driven decisions or predictions based on existing data. Apache Spark and its machine learning library MLlib offer several algorithms useful for developing scalable machine learning applications. InfoQ spoke with Nick Pentreath, author of the book Machine Learning with Spark, about data science and machine learning topics.
-
OCP Oracle Certified Professional Java SE 8 Programmer Study Guide II-Review and Author Conversation
The Oracle Java Certification exams are very difficult tests on every feature of Java, and obtaining certification gives hiring managers a very strong indication that you have a thorough understanding of Java. This handbook is a clear and complete exam preparation, and indeed a great pedal to the metal way to learn Java 8 even for those who may not be planning to become certified.
-
Big Data Processing with Apache Spark - Part 3: Spark Streaming
In this article, third installment of Apache Spark series, author Srini Penchikala discusses Apache Spark Streaming framework for processing real-time streaming data using a log analytics sample application.
-
Using Redis as a Time Series Database: Why and How
In this article, Dr. Josiah Carlson, author of the book “Redis in Action”, explains how to use Redis and sorted sets with hashes for time series analysis.
-
Health Informatics and Survival Prediction of Cancer with Apache Spark Machine Learning Library
In this article, author discusses the survival prediction of colorectal cancer as a multi-class classification problem and how to solve that problem using the Apache Spark's MLlib Java API.