InfoQ Homepage AI, ML & Data Engineering Content on InfoQ
-
Christine Doig on Data Science as a Team Discipline
Christine Doig spoke at this year's OSCON Conference about data science as a team discipline and how to navigate the data science Python ecosystem. InfoQ spoke with Christine about challenges data science teams need to address to be more effective.
-
Virtual Panel: Current State of NoSQL Databases
NoSQL databases have been around for several years now and have become a choice of data storage for managing semi-structured and unstructured data. These databases offer lot of advantages in terms of linear scalability and better performance for both data writes and reads. InfoQ spoke with four panelists to get different perspectives on the current state of NoSQL databases.
-
Key Takeaway Points and Lessons Learned from QCon New York 2016
The fifth annual QCon New York was the biggest yet, bringing together over 800 team leads, architects, project managers, and engineering directors. In total, over 140 practitioner-speakers presented 79 full-length technical sessions and 16 in-depth tutorials, providing deep insights into real-world architectures and state of the art software development practices from a practitioner’s perspective.
-
What the JIT!? Anatomy of the OpenJDK HotSpot VM
If you've ever wondered what happens when your bytecode executes, join former Oracle G1GC performance-lead Monica Beckwith in her guided tour of just-in-time (JIT) compilation and runtime optimizations in OpenJDK HotSpot VM.
-
Big Data Analytics with Spark Book Review and Interview
Big Data Analytics with Spark book, authored by Mohammed Guller, provides a practical guide for learning Apache Spark framework for different types of big-data analytics projects, including batch, interactive, graph, and stream data analysis as well as machine learning. InfoQ spoke with author about the book & development tools for big data applications.
-
Configure Once, Run Everywhere: Decoupling Configuration and Runtime
Configuration is one of the most widely used cross-cutting concerns in application development. Apache Tamaya is a new incubator project that brings standardized property management to Java.
-
Martin Van Ryswyk on DataStax Enterprise Graph Database
DataStax recently announced a new product called DataStax Graph to store graph data models. It's based on open source Titan graph database and uses Apache Tinkerpop framework's Gremlin query language. InfoQ spoke with Martin Van Ryswyk about the new product.
-
Big Data Processing with Apache Spark - Part 4: Spark Machine Learning
In this fourth installment of Apache Spark article series, author Srini Penchikala discusses machine learning concepts and Spark MLlib library for running predictive analytics using a sample application.
-
Lambda Functions versus Infrastructure - Are we Trading Apples for Oranges?
Amazon's AWS Lambda service lets us run code without provisioning servers. Serverless offerings have recently been receiving interest due to their simplicity, capabilities and potential for cost reductions. This article compares the tradeoffs of serverless models with VM/Container based models.
-
Key Takeaway Points and Lessons Learned from QCon London 2016
This article summarizes the key takeaways and highlights from QCon London 2016 as blogged and tweeted by QCon's 1,400 attendees. Over the course of the next 4 months, InfoQ will be publishing most of the conference sessions online, including 21 video interviews that were recorded by the InfoQ editorial team.
-
High Load Trading Transaction Processing with Reveno CQRS/Event Sourcing Framework
Reveno is a powerful new, easy to use, highly performant, JVM based lock-free transaction processing framework based on CQRS and event-sourcing patterns. In this article we will develop a simple trading system implementation using the Reveno framework.
-
The Role of a Data Scientist in 2016
Data Scientist role has been getting lot of attention lately as organizations are starting to use big data processing and analytics techniques to gain insights into their data. This post takes a closer look at the role of a Data Scientist in 2016.