InfoQ Homepage AI, ML & Data Engineering Content on InfoQ
-
Big Data and Big Money: The Role of Data in the Financial Sector
When we consider the 3Vs of big data— volume, velocity, and variety—it is hard to think of many sectors whose requirements fit so nicely into the guidelines at finance.
-
User Anonymity on Twitter
This article explores how it is possible to measure how many Twitter users adopted anonymous pseudonyms, the correlation between content sensitivity and user anonymity, and whether it would be possible to build automated classifiers that would detect sensitive Twitter accounts.
-
The Problem with AI
AI depends on "data janitorial" work, as opposed to science work, and there is a gulf between prototype and sandbox, and innovation and production.
-
How Much Should We Trust Artificial Intelligence
Considerable buzz surrounds artificial intelligence, and, indeed, AI is all around us. As with any software-based technology, it is also prone to vulnerabilities. Here, the author examines how we determine whether AI is sufficiently reliable to do its job and how much we should trust its outcomes.
-
Video Stream Analytics Using OpenCV, Kafka and Spark Technologies
What is the role of video streaming data analytics in data science space. Learn how to implement a motion detection use case using a sample application based on OpenCV, Kafka and Spark Technologies.
-
Perspective on Architectural Fitness of Microservices
In this article we peel the onion of potential architectural fitness of microservices in the context of Master Data Management, and the challenges a microservices-based architecture may face when solving problem domains that require compute-intensive tasks, such as the calculation of expected losses on a portfolio of unsecured consumer credit.
-
Key Takeaway Points and Lessons Learned from QCon New York 2017
The sixth annual QCon New York was the biggest yet, bringing together over 1,100 team leads, architects, project managers, and engineering directors - up from last year's record of 940. It was also the first to take place in our new home in Times Square.
-
Apache Beam Interview with Frances Perry
InfoQ Interviews Apache Beam's Frances Perry about the impetus for using Beam and the future of the top-level open source project and covers the thoughts behind the programming model as well as some of the touch-points in integration with other data engineering tools like Apache Spark and Flink.
-
Introducing FaunaDB Serverless Cloud
FaunaDB Serverless Cloud is the managed version of FaunaDB, a serverless, object-relational, globally replicated, strongly consistent, temporal database, that can be deployed on multiple clouds, such as AWS, GCP, and Azure, or on premises.
-
Introducing Reladomo - Enterprise Open Source Java ORM, Batteries Included! (Part 2)
Goldman Sachs is widely known as a leader in investment banking, but they are very much a leading technology firm as well. Continuing our exploration of Reladomo, the primary Java ORM used at GS and now open source, GS Technology Fellow, Mohammad Rezaei looks at advanced features, such as sharding, caching, bitemporal access, performance, and testing.
-
Machine Learning Techniques for Predictive Maintenance
In this article, the authors explore how we can build a machine learning model to do predictive maintenance of systems. They discuss a sample application using NASA engine failure dataset to predict the Remaining Useful Time (RUL) with regression models.
-
Under The Hood with the JVM's Automatic Resource Management
The deprecation of Object::finalize is an unusual step for the Java ecosystem. We dive deep into the Hotspot JVM to see how it works. We also compare it to RAII and the Java 7, try-with-resources syntax. The article contrasts these very different approaches to automatic resource management, and explains why TWR should be used in place of finalization by application programmers.