InfoQ Homepage AI, ML & Data Engineering Content on InfoQ
-
Get More Bytes for Your Buck
Lovethesales had to classify one million product data from 700 different disparate sources across a large domain. They decided to create a hierarchy of classifiers through utilizing machine learning, specifically Support Vector Machines. They learned that optimising the way in which the svms were connected together yielded vast improvements in the reuse of labeled training data.
-
Approximate Queries on WSO2 Stream Processor: Use of Approximation Algorithms in an Applied Setting
In this article, we describe an example real world application of API monitoring which benefits from using approximate stream processing. We developed the application on top of WSO2 Stream Processor as Siddhi extension. Siddhi is the complex event processing library which acts as the event processing engine of WSO2 Stream Processor.
-
Key Takeaway Points and Lessons Learned from QCon San Francisco 2017
The eleventh annual QCon San Francisco was the biggest yet, bringing together over 1,800 team leads, architects, project managers, and engineering directors.
-
InfoQ Call for Articles
InfoQ provides software engineers with the opportunity to share experiences gained using innovator and early adopter stage techniques and technologies with the wider industry. We are always on the lookout for quality articles and we encourage practitioners and domain experts to submit feature-length (2,000 to 3,000 word) papers that are timely, educational and practical.
-
Understanding Monads. A Guide for the Perplexed
With the current explosion of functional programming, the "monad" functional structure is once again striking fear into the hearts of newcomers. In this article, Introduction to Functional Programming course instructor Dr. Barry Burd clarifies this slippery critter.
-
FPGAs Supercharge Computational Performance
Originally used in the development of new hardware, new, cloud-based FPGAs are making the technology more accessible. The dramatic improvements in speed and lower costs over traditional CPUs means more companies can start benefiting from the technology. FPGAs are fundamentally concurrent, which makes them an ideal tool for data-intensive, parallel processing problems.
-
Building Reactive Systems Using Akka’s Actor Model and Domain-Driven Design
With the explosion of mobile and data-driven applications, users are demanding real-time access to everything everywhere. System resilience and responsiveness are essential business requirements. Businesses increasingly need to trade up to more flexible, "reactive" systems. To support reactive development, actor models with domain-driven design can fulfill modern resiliency requirements.
-
Big Data and Big Money: The Role of Data in the Financial Sector
When we consider the 3Vs of big data— volume, velocity, and variety—it is hard to think of many sectors whose requirements fit so nicely into the guidelines at finance.
-
User Anonymity on Twitter
This article explores how it is possible to measure how many Twitter users adopted anonymous pseudonyms, the correlation between content sensitivity and user anonymity, and whether it would be possible to build automated classifiers that would detect sensitive Twitter accounts.
-
The Problem with AI
AI depends on "data janitorial" work, as opposed to science work, and there is a gulf between prototype and sandbox, and innovation and production.
-
How Much Should We Trust Artificial Intelligence
Considerable buzz surrounds artificial intelligence, and, indeed, AI is all around us. As with any software-based technology, it is also prone to vulnerabilities. Here, the author examines how we determine whether AI is sufficiently reliable to do its job and how much we should trust its outcomes.
-
Video Stream Analytics Using OpenCV, Kafka and Spark Technologies
What is the role of video streaming data analytics in data science space. Learn how to implement a motion detection use case using a sample application based on OpenCV, Kafka and Spark Technologies.