InfoQ Homepage AI, ML & Data Engineering Content on InfoQ
-
Overcoming Data Scarcity and Privacy Challenges with Synthetic Data
In this article, the author discusses the importance of using synthetic data in data analytics projects, especially in financial institutions, to solve the problems of data scarcity and more importantly data privacy.
-
Understanding Similarity Scoring in Elasticsearch
In this article, the author discusses the importance of Relevancy Score for developing Search Engine solutions and how to calculate the relevancy score using Elasticsearch's similarity module.
-
How Apache Pulsar is Helping Iterable Scale its Customer Engagement Platform
In this article, author Greg Methvin discusses his experience implementing a distributed messaging platform based on Apache Pulsar.
-
Beyond the Database, and beyond the Stream Processor: What's the Next Step for Data Management?
Databases have been around forever with the same shape: you make a request to your data and then you receive an answer. Now, stream processors came along with a different approach: data isn’t locked up, it is in motion. Understand how stream processors and databases relate and why there is an emerging new category of databases that focus on data that stays in place as well as data that moves.
-
Q&A on the Book Accelerating Software Quality
The book Accelerating Software Quality by Eran Kinsbruner explores how we can combine techniques from artificial intelligence and machine learning with a DevOps approach to increase testing effectiveness and deliver higher quality. It provides examples and recommendations for using AI/ML-based solutions in software development and operations.
-
Is Artificial Intelligence Closer to Common Sense?
Intelligent agents lack the common-sense knowledge they need to reason about the world. Traditionally, there have been two unsuccessful approaches to getting computers to reason about the world—symbolic logic and deep learning. A new project, called COMET, tries to bring these two approaches together. Although it has not yet succeeded, it offers the possibility of progress.
-
Challenges of Human Pose Estimation in AI-Powered Fitness Apps
In this article, the author discusses the human pose estimation solution powered by AI technologies and the challenges faced in online fitness apps which use the pose estimation to predict the position of the human body based on an image or a video containing a person.
-
How to Build, Deploy, and Operationalize AI Assistants
While chatbot PoCs are simple, building production-grade conversational software is challenging. Enterprises experience challenges in production systems that have large user bases, security mandates, and polyglot environments. This article provides insight into building an AI assistant, and outlines various tools and techniques to continuously monitor and improve AI assistants in production.
-
COVID-19 and Mining Social Media - Enabling Machine Learning Workloads with Big Data
In this article, author Adi Pollock discusses how to enable machine learning workloads with big data to query and analyze COVID-19 tweets to understand social sentiment towards COVID-19.
-
Scalable Cloud Environment for Distributed Data Pipelines with Apache Airflow
In this article, author Lena Hall discusses how to use Apache Airflow to define and execute distributed data pipelines with an example of the workflow framework running on Kubernetes on Azure cloud platform.
-
The Case for Explainable AI (XAI)
Artificial Neural Networks offer significant performance benefits compared to other methodologies, but often at the expense of interpretability. Black box algorithms have precipitated a number of high-profile controversies arising from the inability to understand their inner workings. The efforts seeking to provide more transparency in this regard is referred to as Explainable AI (XAI).
-
Federated Machine Learning for Loan Risk Prediction
In this article, author Brendon Machado discusses how data owners and data scientists can work together to create models on privatized data using the federated learning technique and shows how to use it in loan risk prediction use cases.