InfoQ Homepage AI, ML & Data Engineering Content on InfoQ
-
Scalable Cloud Environment for Distributed Data Pipelines with Apache Airflow
In this article, author Lena Hall discusses how to use Apache Airflow to define and execute distributed data pipelines with an example of the workflow framework running on Kubernetes on Azure cloud platform.
-
The Case for Explainable AI (XAI)
Artificial Neural Networks offer significant performance benefits compared to other methodologies, but often at the expense of interpretability. Black box algorithms have precipitated a number of high-profile controversies arising from the inability to understand their inner workings. The efforts seeking to provide more transparency in this regard is referred to as Explainable AI (XAI).
-
Federated Machine Learning for Loan Risk Prediction
In this article, author Brendon Machado discusses how data owners and data scientists can work together to create models on privatized data using the federated learning technique and shows how to use it in loan risk prediction use cases.
-
Easy Interpretation of a Logistic Regression Model with Delta-p Statistics
Delta-p statistics is an easier means of communicating results to a non-technical audience than the plain coefficients of a logistic regression model. In this article, authors Maarit Widmann and Alfredo Roccato discuss how to predict credit eligibility using the Delta-p statistics based solution.
-
Combining DataOps and DevOps: Scale at Speed
DataOps is an extension of DevOps standards and processes into the data analytics world. It's about streamlining the processes involved in processing, analyzing and deriving value from big data.
-
The First Wave of GPT-3 Enabled Applications Offer a Preview of Our AI Future
The first wave of GPT-3 powered applications are emerging. After priming of only a few examples, GPT-3 could write essays, answer questions, and even generate computer code! Furthermore, GPT-3 can perform algebraic calculations and language translations despite never being taught such concepts. However, GPT-3 is a black box with unpredictable outcomes. Developers must use it responsively.
-
State of the Art in Automated Machine Learning
InfoQ caught up with experts Francesca Lazzeri, machine learning scientist lead at Microsoft; Matthew Tovbin, co-founder of Faros AI; Adrian de Wynter, applied scientist in Alexa AI’s Secure AI Foundations; Leah McGuire, principal member of technical staff at Salesforce; and Marios Michailidis, data scientist at H2O.ai, about the state of the art in automated machine learning (AutoML).
-
How to Get Hired as a Machine Learning Engineer
To become a machine learning engineer, you have to interview. You have to gain relevant skills from books, courses, conferences, and projects. Include technologies, frameworks, and projects on your CV. In an interview, expect that you will be asked technical questions, insight questions, and programming questions. When given a technical task, demonstrate your skills as if you already had the job.
-
Data Leadership Book Review and Interview
Data Leadership book, authored by Anthony Algmin, covers the data leadership topic and how data leaders should manage and govern the data management programs in their organizations. Data Leadership is how organizations choose to apply their energy and resources toward creating data capabilities to influence their business.
-
Innovation Startups Modeling Agile Culture
Innovation is not only about the most advanced technology; management and processes are the new era of startups' innovation. To mix the power of the data and the importance of people to offer business intelligence is a key point nowadays. The result is not only the most important thing; the way you do it is more important. To be agile is to adapt to today's market.
-
Applied Probability - Counting Large Set of Unstructured Events with Theta Sketches
In this article, author Ronen Cohen discusses the solution to processing the event data using Theta Sketches and technologies like HBase and Kafka.
-
TornadoVM: Accelerating Java with GPUs and FPGAs
The proliferation of heterogeneous hardware represents a problem for programming languages such as Java that target CPUs. TornadoVM extends the Graal JIT compiler to take advantage of GPUs & FPGAs and provides a flexible, high-level model whilst still enabling high performance and features such as live task migration.