InfoQ Homepage AI, ML & Data Engineering Content on InfoQ
-
Simplifying ETL in the Cloud, Microsoft Releases Azure Data Factory Mapping Data Flows
In a recent blog post, Microsoft announced the general availability (GA) of their serverless, code-free Extract-Transform-Load (ETL) capability inside of Azure Data Factory called Mapping Data Flows. This tool allows organizations to embrace a data-driven culture without the need to manage large infrastructure footprints while having the ability to dynamically scale data processing workloads.
-
Rasa Developer Summit: Production Grade AI Assistants
Rasa, an open source framework that provides machine learning tools to build and deploy contextual AI assistants, recently held its developer summit in San Francisco. The speakers at the summit shared interesting enterprise case studies on using Rasa to build AI assistants. "Our primary goal for the Rasa Developer Summit was to build community," said Alex Weidauer, CEO & co-founder of Rasa.
-
QCon San Francisco: Speaker Interviews, Workshops, and Learning Paths
QCon San Francisco features 18 curated tracks with over 120 speakers presenting sessions across 3 full days of in-depth technical talks. Topics include: designing and running large-scale architectures, microservices patterns, languages of infrastructure, practices of DevOps, building high-scale effective teams, managing the software supply chain and the latest in machine learning for developers.
-
Facebook Releases AI Code Search Datasets
Facebook AI released a dataset containing coding questions paired with code-snippet answers, intended for evaluating AI-based natural-language code search systems. The release also includes benchmark results for several of Facebook's own code-search models and a training corpus of over 4 million Java methods parsed from over 24,000 GitHub repositories.
-
Microsoft Releases Azure Sentinel, a Cloud Native SIEM, to General Availability
In a recent blog post, Microsoft announced the general availability of Sentinel, a Security Information and Event Management (SIEM) service in Azure, providing customers with intelligent security analytics across their enterprise. With the GA of Azure Sentinel, Microsoft now enters the SIEM market.
-
QCon San Francisco Learning Paths: Practical and Intensive AI/ML Two-Day Training
To help develop your artificial intelligence (AI) and machine learning (ML) knowledge and skills, the team behind InfoQ and QCon Conferences has launched the AI and ML Learning Paths. These practical and intensive two-day focused sessions are designed to help you understand and use Machine Learning by writing and executing code, instead of listening to passive lectures.
-
Release of Open Liberty 19.0.0.9 Completes Support of MicroProfile Standalone Reactive Streams APIs
In keeping with IBM’s commitment to their four-week release cycle of Open Liberty, version 19.0.0.9 was recently made available. Open Liberty’s ongoing support for MicroProfile includes the new standalone reactive APIs: MicroProfile Reactive Streams Operators (version 19.0.0.4); MicroProfile Context Propagation (version 19.0.0.8); and MicroProfile Reactive Messaging (version 19.0.0.9).
-
Databricks' Unified Analytics Platform Supports AutoML Toolkit
Databricks recently announced the Unified Data Analytics Platform, including an automated machine learning tool called AutoML Toolkit. The toolkit can be used to automate various steps of the data science workflow.
-
AI Researchers' Open-Source Model Explanation Toolkit AllenNLP Interpret
Researchers from the Allen Institute for AI and University of California, Irvine, have released AllenNLP Interpret, a toolkit for explaining the results from NLP models. The extensible toolkit includes several built-in methods for interpretation and visualization components, as well as examples using AllenNLP to explain the results of state-of-the art NLP models including BERT and RoBERTa.
-
Microsoft Announces General Availability of Jupyter Notebooks Support for Cosmos DB
Recently Microsoft announced the general availability of Jupyter notebooks support for Cosmos DB, providing integrated support for running queries directly against all data models. As these notebooks run directly inside Cosmos DB; this allows for analyzing and visualizing the data directly from the Azure portal, without the need to extract the data.
-
Distributed SQL Database NuoDB 4.0 Supports Kubernetes Operator and Indexing Improvements
Distributed SQL database NuoDB 4.0 supports Kubernetes Operator and indexing improvements. NuoDB released the latest version of the database that also supports Azure and Google Cloud Platform certification.
-
Google Releases Two New NLP Dialog Datasets
Researchers from Google AI released two new dialog datasets for natural-language processing (NLP) development: Coached Conversational Preference Elicitation (CCPE) and Taskmaster-1. The datasets contain thousands of conversations as well as labels and annotations for training digital assistants to better determine users' preferences and intentions.
-
Microsoft .NET Conf 2019: .NET Core 3.0, C# 8.0, F# 4.7, and Machine Learning
At the 2019 edition of .NET Conf, streamed online earlier this week, Microsoft announced several new features for the .NET ecosystem. .NET Core, ASP.NET, and EF Core 3.0 were released, together with C# 8.0 and F# 4.7. The new releases have a considerable impact on how Windows Desktop, Web, and Mobile applications are developed. New features and tools for Visual Studio 2019 were also showcased.
-
Facebook Open-Sources RoBERTa: an Improved Natural Language Processing Model
Facebook AI open-sourced a new deep-learning natural-language processing (NLP) model, Robustly-optimized BERT approach (RoBERTa). Based on Google's BERT pre-training model, RoBERTa includes additional pre-training improvements that achieve state-of-the-art results on several benchmarks, using only unlabeled text from the world-wide web, with minimal fine-tuning and no data augmentation.
-
Google Releases Cloud Dataproc for Kubernetes in Alpha
Google Cloud Dataproc is an open-source data and analytic processing service based on Hadoop and Spark. Google has recently announced the alpha availability of Cloud Dataproc for Kubernetes, which provides customers with a more efficient method to process data across platforms.