InfoQ Homepage Deep Learning Content on InfoQ
-
Google Open-Sources ALBERT Natural Language Model
Google AI has open-source A Lite Bert (ALBERT), a deep-learning natural language processing (NLP) model, which uses 89% fewer parameters than the state-of-the-art BERT model, with little loss of accuracy. The model can also be scaled-up to achieve new state-of-the-art performance on NLP benchmarks.
-
Uber Open-Sources Plug-and-Play Language Model for Controlling AI-Generated Text
Uber AI open-sourced the plug-and-play language model (PPLM) which can control the topic and sentiment of AI-generated text. The model's output is evaluated by human judges as achieving 36% better topic accuracy compared to the baseline GPT-2 model.
-
Amazon Releases SageMaker Studio IDE for Machine Learning
At the recent re:Invent conference, Amazon Web Services (AWS) announced Amazon SageMaker Studio, an integrated development enviornment (IDE) for machine learning (ML) that brings code editing, training job tracking and tuning, and debugging all into a single web-based interface.
-
TensorFlow 2.1.0 Will Be the Last Version to Support Python 2
The TensorFlow project announced a release candidate for version 2.1.0. In addition to several improvements and bug fixes, this release will be the last version of the deep-learning framework to support Python 2.
-
Google Introduces New Metrics for AI-Generated Audio and Video Quality
Google AI researchers published two new metrics for measuring the quality of audio and video generated by deep-learning networks, the Fréchet Audio Distance (FAD) and Fréchet Video Distance (FVD). The metrics have been shown to have a high correlation with human evaluations of quality.
-
Microsoft Releases DialogGPT AI Conversation Model
Microsoft Research's Natural Language Processing Group released dialogue generative pre-trained transformer (DialoGPT), a pre-trained deep-learning natural language processing (NLP) model for automatic conversation response generation. The model was trained on over 147M dialogues and achieves state-of-the-art results on several benchmarks.
-
Deep-Learning Framework SINGA Graduates to Top-Level Apache Project
The Apache Software Foundation (ASF) recently announced that SINGA, a framework for distributed deep-learning, has graduated to top-level project (TLP) status, signifying the project's maturity and stability. SINGA has already been adopted by companies in several sectors, including banking and healthcare.
-
PyTorch 1.3 Release Adds Support for Mobile, Privacy, and Transparency
Facebook recently announced the release of PyTorch 1.3. The latest version of the open-source deep learning framework includes new tools for mobile, quantization, privacy, and transparency.
-
Google Applies NLP Algorithm BERT to Search
BERT, Google's latest NLP algorithm, will power Google search and make it better at understanding user queries in a way more similar to how humans would understand them, writes Pandu Nayak, Google fellow and vice president for Search, with one in 10 queries providing a different set of results.
-
Alexa Research Paper Shows Genetic Algorithms Offer Best Solution for Neural Network Optimization
Amazon's Alexa Science researchers published a paper providing a theoretical basis for neural network optimization. While showing that it is computationally intractable to find a perfect solution, the paper does provide a formulation, the Approximate Architecture Search Problem (a-ASP), that can be solved with genetic algorithms.
-
Microsoft and University of Maryland Researchers Announce FreeLB Adversarial Training System
Researchers from Microsoft and the University of Maryland (UMD) announced Free Large-Batch (FreeLB), a new adversarial training technique for deep-learning natural-language processing (NLP) systems that improves accuracy, increasing RoBERTa's scores on the General Language Understanding Evaluation (GLUE) benchmark and achieving the highest score on AI2 Reasoning Challenge (ARC) benchmark.
-
Facebook Open-Sources CraftAssist Framework for AI Assistants in Minecraft
Facebook AI researchers open-sourced CraftAssist, a framework for building interactive assistants for the Minecraft video game. The bots use natural language understanding (NLU) to parse and execute text commands from human players, such as requests to build houses in the game world. The framework's modular structure can be extended by researchers to perform their own ML experiments.
-
Facebook Releases AI Code Search Datasets
Facebook AI released a dataset containing coding questions paired with code-snippet answers, intended for evaluating AI-based natural-language code search systems. The release also includes benchmark results for several of Facebook's own code-search models and a training corpus of over 4 million Java methods parsed from over 24,000 GitHub repositories.
-
AI Researchers' Open-Source Model Explanation Toolkit AllenNLP Interpret
Researchers from the Allen Institute for AI and University of California, Irvine, have released AllenNLP Interpret, a toolkit for explaining the results from NLP models. The extensible toolkit includes several built-in methods for interpretation and visualization components, as well as examples using AllenNLP to explain the results of state-of-the art NLP models including BERT and RoBERTa.
-
Google Releases Two New NLP Dialog Datasets
Researchers from Google AI released two new dialog datasets for natural-language processing (NLP) development: Coached Conversational Preference Elicitation (CCPE) and Taskmaster-1. The datasets contain thousands of conversations as well as labels and annotations for training digital assistants to better determine users' preferences and intentions.