InfoQ Homepage Deep Learning Content on InfoQ

News

RSS Feed

Newer Older

AI, ML & Data Engineering

Google Open-Sources ALBERT Natural Language Model

Google AI has open-source A Lite Bert (ALBERT), a deep-learning natural language processing (NLP) model, which uses 89% fewer parameters than the state-of-the-art BERT model, with little loss of accuracy. The model can also be scaled-up to achieve new state-of-the-art performance on NLP benchmarks.

Anthony Alford
on Jan 07, 2020
AI, ML & Data Engineering

Uber Open-Sources Plug-and-Play Language Model for Controlling AI-Generated Text

Uber AI open-sourced the plug-and-play language model (PPLM) which can control the topic and sentiment of AI-generated text. The model's output is evaluated by human judges as achieving 36% better topic accuracy compared to the baseline GPT-2 model.

Anthony Alford
on Dec 31, 2019
AI, ML & Data Engineering

Amazon Releases SageMaker Studio IDE for Machine Learning

At the recent re:Invent conference, Amazon Web Services (AWS) announced Amazon SageMaker Studio, an integrated development enviornment (IDE) for machine learning (ML) that brings code editing, training job tracking and tuning, and debugging all into a single web-based interface.

Anthony Alford
on Dec 17, 2019
AI, ML & Data Engineering

TensorFlow 2.1.0 Will Be the Last Version to Support Python 2

The TensorFlow project announced a release candidate for version 2.1.0. In addition to several improvements and bug fixes, this release will be the last version of the deep-learning framework to support Python 2.

Anthony Alford
on Dec 10, 2019
AI, ML & Data Engineering

Google Introduces New Metrics for AI-Generated Audio and Video Quality

Google AI researchers published two new metrics for measuring the quality of audio and video generated by deep-learning networks, the Fréchet Audio Distance (FAD) and Fréchet Video Distance (FVD). The metrics have been shown to have a high correlation with human evaluations of quality.

Anthony Alford
on Dec 03, 2019
AI, ML & Data Engineering

Microsoft Releases DialogGPT AI Conversation Model

Microsoft Research's Natural Language Processing Group released dialogue generative pre-trained transformer (DialoGPT), a pre-trained deep-learning natural language processing (NLP) model for automatic conversation response generation. The model was trained on over 147M dialogues and achieves state-of-the-art results on several benchmarks.

Anthony Alford
on Nov 26, 2019
AI, ML & Data Engineering

Deep-Learning Framework SINGA Graduates to Top-Level Apache Project

The Apache Software Foundation (ASF) recently announced that SINGA, a framework for distributed deep-learning, has graduated to top-level project (TLP) status, signifying the project's maturity and stability. SINGA has already been adopted by companies in several sectors, including banking and healthcare.

Anthony Alford
on Nov 19, 2019
AI, ML & Data Engineering

PyTorch 1.3 Release Adds Support for Mobile, Privacy, and Transparency

Facebook recently announced the release of PyTorch 1.3. The latest version of the open-source deep learning framework includes new tools for mobile, quantization, privacy, and transparency.

Anthony Alford
on Nov 12, 2019
AI, ML & Data Engineering

Google Applies NLP Algorithm BERT to Search

BERT, Google's latest NLP algorithm, will power Google search and make it better at understanding user queries in a way more similar to how humans would understand them, writes Pandu Nayak, Google fellow and vice president for Search, with one in 10 queries providing a different set of results.

Sergio De Simone
on Nov 05, 2019
AI, ML & Data Engineering

Alexa Research Paper Shows Genetic Algorithms Offer Best Solution for Neural Network Optimization

Amazon's Alexa Science researchers published a paper providing a theoretical basis for neural network optimization. While showing that it is computationally intractable to find a perfect solution, the paper does provide a formulation, the Approximate Architecture Search Problem (a-ASP), that can be solved with genetic algorithms.

Anthony Alford
on Nov 05, 2019
AI, ML & Data Engineering

Microsoft and University of Maryland Researchers Announce FreeLB Adversarial Training System

Researchers from Microsoft and the University of Maryland (UMD) announced Free Large-Batch (FreeLB), a new adversarial training technique for deep-learning natural-language processing (NLP) systems that improves accuracy, increasing RoBERTa's scores on the General Language Understanding Evaluation (GLUE) benchmark and achieving the highest score on AI2 Reasoning Challenge (ARC) benchmark.

Anthony Alford
on Oct 29, 2019
AI, ML & Data Engineering

Facebook Open-Sources CraftAssist Framework for AI Assistants in Minecraft

Facebook AI researchers open-sourced CraftAssist, a framework for building interactive assistants for the Minecraft video game. The bots use natural language understanding (NLU) to parse and execute text commands from human players, such as requests to build houses in the game world. The framework's modular structure can be extended by researchers to perform their own ML experiments.

Anthony Alford
on Oct 22, 2019
AI, ML & Data Engineering

Facebook Releases AI Code Search Datasets

Facebook AI released a dataset containing coding questions paired with code-snippet answers, intended for evaluating AI-based natural-language code search systems. The release also includes benchmark results for several of Facebook's own code-search models and a training corpus of over 4 million Java methods parsed from over 24,000 GitHub repositories.

Anthony Alford
on Oct 15, 2019
AI, ML & Data Engineering

AI Researchers' Open-Source Model Explanation Toolkit AllenNLP Interpret

Researchers from the Allen Institute for AI and University of California, Irvine, have released AllenNLP Interpret, a toolkit for explaining the results from NLP models. The extensible toolkit includes several built-in methods for interpretation and visualization components, as well as examples using AllenNLP to explain the results of state-of-the art NLP models including BERT and RoBERTa.

Anthony Alford
on Oct 08, 2019
AI, ML & Data Engineering

Google Releases Two New NLP Dialog Datasets

Researchers from Google AI released two new dialog datasets for natural-language processing (NLP) development: Coached Conversational Preference Elicitation (CCPE) and Taskmaster-1. The datasets contain thousands of conversations as well as labels and annotations for training digital assistants to better determine users' preferences and intentions.

Anthony Alford
on Oct 01, 2019

Newer News

Older News

InfoQ Software Architects' Newsletter

News