Deep Learning Content on InfoQ
-
DeepMind's Agent57 Outperforms Humans on All Atari 2600 Games
Researchers at Google's DeepMind have produced a reinforcement-learning (RL) system called Agent57 that has scored above the human benchmark on all 57 Atari 2600 games in the Arcade Learning Environment. Agent57 is the first system to outperform humans on even the hardest games in the suite.
-
Google Releases Quantization Aware Training for TensorFlow Model Optimization
Google announced the release of the Quantization Aware Training (QAT) API for their TensorFlow Model Optimization Toolkit. QAT simulates low-precision hardware during the neural-network training process, adding the quantization error into the overall network loss metric, which causes the training process to minimize the effects of post-training quantization.
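As a quick illustration, the QAT API wraps an existing Keras model so that fake quantization is simulated during training. A minimal sketch (the architecture, layer sizes, and training data here are placeholders):

```python
import tensorflow as tf
import tensorflow_model_optimization as tfmot

# A plain Keras model; the architecture is a placeholder.
model = tf.keras.Sequential([
    tf.keras.layers.Dense(128, activation='relu', input_shape=(784,)),
    tf.keras.layers.Dense(10, activation='softmax'),
])

# quantize_model wraps the layers with fake-quantization ops, so training
# sees (and learns to compensate for) low-precision rounding error.
qat_model = tfmot.quantization.keras.quantize_model(model)
qat_model.compile(optimizer='adam',
                  loss='sparse_categorical_crossentropy',
                  metrics=['accuracy'])
# qat_model.fit(train_images, train_labels, epochs=1)  # placeholder data
```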
-
Google's SEED RL Achieves up to 80x Speedup in Reinforcement Learning
Researchers at Google Brain recently open-sourced SEED RL (Scalable, Efficient Deep-RL), a distributed reinforcement-learning architecture that moves neural-network inference from the actors to a central learner. SEED RL achieves state-of-the-art results on several RL benchmarks at lower cost and up to 80x faster than previous systems.
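In this design, actors own only the environment and stream observations to the learner, which replies with actions from (in the real system, batched accelerator) inference. A toy, single-machine sketch of that loop; the queues, thread layout, and dummy policy below are illustrative assumptions, not the SEED RL API:

```python
import queue
import threading
import numpy as np

NUM_ACTORS = 4
request_q = queue.Queue()                              # (actor_id, obs) to learner
reply_qs = [queue.Queue() for _ in range(NUM_ACTORS)]  # one action queue per actor

def actor(actor_id, num_steps=10):
    obs = np.zeros(4)                       # stand-in for env.reset()
    for _ in range(num_steps):
        request_q.put((actor_id, obs))      # ship observation to the learner
        action = reply_qs[actor_id].get()   # wait for the learner's action
        obs = obs + action                  # stand-in for env.step(action)

def learner(total_steps=NUM_ACTORS * 10):
    for _ in range(total_steps):
        actor_id, obs = request_q.get()
        # SEED RL would run batched inference on an accelerator here;
        # a dummy "policy" stands in for the network.
        action = float(np.sign(obs.sum())) if obs.any() else 1.0
        reply_qs[actor_id].put(action)

actors = [threading.Thread(target=actor, args=(i,)) for i in range(NUM_ACTORS)]
learner_t = threading.Thread(target=learner)
for t in actors + [learner_t]:
    t.start()
for t in actors + [learner_t]:
    t.join()
```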
-
Researchers Publish Survey of Explainable AI
A team of researchers from IBM Watson and Arizona State University has published a survey of work in Explainable AI Planning (XAIP). The survey covers 67 papers and charts recent trends in the field.
-
Facebook Research Develops AI System for Music Source Separation
Facebook Research recently released Demucs, a new deep-learning-powered system for music source separation. In human evaluations of overall sound quality after separation, Demucs outperforms previously reported results.
-
Deep Learning Accelerates Scientific Simulations up to Two Billion Times
Researchers from several physics and geology laboratories have developed Deep Emulator Network SEarch (DENSE), a technique that uses deep learning to emulate scientific simulations in fields ranging from high-energy physics to climate science. Compared to the original simulators, the DENSE emulators achieved speedups ranging from 10 million to 2 billion times.
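The emulator idea itself is classic surrogate modeling: sample the slow simulator, fit a network to the samples, then query the network instead. A hedged sketch with a toy stand-in for an expensive simulator and a fixed small network (where DENSE would instead search for the architecture):

```python
import numpy as np
import tensorflow as tf

def slow_simulator(x):
    # Stand-in for an expensive physics code.
    return np.sin(3 * x) + 0.5 * x ** 2

# Generate training pairs by running the slow simulator offline.
x_train = np.random.uniform(-2, 2, size=(2000, 1))
y_train = slow_simulator(x_train)

# Fit a small neural emulator to the simulator's input/output behavior.
emulator = tf.keras.Sequential([
    tf.keras.layers.Dense(64, activation='relu', input_shape=(1,)),
    tf.keras.layers.Dense(64, activation='relu'),
    tf.keras.layers.Dense(1),
])
emulator.compile(optimizer='adam', loss='mse')
emulator.fit(x_train, y_train, epochs=20, verbose=0)

# The trained emulator now answers almost instantly where the real
# simulator would (hypothetically) take seconds or hours per query.
y_fast = emulator.predict(np.array([[0.5]]))
```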
-
PyTorch 1.4 Release Introduces Java Bindings, Distributed Training
The PyTorch team announced the release of version 1.4 of Facebook's open-source deep-learning framework. This release, which will be the last version to support Python 2, includes improvements to distributed training and mobile inference and introduces experimental Java bindings.
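The Java bindings load TorchScript modules; a minimal sketch of producing one on the Python side (the model and file name are placeholders):

```python
import torch

class TinyModel(torch.nn.Module):
    def forward(self, x):
        return torch.relu(x) * 2.0

# torch.jit.script compiles the module to TorchScript, PyTorch's
# serializable intermediate representation.
scripted = torch.jit.script(TinyModel())
scripted.save("tiny_model.pt")
# On the Java side (as of 1.4) this file can be loaded with
# org.pytorch.Module.load("tiny_model.pt") and invoked via forward().
```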
-
GitHub Releases ML-Based "Good First Issues" Recommendations
GitHub shipped an updated version of its "good first issues" feature, which combines a machine-learning (ML) model that identifies easy issues with a hand-curated list of issues labeled as easy by project maintainers. New and seasoned open-source contributors can use the feature to find and tackle approachable issues in a project.
-
Microsoft Open-Sources Project Petridish for Deep-Learning Optimization
A team from Microsoft Research and Carnegie Mellon University has open-sourced Project Petridish, a neural architecture search algorithm that automatically builds deep-learning models that are optimized to satisfy a variety of constraints. Using Petridish, the team achieved state-of-the-art results on the CIFAR-10 benchmark with only 2.2M parameters and five GPU-days of search time.
-
Google Open-Sources Reformer Efficient Deep-Learning Model
Researchers from Google AI recently open-sourced the Reformer, a more efficient version of the Transformer deep-learning model. Using locality-sensitive hashing (LSH) to approximate the attention calculation and reversible residual layers to avoid storing activations, the Reformer can handle text sequences of up to 1 million words while consuming only 16GB of memory on a single GPU accelerator.
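Reversible residual layers follow the standard RevNet construction: a layer's inputs can be recomputed exactly from its outputs, so intermediate activations need not be kept in memory for backpropagation. A toy numpy sketch, where F and G stand in for the attention and feed-forward blocks:

```python
import numpy as np

def F(x):  # stand-in for the attention block
    return np.tanh(x)

def G(x):  # stand-in for the feed-forward block
    return 0.5 * x

def reversible_forward(x1, x2):
    y1 = x1 + F(x2)
    y2 = x2 + G(y1)
    return y1, y2

def reversible_inverse(y1, y2):
    # Recover the inputs from the outputs alone.
    x2 = y2 - G(y1)
    x1 = y1 - F(x2)
    return x1, x2

x1, x2 = np.random.randn(8), np.random.randn(8)
y1, y2 = reversible_forward(x1, x2)
r1, r2 = reversible_inverse(y1, y2)
assert np.allclose(x1, r1) and np.allclose(x2, r2)
```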
-
Microsoft Open-Sources ONNX Acceleration for BERT AI Model
Microsoft's Azure Machine Learning team recently open-sourced their contribution to the ONNX Runtime library for improving the performance of the natural language processing (NLP) model BERT. With the optimizations, the model's inference on the SQuAD benchmark ran 17x faster.
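Running an ONNX-exported model takes only a few lines with the onnxruntime package; the file name, sequence length, and input names below are assumptions that depend on how the BERT model was exported:

```python
import numpy as np
import onnxruntime as ort

# Load an ONNX model; "bert-base.onnx" is a placeholder file name.
session = ort.InferenceSession("bert-base.onnx")

# Dummy inputs for a batch of one 128-token sequence; the input names
# here are typical for BERT exports but depend on the export script.
batch = {
    "input_ids": np.ones((1, 128), dtype=np.int64),
    "attention_mask": np.ones((1, 128), dtype=np.int64),
    "token_type_ids": np.zeros((1, 128), dtype=np.int64),
}
outputs = session.run(None, batch)  # None = return all model outputs
```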
-
Apple Acquires Edge-Focused AI Startup Xnor.ai
Apple has acquired Xnor.ai, a Seattle-based startup that builds AI models that run on edge devices, for approximately $200 million.
-
Uber's Synthetic Training Data Speeds Up Deep Learning by 9x
Uber AI Labs has developed Generative Teaching Networks (GTN), an algorithm that produces synthetic training data on which neural networks learn faster than they do on real data. Using this synthetic data, Uber sped up its neural architecture search (NAS) deep-learning optimization process by 9x.
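A toy sketch of the GTN training loop (far simpler than Uber's system, and entirely illustrative): a generator emits synthetic (x, y) pairs, a fresh learner takes a few differentiable gradient steps on them, and the learner's loss on real data is backpropagated through those steps to improve the generator.

```python
import torch

torch.manual_seed(0)
# Generator: noise -> one synthetic training pair (x_syn, y_syn).
gen = torch.nn.Sequential(torch.nn.Linear(4, 32), torch.nn.Tanh(),
                          torch.nn.Linear(32, 2))
opt = torch.optim.Adam(gen.parameters(), lr=1e-2)
inner_lr = 0.1

for step in range(200):
    w = torch.zeros(1, requires_grad=True)      # fresh scalar learner
    # Inner loop: the learner trains on generator-produced data only.
    for _ in range(5):
        x_syn, y_syn = gen(torch.randn(4))
        inner_loss = ((w * x_syn - y_syn) ** 2).sum()
        (g,) = torch.autograd.grad(inner_loss, w, create_graph=True)
        w = w - inner_lr * g                    # differentiable update
    # Outer loop: evaluate the trained learner on real data (target y = 2x)
    # and backpropagate through the inner updates into the generator.
    x_real = torch.randn(64)
    outer_loss = ((w * x_real - 2 * x_real) ** 2).mean()
    opt.zero_grad()
    outer_loss.backward()
    opt.step()
```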
-
Stanford Researchers Publish AI Index 2019 Report
The Stanford Human-Centered Artificial Intelligence Institute published its AI Index 2019 Report. The 2019 report tracks three times as many datasets as the previous year's report and contains nearly 300 pages of data and graphs covering several aspects of AI, including research, technical performance, education, and societal considerations.
-
Deep Java Library: New Deep Learning Toolkit for Java Developers
Amazon released Deep Java Library (DJL), an open-source library with Java APIs that simplify training, testing, deploying, and making predictions with deep-learning models. DJL is framework-agnostic: it abstracts commonly used deep-learning functions and uses Java Native Access (JNA) to call into existing deep-learning engines, currently providing implementations for Apache MXNet and TensorFlow.