InfoQ Homepage AI, ML & Data Engineering Content on InfoQ
-
CMU Develops Algorithm for Guaranteeing AI Model Generalization
Researchers at Carnegie Mellon University's (CMU) Approximately Correct Machine Intelligence (ACMI) Lab have published a paper on Randomly Assign, Train and Track (RATT), an algorithm that uses noisy training data to provide an upper bound on the true error risk of a deep-learning model. Using RATT, model developers can determine how well a model will generalize to new input data.
-
Pyodide Brings Python and Its Scientific Stack to the Browser with WebAssembly
Mozilla announced that Pyodide, which aims at providing a full Python data science stack running entirely in the browser, has become an independent community-driven project. Pyodide uses the CPython 3.8 interpreter compiled to WebAssembly, and thus allows using Python, NumPy, Pandas, Matplotlib, SciPy, and more in Iodide, an experimental interactive scientific computing environment for the web.
-
Microsoft's ZeRO-Infinity Library Trains 32 Trillion Parameter AI Model
Microsoft recently open-sourced ZeRO Infinity, an addition to their open-source DeepSpeed AI training library that optimizes memory use for training very large deep-learning models. Using ZeRO-Infinity, Microsoft trained a model with 32 trillion parameters on a cluster of 32 GPUs, and demonstrated fine-tuning of a 1 trillion parameter model on a single GPU.
-
NVIDIA Announces AI Training Dataset Generator DatasetGAN
Researchers at NVIDIA have created DatasetGAN, a system for generating synthetic images with annotations to create datasets for training AI vision models. DatasetGAN can be trained with as few as 16 human-annotated images and performs as well as fully-supervised systems requiring 100x more annotated images.
-
IBM Introduces Watson Orchestrate for Task Automation
At the recent Think conference, IBM introduces an AI-powered automation tool called Watson Orchestrate. It is currently in preview in IBM Cloud Paks for Automation and expected to be generally available later in 2021 as an SaaS offering.
-
Amazon Open Sources DeepRacer Device Software
Amazon has recently open-sourced the DeepRacer device software, the software used to run AWS DeepRacer. The 1/18th scale autonomous vehicle and the DeepRacer events allow developers to create machine learning models and race them in a cloud based 3D racing simulator.
-
Take Your Technical and Non-Technical Skills to the Next Level: Join Us at QCon Plus
The best way to learn in the software industry is from your peers. Join 2,200 fellow senior software engineers, architects and team leads at QCon Plus starting next week.
-
AWS Announces a Data Management and Analytics Solution Called Amazon FinSpace
Recently, AWS announced a data management and analytics solution purpose-built for the Financial Services Industry (FSI) called Amazon FinSpace. The service aims to reduce the time it takes for financial analysts to find and access all types of financial data for analysis.
-
Researchers Publish Biologically Plausible AI Training Method
A team of researchers at Oxford University developed an algorithm called zero-divergence inference learning (Z-IL), an alternative to the backpropagation (BP) algorithm for training neural network AI models. Z-IL has been shown to exactly reproduce the results of BP on any neural network, but unlike BP does not violate known principles of brain function.
-
Find Solutions to Your Software Challenges at QCon Plus
Last November at QCon Plus, over 1,450 of your peers joined us at the virtual event in order to keep on top of software trends and find solutions to validate their technical roadmaps. Now is the time to book your attendance at the next event! With less than five weeks before QCon Plus May 2021, over 1,800 senior software engineers, architects, and team leads have already booked their spot.
-
Facebook Announces ZionEX Platform for Training AI Models with 12 Trillion Parameters
A team of scientists at Facebook AI Research (FAIR) announced a system for training deep-learning recommendation models (DLRM) using PyTorch on a custom-built AI hardware platform, ZionEX. Using this system, the team trained models with up to 12T parameters and achieved nearly an order-of-magnitude speedup in training time compared to other systems.
-
AWS Introduces Savings Plans and Instant Price Reductions for Amazon Sagemaker
Recently, AWS announced instant price reductions and Savings Plans for Amazon SageMaker, their fully-managed Machine Learning (ML) service. With Savings Plans for Amazon SageMaker, customers can benefit from cost savings up to 64% compared to the on-demand price. The company also drops the price of several instance families in Amazon SageMaker by up to 14.2%.
-
Open Source AI Can Predict Electrical Outages from Storms with 81% Accuracy
A team of scientists from Aalto University and the Finnish Meteorological Institute have developed an open-source AI model for predicting electrical outages caused by storm damage. The model can predict storm location within 15km and classifies the amount of transformer damage with 81% accuracy, allowing power companies to prepare for outages and repair them more quickly.
-
MIT Announces AI Benchmark ThreeDWorld Transport Challenge
A team of researchers from MIT and the MIT-IBM Watson AI Lab have announced the ThreeDWorld Transport Challenge, a benchmark task for embodied AI agents. The challenge is to improve research on AI agents that can control a simulated mobile robot that is guided by computer vision to pick up objects and move them to new locations.
-
Perceiver: One Neural-Network Model for Multiple Input Data Types
Google’s DeepMind company has recently released a state-of-the-art deep-learning model called Perceiver that receives and processes multiple input data ranging from audio to images, similarly to how the human brain perceives multimodal data. Perceiver is able to receive and classify input multiple data types, namely point cloud, audio and images.