InfoQ Homepage Computer Vision Content on InfoQ
-
Uber's Synthetic Training Data Speeds Up Deep Learning by 9x
Uber AI Labs has developed an algorithm called Generative Teaching Networks (GTN) that produces synthetic training data for neural networks which allows the networks to be trained faster than when using real data. Using this synthetic data, Uber sped up its neural architecture search (NAS) deep-learning optimization process by 9x.
-
Facebook AI Releases New Computer Vision Library Detectron2
Facebook AI Research (FAIR) has released Detectron2, a PyTorch-based computer vision library that brings a series of new research and production capabilities to the framework. While the first Detectron was written in Caffe2, Detectron2 represents a full rewrite of the original framework in PyTorch from the ground up, with several new object detection capabilities.
-
Google Announces Updates to AutoML Vision Edge, AutoML Video, and the Video Intelligence API
In a recent blog post, Google announced enhancements to a part of its Vision AI portfolio: AutoML Vision Edge, AutoML Video, and the Video Intelligence API. Each received updates to enhance their capabilities.
-
Waymo Shares Autonomous Vehicle Dataset for Machine Learning
Waymo, the self-driving technology company, released a dataset containing sensor data collected by their autonomous vehicles during more than five hours of driving. The set contains high-resolution data from lidar and camera sensors collected in several urban and suburban environments in a wide variety of driving conditions and includes labels for vehicles, pedestrians, cyclists, and signage.
-
New Technique Speeds up Deep-Learning Inference on TensorFlow by 2x
Researchers at North Carolina State University recently presented a paper at the International Conference on Supercomputing (ICS) on their new technique, "deep reuse" (DR), that can speed up inference time for deep-learning neural networks running on TensorFlow by up to 2x, with almost no loss of accuracy.
-
University Research Teams Open-Source Natural Adversarial Image DataSet for Computer-Vision AI
Research teams from three universities recently released a dataset called ImageNet-A, containing natural adversarial images: real-world images that are misclassified by image-recognition AI. When used as a test-set on several state-of-the-art pre-trained models, the models achieve an accuracy rate of less than 3%.
-
Researchers Develop Technique for Reducing Deep-Learning Model Sizes for Internet of Things
Researchers from Arm Limited and Princeton University have developed a technique that produces deep-learning computer-vision models for internet-of-things (IoT) hardware systems with as little as 2KB of RAM. By using Bayesian optimization and network pruning, the team is able to reduce the size of image recognition models while still achieving state-of-the-art accuracy.
-
AWS Enhances Deep Learning AMI, AI Services SageMaker Ground Truth, and Rekognition
Amazon Web Services (AWS) announced updates to their Deep Learning virtual machine image, as well as improvements to their AI services SageMaker Ground Truth and Rekognition.
-
Google Uses Mannequin Challenge Videos to Learn Depth Perception
Google AI Research published a paper describing their work on depth perception from two-dimensional images. Using a training dataset created from YouTube videos of the Mannequin Challenge, researchers trained a neural network that can reconstruct depth information from videos of moving people, taken by moving cameras.
-
Google Announces TensorFlow Graphics Library for Unsupervised Deep Learning of Computer Vision Model
At a presentation during Google I/O 2019, Google announced TensorFlow Graphics, a library for building deep neural networks for unsupervised learning tasks in computer vision. The library contains 3D rendering functions written in TensorFlow, as well as tools for learning with non-rectangular mesh-based input data.
-
OpenAI Introduces Sparse Transformers for Deep Learning of Longer Sequences
OpenAI has developed the Sparse Transformer, a deep neural-network architecture for learning sequences of data, including text, sound, and images. The networks can achieve state-of-the-art performance on several deep-learning tasks with faster training times.
-
Google Releases Google-Landmarks-V2, a Large-Scale Dataset for Landmark Recognition & Retrieval
Google has released Google-Landmarks-v2, an improved dataset for Landmark Recognition & Retrieval, along with Detect-to-Retrieve, a Tensorflow codebase for large-scale instance-level image recognition. Two companion Kaggle competitions based on Google-Landmarks-v2 were also launched. With over 200,000 landmarks in 5 million images, it is the largest landmark dataset ever published.
-
Salesforce Adds Intelligence to its Einstein Services Offering
In a recent press release, Salesforce announced additions to their Einstein platform that target bringing AI solutions to Salesforce developers and admins using a low code, point and click configuration-based solution. The recent additions to the platform include Einstein Translation and Einstein Optical Character Recognition (OCR).
-
Microsoft Expands the Availability of Its Cognitive Services: Anomaly Detector and Custom Vision
Microsoft recently announced the public preview of Anomaly Detector and general availability of Custom Vision. With both services, Microsoft further expands its Cognitive Services offering for its customers.
-
AWS Marketplace Offers Machine Learning Algorithms and Model Packages
Amazon Web Services is offering machine learning algorithms and model packages on their AWS Marketplace. This was announced at AWS re:Invent Conference last week.