Computer Vision Content on InfoQ

News

AI, ML & Data Engineering

Artificial Intelligence Can Create Sound Tracks for Silent Videos

Researchers Ghose and Prevost created a deep-learning algorithm that, given a silent video, generates a realistic-sounding, synchronised soundtrack. They trained a neural network to predict the class of sound to generate, and a sequential network to generate the sound itself. The approach thus goes from temporally aligned images to sound, which is a different modality.

Roland Meertens
on Sep 07, 2020
AI, ML & Data Engineering

Google Announces TensorFlow 2 Support in Object Detection API

Google announced support for TensorFlow 2 (TF2) in the TensorFlow Object Detection (OD) API. The release includes eager-mode compatible binaries, two new network architectures, and pre-trained weights for all supported models.

Anthony Alford
on Jul 21, 2020
AI, ML & Data Engineering

MIT and Toyota Release Autonomous Driving Dataset DriveSeg

Toyota's Collaborative Safety Research Center (CSRC) and MIT's AgeLab have released DriveSeg, a dataset for autonomous driving research. DriveSeg contains over 25,000 frames of high-resolution video with each pixel labelled with one of 12 classes of road object. DriveSeg is available free of charge for non-commercial use.

Anthony Alford
on Jun 30, 2020
Mobile

Google ML Kit SDK Now Focuses on On-Device Machine Learning

Google has introduced a new ML Kit SDK aimed at working in standalone mode without requiring a tight integration with Firebase, as the original ML Kit SDK did. Additionally, it provides limited support for replacing its default models with custom ones for image labeling and object detection and tracking.

Sergio De Simone
on Jun 23, 2020
AI, ML & Data Engineering

Google Open-Sources Computer Vision Model Big Transfer

Google Brain has released the pre-trained models and fine-tuning code for Big Transfer (BiT), a deep-learning computer vision model. The models are pre-trained on publicly available generic image datasets and can meet or exceed state-of-the-art performance on several vision benchmarks after fine-tuning on just a few samples.

Anthony Alford
on Jun 09, 2020
Web Development

Google's V8 Engine Adds Support for WebAssembly SIMD

The WebAssembly SIMD proposal has come to Google's JavaScript engine V8, albeit still as an experimental feature. By exploiting data parallelism, V8's support for SIMD (single instruction, multiple data) aims to accelerate compute-intensive tasks such as audio/video processing and machine learning.

Sergio De Simone
on Feb 05, 2020
AI, ML & Data Engineering

Apple Acquires Edge-Focused AI Startup Xnor.ai

Apple has acquired Xnor.ai, a Seattle-based startup that builds AI models that run on edge devices, for approximately $200 million.

Anthony Alford
on Jan 24, 2020
AI, ML & Data Engineering

Uber's Synthetic Training Data Speeds Up Deep Learning by 9x

Uber AI Labs has developed an algorithm called Generative Teaching Networks (GTN) that produces synthetic training data for neural networks, allowing the networks to be trained faster than with real data. Using this synthetic data, Uber sped up its neural architecture search (NAS) deep-learning optimization process by 9x.

Anthony Alford
on Jan 21, 2020
AI, ML & Data Engineering

Facebook AI Releases New Computer Vision Library Detectron2

Facebook AI Research (FAIR) has released Detectron2, a PyTorch-based computer vision library that brings a series of new research and production capabilities to the framework. While the first Detectron was written in Caffe2, Detectron2 represents a full rewrite of the original framework in PyTorch from the ground up, with several new object detection capabilities.

George Seif
on Oct 28, 2019
Cloud

Google Announces Updates to AutoML Vision Edge, AutoML Video, and the Video Intelligence API

In a recent blog post, Google announced enhancements to part of its Vision AI portfolio: AutoML Vision Edge, AutoML Video, and the Video Intelligence API.

Steef-Jan Wiggers
on Oct 22, 2019
AI, ML & Data Engineering

Waymo Shares Autonomous Vehicle Dataset for Machine Learning

Waymo, the self-driving technology company, released a dataset containing sensor data collected by its autonomous vehicles during more than five hours of driving. The set contains high-resolution data from lidar and camera sensors, collected in several urban and suburban environments under a wide variety of driving conditions, and includes labels for vehicles, pedestrians, cyclists, and signage.

Anthony Alford
on Sep 04, 2019
AI, ML & Data Engineering

New Technique Speeds up Deep-Learning Inference on TensorFlow by 2x

Researchers at North Carolina State University recently presented a paper at the International Conference on Supercomputing (ICS) on their new technique, "deep reuse" (DR), that can speed up inference time for deep-learning neural networks running on TensorFlow by up to 2x, with almost no loss of accuracy.

Anthony Alford
on Aug 27, 2019
AI, ML & Data Engineering

University Research Teams Open-Source Natural Adversarial Image DataSet for Computer-Vision AI

Research teams from three universities recently released a dataset called ImageNet-A, containing natural adversarial images: real-world images that are misclassified by image-recognition AI. When the dataset is used as a test set for several state-of-the-art pre-trained models, the models achieve an accuracy rate of less than 3%.

Anthony Alford
on Aug 13, 2019
AI, ML & Data Engineering

Researchers Develop Technique for Reducing Deep-Learning Model Sizes for Internet of Things

Researchers from Arm Limited and Princeton University have developed a technique that produces deep-learning computer-vision models for internet-of-things (IoT) hardware systems with as little as 2KB of RAM. By using Bayesian optimization and network pruning, the team is able to reduce the size of image recognition models while still achieving state-of-the-art accuracy.

Anthony Alford
on Jul 30, 2019
AI, ML & Data Engineering

AWS Enhances Deep Learning AMI, AI Services SageMaker Ground Truth, and Rekognition

Amazon Web Services (AWS) announced updates to their Deep Learning virtual machine image, as well as improvements to their AI services SageMaker Ground Truth and Rekognition.

Anthony Alford
on Jun 25, 2019
