Machine Learning Content on InfoQ
-
Meta AI’s Convolution Networks Upgrade Improves Image Classification
Meta AI released a new generation of improved convolutional networks, achieving state-of-the-art performance of 87.8% top-1 accuracy on ImageNet and outperforming Swin Transformers at object detection on the COCO dataset. The new design and training approach is inspired by the Swin Transformer model.
-
Evaluating Continual Deep Learning: a New Benchmark for Image Classification
Continual learning aims to preserve knowledge across deep network training iterations. A new dataset entitled "The CLEAR Benchmark: Continual LEArning on Real-World Imagery" has recently been published. The goal of the study is to establish a consistent image classification benchmark with the natural time evolution of objects for a more realistic comparison of continual learning models.
-
How AI Supports IT Operators to Resolve Issues Faster and Keep Systems Running
AIOps equips IT teams with algorithms that use historical data to speed up evaluation and remediation and to surface actionable insights, without the need to solicit feedback from users directly. AI can help IT operators work smarter, resolve issues faster, and keep systems up and running to deliver a great end-user experience.
-
AI Listens by Seeing as Well
Meta AI released a self-supervised speech recognition model that also uses video and achieves 75% better accuracy than current state-of-the-art models for a given amount of data. The new model, Audio-Visual Hidden Unit BERT (AV-HuBERT), uses audiovisual features to improve on models that rely on audio alone. The visual features are based on lip-reading, similar to what humans do.
-
Meta and AWS to Collaborate on PyTorch Adoption
Meta and AWS will work together to improve the performance of customers' applications running PyTorch on AWS and to accelerate how developers build, train, deploy, and operate artificial intelligence and machine-learning models.
-
AWS Launches SageMaker Studio Lab, Free Tool to Learn and Experiment with Machine Learning
AWS has introduced SageMaker Studio Lab, a free service to help developers learn machine-learning techniques and experiment with the technology. SageMaker Studio Lab provides users with all of the basics to get started, including a JupyterLab IDE, model training on CPUs and GPUs and 15 GB of persistent storage.
-
Recap of AWS re:Invent 2021
After one year as a virtual-only event, re:Invent returned to Las Vegas last week for its 10th edition, with fewer attendees and with multiple sessions and keynotes, including the first one by the new CEO, Adam Selipsky. AWS announced new features and improvements, focusing more on packaged solutions than on new primitives.
-
AMD Introduces Its Deep-Learning Accelerator Instinct MI200 Series GPUs
In its recent Accelerated Data Center Premiere keynote, AMD unveiled its Instinct MI200 accelerator series: the Instinct MI250X and the slightly lower-end Instinct MI250 GPUs. Built on the CDNA 2 architecture and TSMC’s 6nm FinFET process, the high-end MI250X provides 47.9 TFLOPS of peak double-precision performance and memory that allows training larger deep networks while minimizing model sharding.
-
D2iQ Releases DKP 2.0 to Run Kubernetes Apps at Scale
D2iQ recently released version 2.0 of the D2iQ Kubernetes Platform (DKP), a platform to help organizations run Kubernetes workloads at scale. The new release provides a single pane of glass for managing multi-cluster environments and running applications across any infrastructure including private cloud, public cloud, or at the network edge.
-
AWS Announces the Availability of EC2 Instances (G5) with NVIDIA A10G Tensor Core GPUs
Recently, AWS announced the availability of new G5 instances, which feature up to eight NVIDIA A10G Tensor Core GPUs. These instances are powered by second-generation AMD EPYC processors.
-
Facebook Develops New AI Model That Can Anticipate Future Actions
Facebook unveiled its latest machine-learning model, Anticipative Video Transformer (AVT), which predicts future actions from visual input. AVT is an end-to-end attention-based model for action anticipation in videos.
-
Google Document Translation Now Generally Available
Google Cloud recently announced the general availability of Document Translation, a new feature of Translation API Advanced that allows formatting of documents to be retained throughout the translation process.
-
What Machine Learning Can Do for Security
Machine learning can be applied in various ways in security, for instance, in malware analysis, to make predictions, and for clustering security events. It can also be used to detect previously unknown attacks with no established signature.
-
Amazon Releases DL1 Instances Powered by Gaudi Accelerators
Amazon recently announced the general availability of EC2 DL1 instances powered by Gaudi accelerators from Habana Labs. The new instances promise better price performance when training deep learning models for use cases such as computer vision, natural language processing, autonomous vehicle perception, and recommendation engines.
-
Analyze Video Feeds at the Edge with AWS Panorama Appliance
Recently, AWS announced the general availability (GA) of AWS Panorama Appliance, a new device that customers can install in their facilities to run applications that analyze multiple video streams from existing on-premises cameras.