Machine Learning Content on InfoQ
-
NLP Library spaCy 3.0 Features Transformer-Based Models and Distributed Training
AI software maker Explosion announced version 3.0 of spaCy, its open-source natural-language processing (NLP) library. The new release includes state-of-the-art Transformer-based pipelines and pre-trained models for 17 languages.
-
Microsoft Satin Audio Codec Uses AI to Outperform Skype Silk
Microsoft announced Satin, a new audio codec that leverages AI techniques to outperform Skype's Silk codec over ultra-low bandwidth and highly constrained network conditions.
-
Google Open-Sources Trillion-Parameter AI Language Model Switch Transformer
Researchers at Google Brain have open-sourced the Switch Transformer, a natural-language processing (NLP) AI model. The model scales up to 1.6T parameters and trains up to 7x faster than the T5 NLP model, with comparable accuracy.
-
OpenAI Announces GPT-3 Model for Image Generation
OpenAI has trained a 12B-parameter AI model based on GPT-3 that can generate images from textual descriptions. A description can specify many independent attributes, including the position of objects and the image perspective, and the model can also synthesize combinations of objects that do not exist in the real world.
-
Facebook Open-Sources Multilingual Speech Recognition Deep-Learning Model
Facebook AI Research (FAIR) open-sourced Cross-Lingual Speech Recognition (XLSR), a multilingual speech recognition AI model. XLSR is trained on 53 languages and outperforms existing systems when evaluated on common benchmarks.
-
AWS Introduces HealthLake and Redshift ML in Preview
AWS introduced preview releases of the Amazon HealthLake service and an Amazon Redshift feature called Redshift ML during re:Invent 2020 in December. Amazon HealthLake is a data lake service that uses NLP to help healthcare, health insurance, and pharmaceutical companies derive value from their data. Redshift ML gives Redshift users a gateway into SageMaker.
-
TensorFlow 2.4 Release Includes CUDA 11 Support and API Updates
The TensorFlow project announced the release of version 2.4.0 of the deep-learning framework, featuring support for CUDA 11, cuDNN 8, and NVIDIA's Ampere GPU architecture, as well as new strategies and profiling tools for distributed training. Other API updates include mixed-precision in Keras and a NumPy frontend.
-
AWS Launches Amazon DevOps Guru
Amazon Web Services (AWS) recently introduced Amazon DevOps Guru, one of several new machine learning-driven services. DevOps Guru detects operational issues, generates reports and notifications, and offers insights and recommendations on how to take action.
-
AI Models from Google and Microsoft Exceed Human Performance on Language Understanding Benchmark
Research teams from Google and Microsoft have recently developed natural language processing (NLP) AI models which have scored higher than the human baseline score on the SuperGLUE benchmark. SuperGLUE measures a model's score on several natural language understanding (NLU) tasks, including question answering and reading comprehension.
-
Medium Describes "Rex" - a Go-Based Recommendation Service
In a recent blog post, Medium describes how it built a recommendation service named "Rex." The original recommendation service was part of the Node.js monolith, and it could only rank 150 stories. However, Medium wanted this service to rank hundreds of thousands of stories per user in under a second. So, they decided to build an entirely new, separate service using Go.
-
DeepMind's AlphaFold2 AI Solves 50-Year-Old Biology Challenge
The Protein Structure Prediction Center announced that AlphaFold2, an AI system developed by DeepMind, has solved its Protein Structure Prediction challenge. AlphaFold2 achieved a median score of 92.4 on the Global Distance Test (GDT) metric, above the threshold considered competitive with traditional methods.
-
Molecular Dynamics Simulation Based on Machine Learning Wins Gordon Bell Prize
The 2020 Association for Computing Machinery (ACM) Gordon Bell Prize was awarded to a team of researchers from institutions in the USA and China for their project titled "Pushing the limit of molecular dynamics with ab initio accuracy to 100 million atoms with machine learning."
-
Caltech Open-Sources AI for Solving Partial Differential Equations
Researchers from Caltech's DOLCIT group have open-sourced Fourier Neural Operator (FNO), a deep-learning method for solving partial differential equations (PDEs). FNO outperforms other existing deep-learning techniques for solving PDEs and is three orders of magnitude faster than traditional solvers.
-
Trend Micro, Europol, and UNICRI Publish AI Misuse Report
Trend Micro, Europol’s European Cybercrime Centre (EC3), and United Nations Interregional Crime and Justice Research Institute (UNICRI) have jointly produced a report on current and possible future criminal misuse of AI. The report also includes a set of preparedness recommendations for policymakers, law enforcement, and cybersecurity experts.
-
AWS Announces Amazon SageMaker Edge Manager
AWS recently announced Amazon SageMaker Edge Manager, a new SageMaker capability that makes it easier for customers to prepare, run, monitor, and update machine learning models on fleets of edge devices such as smart cameras, robots, and industrial machines.