InfoQ Homepage AI, ML & Data Engineering Content on InfoQ
-
OpenAI Introduces InstructGPT Language Model to Follow Human Instructions
OpenAI overhauled the GPT-3 language model and introduced a new default tool called InstructGPT to address complaints about toxic language and misinformation.
-
Meta Open-Sources Multi-Modal AI Algorithm Data2vec
Meta AI recently open-sourced data2vec, a unified framework for self-supervised deep learning on images, text, and speech audio data. When evaluated on common benchmarks, models trained using data2vec perform as well as or better than state-of-the-art models trained with modality-specific objectives.
-
How GitHub Uses Machine Learning to Extend Vulnerability Code Scanning
Applying machine learning techniques to its rule-based security code scanning capabilities, GitHub hopes to be able to extend them to less common vulnerability patterns by automatically inferring new rules from the existing ones.
-
DeepMind Open-Sources Quantum Chemistry AI Model DM21
Researchers at Google subsidiary DeepMind have open-sourced DM21, a neural network model for mapping electron density to chemical interaction energy, a key component of quantum mechanical simulation. DM21 outperforms traditional models on several benchmarks and is available as an extension to the PySCF simulation framework.
-
PipelineDP Brings Google’s Differential-Privacy Library to Python
Google and OpenMined have released PipelineDP, a new open-source library that allows researchers and developers to apply differentially private aggregations to large datasets using batch-processing systems.
-
Alibaba Open-Sources AutoML Algorithm KNAS
Researchers from Alibaba Group and Peking University have open-sourced Kernel Neural Architecture Search (KNAS), an efficient automated machine learning (AutoML) algorithm that can evaluate proposed architectures without training. KNAS uses a gradient kernel as a proxy for model quality, and uses an order of magnitude less compute power than baseline methods.
-
LambdaML: Pros and Cons of Serverless for Deep Network Training
A new study entitled "Towards Demystifying Serverless Machine Learning Training" aims to provide an experimental analysis of training deep networks by leveraging serverless platforms. FaaS for training has challenges due to its distributed nature and aggregation step in the learning algorithms. Results indicate FaaS can be a faster (for lightweight models) but not cheaper alternative than IaaS.
-
Meta AI’s Convolution Networks Upgrade Improves Image Classification
Meta AI released a new generation of improved Convolution Networks, achieving state-of-the-art performance of 87.8% accuracy on Image-Net top-1 dataset and outperforming Swin Transformers on COCO dataset where object detection performance is evaluated. The new design and training approach is inspired by the Swin Transformers model.
-
Evaluating Continual Deep Learning: a New Benchmark for Image Classification
Continual learning aims to preserve knowledge across deep network training iterations. A new dataset entitled "The CLEAR Benchmark: Continual LEArning on Real-World Imagery" has recently been published. The goal of the study is to establish a consistent image classification benchmark with the natural time evolution of objects for a more realistic comparison of continual learning models.
-
Meta Unveils AI Supercomputer for the Metaverse
Meta has unveiled its AI Research SuperCluster (RSC) supercomputer, aimed at accelerating AI research and helping the company build the metaverse. The RSC will help the company build new and better AI models, working across hundreds of different languages, and to develop new augmented reality tools.
-
University Researchers Develop Brain-Computer Interface for Robot Control
Researchers from École Polytechnique Fédérale de Lausanne (EPFL) in Switzerland and the University of Texas at Austin (UT) have developed a brain-computer interface (BCI) that allows users to modify a robot manipulator's motion trajectories. The system uses inverse reinforcement learning (IRL) and can learn a user's preferences using less than five demonstrations.
-
Google Introduces Autoscaling for Cloud Bigtable for Optimizing Costs
Cloud Bigtable is a fully-managed, scalable NoSQL database service for large operational and analytical workloads on the Google Cloud Platform (GCP). And recently, the public cloud provider announced the general availability of Bigtable Autoscaling, which automatically adds or removes capacity in response to the changing demand for applications allowing cost optimizations.
-
Amazon OpenSearch Adds Anomaly Detection for Historical Data
Amazon OpenSearch recently introduced the support of anomaly detection for historical data. The machine learning based feature helps identifying trends, patterns, and seasonality in OpenSearch data.
-
OpenAI Announces Question-Answering AI WebGPT
OpenAI has developed WebGPT, an AI model for long-form question-answering based on GPT-3. WebGPT can use web search queries to collect supporting references for its response, and on Reddit questions its answers were preferred by human judges over the highest-voted answer 69% of the time.
-
InfoQ 2022 Events: Get Ready to Deep-Dive with Leading Software Practitioners
Our events will be both online (InfoQ Live and QCon Plus) and in-person once again with our QCon software development conferences in London (April 4-6) and San Francisco (October 24-28). Join us to find practical inspiration to help you adopt the patterns and practices this year.