Machine Learning Content on InfoQ
-
MLGO Framework Brings Machine Learning in Compiler Optimizations
Google’s new Machine Learning Guided Optimization (MLGO) is an industrial-grade, general framework for systematically integrating machine-learning (ML) techniques into a compiler, in particular LLVM. Producing faster and smaller code can significantly reduce the operational cost of large data-center applications.
-
LinkedIn Open-Sourced Its Feature Store to Evangelize Productive Machine Learning
LinkedIn Engineering recently open-sourced its feature store Feathr, which helps engineers develop machine learning products by simplifying feature management and usage in production. It defines features, computes them for training and inference purposes, and makes them discoverable by other machine learning developers.
-
Amazon Unveils ML-Powered Coding Assistant CodeWhisperer
Amazon launched CodeWhisperer, an ML-powered coding companion that provides code recommendations based on developers' natural-language comments and their code in the integrated development environment. The service aims to increase developer productivity.
-
AWS and Microsoft Working Together on PyWhy, the New Home of Causal ML Library DoWhy
AWS, in a joint effort with Microsoft, has established PyWhy as a new GitHub organization to integrate AWS algorithms into DoWhy, a causal ML library from Microsoft, which has moved to PyWhy.
-
Amazon Released Incremental Training Feature in SageMaker JumpStart
AWS recently released a new feature in SageMaker (AWS Machine Learning Service) JumpStart to incrementally retrain machine-learning (ML) models with expanded datasets. By using this feature, developers can fine-tune their models for better performance in production with a couple of clicks. The feature is one of a series of efforts to add more automation to SageMaker JumpStart.
-
GitHub Copilot Adopts Paid Model, Still Free for Some Open-Source Maintainers and Students
After almost one year in technical preview, GitHub Copilot is now ready for prime time for students and individual developers, says GitHub, while companies and larger organizations could get access to it before the end of the year.
-
Microsoft's New Simulation Framework FLUTE Accelerates Federated Learning Algorithm Development
Microsoft Research has recently released Federated Learning Utilities and Tools for Experimentation (FLUTE), a new simulation framework to accelerate the development of federated learning algorithms. The main goal of federated learning is to train complex machine-learning models over massive amounts of data without the need to share that data in a centralized location.
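To make the idea of training without centralizing data concrete, here is a minimal sketch of weighted federated averaging in plain Python. It is an illustration under stated assumptions, not FLUTE's API: the function names `local_update` and `federated_average`, the one-parameter linear model, and the toy client data are all hypothetical.

```python
from typing import List, Tuple

Sample = Tuple[float, float]  # (x, y) pair held privately by one client


def local_update(w: float, data: List[Sample], lr: float = 0.1, epochs: int = 5) -> float:
    """Run a few gradient-descent steps for y ~ w * x on one client's own data."""
    for _ in range(epochs):
        # Gradient of the mean squared error with respect to w.
        grad = sum(2 * (w * x - y) * x for x, y in data) / len(data)
        w -= lr * grad
    return w


def federated_average(w: float, clients: List[List[Sample]], rounds: int = 20) -> float:
    """Each round, clients train locally and the server averages their weights.

    Only the updated weight and the sample count leave a client;
    the raw (x, y) samples are never sent to the server.
    """
    for _ in range(rounds):
        updates = [(local_update(w, data), len(data)) for data in clients]
        total = sum(n for _, n in updates)
        # Average weighted by how many samples each client holds.
        w = sum(w_i * n for w_i, n in updates) / total
    return w


# Hypothetical toy data: every client observes the same relation y = 2x.
clients = [
    [(1.0, 2.0), (2.0, 4.0)],
    [(3.0, 6.0)],
    [(0.5, 1.0), (1.5, 3.0), (2.5, 5.0)],
]
w_final = federated_average(0.0, clients)
```

In this toy setup `w_final` ends up very close to 2.0, the slope shared by all clients, even though the server only ever sees model weights.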
-
Amazon Rekognition Introduces Streaming Video Events
AWS recently announced the general availability of Streaming Video Events, a new feature of Amazon Rekognition that provides real-time alerts on live video streams.
-
New GraphWorld Tool Accelerates Graph Neural-Network Benchmarking
Google AI has recently released GraphWorld, a tool to accelerate performance benchmarking in the area of graph neural networks (GNNs). GraphWorld is a configurable framework to generate graphs with a variety of structural properties, such as different node degree distributions and Gini index values.
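As a rough illustration of one such structural property, the sketch below computes a Gini index over a graph's node degrees, assuming the index is taken over the degree distribution. The helper names `node_degrees` and `degree_gini` are hypothetical and are not part of GraphWorld.

```python
from collections import Counter
from typing import Iterable, List, Tuple


def node_degrees(edges: Iterable[Tuple[int, int]]) -> List[int]:
    """Count how many undirected edges touch each node."""
    counts = Counter()
    for u, v in edges:
        counts[u] += 1
        counts[v] += 1
    return list(counts.values())


def degree_gini(degrees: List[int]) -> float:
    """Gini index of a degree list: 0.0 means all nodes have equal degree."""
    values = sorted(degrees)
    n = len(values)
    total = sum(values)
    if n == 0 or total == 0:
        return 0.0
    # Standard formula over ascending values with 1-based ranks.
    ranked = sum((i + 1) * d for i, d in enumerate(values))
    return (2 * ranked) / (n * total) - (n + 1) / n


# A 4-node cycle (every node has degree 2) and a 5-node star (one hub).
cycle_edges = [(0, 1), (1, 2), (2, 3), (3, 0)]
star_edges = [(0, 1), (0, 2), (0, 3), (0, 4)]

cycle_gini = degree_gini(node_degrees(cycle_edges))
star_gini = degree_gini(node_degrees(star_edges))
```

The cycle yields an index of 0 because all degrees are equal, while the star yields a higher value because a single hub holds half of all edge endpoints.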
-
TensorFlow DTensor: Unified API for Distributed Deep Network Training
The recently released TensorFlow v2.9 introduces DTensor, a new API for model-, data-, and space-parallel (aka spatially tiled) deep network training. DTensor aims to decouple sharding directives from the model code by providing higher-level utilities to partition the model and batch parameters across devices.
-
Meta AI’s New Data Set to Accelerate Renewable Energy Catalyst Discovery for Hydrogen Fuel
Meta AI recently announced that it will soon release an entirely new data set for green hydrogen fuel ML modeling and simulation, focused on oxide catalysts for the oxygen evolution reaction (OER), a critical chemical reaction used in green hydrogen fuel production via wind and solar energy.
-
Google Announces General Availability of Cloud TPU VMs
Last year Google introduced Cloud TPU Virtual Machines (VMs) in preview, which provide direct access to TPU host machines. Today, Cloud TPU VMs are generally available, including the new TPU Embedding API, which can accelerate ML-based ranking and recommendation workloads.
-
Amazon SageMaker Serverless Inference Now Generally Available
Amazon recently announced that SageMaker Serverless Inference is generally available. Designed for workloads with intermittent or infrequent traffic patterns, the new option provisions and scales compute capacity according to the volume of inference requests the model receives.
-
Serving Deep Networks in Production: Balancing Productivity vs Efficiency Tradeoff
A recently published work provides an alternative modality for serving deep neural networks. It enables the use of eager-mode model code directly in production workloads by using embedded CPython interpreters. The goal is to reduce the engineering effort needed to bring models from the research stage to the end user and to create a proof-of-concept platform for migrating future numerical libraries.
-
From Natural Language Queries to Insights: GCP BigQuery Data QnA Usage in Twitter
The Twitter engineering team has shared architectural details of their Qurious data insights platform and its advantages for real-time analysis. Designed for internal business customers, the platform allows users to analyze Twitter’s BigQuery data using natural language queries and create dashboards.