InfoQ Homepage AI, ML & Data Engineering Content on InfoQ
-
Google Introduces New AI Features in Workspace
Google’s latest AI developments are aimed at assisting employees in focusing on what matters, collaborating securely, and strengthening human relationships across all work modes and locations.
-
Allen Institute for AI Open-Sources AI Model Inspection Tool LM-Debugger
The Allen Institute for AI (AI2) open-sourced LM-Debugger, an interactive tool for interpreting and controlling the output of language model (LM) predictions. LM-Debugger supports any HuggingFace GPT-2 model and allows users to intervene in the text generation process by dynamically modifying updates in the hidden layers of the model's neural network.
-
New GraphWorld Tool Accelerates Graph Neural-Network Benchmarking
Google AI has recently released GraphWorld, a tool to accelerate performance benchmarking in the area of graph neural networks (GNNs). GraphWorld is a configurable framework to generate graphs with a variety of structural properties like different node degree distributions and Gini index.
-
TensorFlow DTensor: Unified API for Distributed Deep Network Training
Recently released TensorFlow v2.9 introduces a new API for the model, data, and space-parallel (aka spatially tiled) deep network training. DTensor aims to decouple sharding directives from the model code by providing higher-level utilities to partition the model and batch parameters between devices.
-
Amazon Releases 51-Language AI Training Dataset MASSIVE
Amazon Alexa AI's Natural Language Understanding group released Multilingual Amazon SLURP (SLU resource package) for Slot Filling, Intent Classification, and Virtual-Assistant Evaluation (MASSIVE), a dataset for training natural language understanding (NLU) AI models that contains one million annotated samples from 51 languages. The release also includes code and tools for using the data.
-
Meta AI’s New Data Set to Accelerate Renewable Energy Catalyst Discovery for Hydrogen Fuel
Meta AI recently announced that it will soon release an entirely new data set for green hydrogen fuel ML modeling and simulation, focused on oxide catalysts for the oxygen evolution reaction (OER), a critical chemical reaction used in green hydrogen fuel production via wind and solar energy.
-
DeepMind Introduces Gato, a New Generalist AI Agent
Gato, as the agent is known, is DeepMinds’s generalist AI that can perform many different tasks that humans can do, without carving a niche for itself as an expert on one task. Gato can perform more than 600 different tasks, such as playing video games, captioning images and moving real-world robotic arms. Gato is a multi-modal, multi-task, multi-embodiment generalist policy.
-
LAION Releases Five Billion Image-Text Pair Dataset LAION-5B
The Large-scale Artificial Intelligence Open Network (LAION) released LAION-5B, an AI training dataset containing over five billion image-text pairs. LAION-5B contains images and captions scraped from the internet and is 14x larger than its predecessor LAION-400M, making it the largest freely available image-text dataset.
-
Google Announces General Availability of Cloud TPU VMs
Last year Google introduced Cloud TPU Virtual Machines (VMs), which provide direct access to TPU host machines in preview. Today, Cloud TPU VMs are generally available, including the new TPU Embedding API, which can accelerate ML Based ranking and recommendation workloads.
-
Green Software Development: Terminology and Climate Commitments Explained by Microsoft at Devoxx UK
As a side effect of the accelerated move towards the cloud, the software industry is contributing more and more to global warming. Companies have taken on different commitments: Net-Zero, Carbon Neutral, etc. Asim Hussain, Green Cloud Advocacy Lead @ Microsoft deciphers them during the Devoxx UK keynote. Understanding them will help developers move the needle for each type of commitment.
-
DeepMind Trains AI Controller for Nuclear Fusion Research Device
Researchers at Google subsidiary DeepMind and the Swiss Plasma Center at EPFL have developed a deep reinforcement learning (RL) AI that creates control algorithms for tokamak devices used in nuclear fusion research. The system learned control policies while interacting with a simulator, and when used to control a real device was able to achieve novel plasma configurations.
-
Amazon SageMaker Serverless Inference Now Generally Available
Amazon recently announced that SageMaker Serverless Inference is generally available. Designed for workloads with intermittent or infrequent traffic patterns, the new option provisions and scales compute capacity according to the volume of inference requests the model receives.
-
Amazon MSK Serverless Now Generally Available
AWS recently announced that Amazon MSK Serverless is now generally available. The serverless option to manage an Apache Kafka cluster removes the need to monitor capacity and automatically balances partitions within a cluster.
-
Serving Deep Networks in Production: Balancing Productivity vs Efficiency Tradeoff
A recently published work provides an alternative modality for serving deep neural networks. It enables utilizing eager-mode model code directly at production workloads by using embedded CPython interpreters. The goal is to reduce the engineering effort to bring the models from the research stage to the end-user and to create a proof-of-concept platform for migrating future numerical libraries.
-
Microsoft Rebrands its Data Governance Service to Microsoft Purview
Recently, Microsoft announced Microsoft Purview, a new product branding bringing together the Azure Purview data governance service with various Microsoft 365 compliance solutions.
Resources
[eBook] Application Modernization Patterns & Anti-Patterns
This Intellyx eBook will explore four key consideration factors for modernizing applications, so we can observe and learn from the successes and mistakes of other organizations. Download Now.
The App Architect’s 11-Step Strategic Guide to Digital Transformation
Outside of the technical expertise required, what are the components of a successful project? This guide covers 11 best practices for app architects to consider when charged with digital transformation efforts. Download Now.
Intesa Sanpaolo Accelerates Microservice & PaaS Transformation of Mission Critical Applications with vFunction
In this case study we will describe the challenges, how Intesa Sanpaolo decided to convert one of its main business-critical applications from a monolithic application to microservices, and how a platform called vFunction helped to turn this challenge into a success. Download Now.
vFunction Data Sheet
AI For App Modernization. Transform complex monolithic Java & .NET* applications into mini/microservices. Learn More.