InfoQ Homepage AI, ML & Data Engineering Content on InfoQ
-
NVIDIA Kubernetes Device Plug-in Brings Temporal GPU Concurrency
Starting from the v12 release, the Nvidia GPU device plug-in framework started supporting time-sliced sharing between CUDA workloads on Kubernetes. This feature aims to prevent under-utilization of GPU units and make it easier to scale applications by leveraging concurrently-executing CUDA contexts.
-
AWS Announces Clean Rooms for Secure Collaboration with Analytics Data
During the recent re:Invent conference, AWS announced the preview of Clean Rooms for analytics data. The new service provides safe environments where multiple customers can securely share and analyze data with control of how the data is used, reducing the risk of sharing personal data.
-
OpenAI Releases Conversational AI Model ChatGPT
OpenAI released ChatGPT, a conversational AI model based on their GPT-3.5 language model (LM). ChatGPT is fine-tuned using Reinforcement Learning from Human Feedback (RLHF) and includes a moderation filter to block inappropriate interactions.
-
Wayve's End-to-End Deep Learning Model for Self-Driving Cars
Wayve released a state-of-the-art end-to-end model for learning a world model and vehicular driving policy based on simulation data from CARLA, allowing autonomy to cars without HD maps. Wayve’s new Model-based Imitation Learning (MILE) is a machine-learning model, specifically a reinforcement learning architecture, that learns a model of the world and a driving policy during offline training.
-
Meta MultiRay Allows Efficiency on Large-Scale AI Models
Meta developed MultiRay, a platform that allows the cost-effective running state-of-the-art machine learning models. MultiRay allows models to run on the same input in order to share the majority of the running cost with a little addictive cost per model.
-
Microsoft Open-Sources Agricultural AI Toolkit FarmVibes.AI
Microsoft Research recently open-sourced FarmVibes.AI, a suite of ML models and tools for sustainable agriculture. FarmVibes.AI includes data processing workflows for fusing multiple sets of spatiotemporal and geospatial data, such as weather data and satellite and drone imagery.
-
Recap of AWS re:Invent 2022
After a virtual-only event in 2020 and a reduced-size 2021 edition, re:Invent was back last week in Las Vegas with over 50,000 attendees for the 11th edition. During multiple sessions and keynotes at the largest AWS yearly conference, the cloud provider announced new services and features, with the focus more on business solutions and data options than new building blocks.
-
AWS Announces the General Availability of Amazon Omics
At re:Invent, AWS announced the general availability of Amazon Omics, a managed service for storage, analysis, and elaboration of genomic, transcriptomic, and other omics data. The service is designed for healthcare and life science organizations to enhance patient care and advance scientific research.
-
Apple Adds Core ML Support for Stable Diffusion on Apple Silicon
Apple released a set of optimizations to Core ML to enable running the Stable Diffusion text-to-image model on Apple Silicon-powered devices running the latest iOS or macOS versions, respectively iOS 16.2 and macOS 13.1.
-
Open Source SkyPilot Targets Cloud Cost Optimization for ML and Data Science
A team of researchers at the RISELab at UC Berkeley recently released Skypilot, an open-source framework for running machine learning workloads on the major cloud providers through a unified interface. The project focuses on cost optimization automatically finding the cheapest availability zone, region, and provider for the requested resources.
-
AWS Announces DataZone, a New Data Management Service to Govern Data
At AWS re:Invent, Amazon Web Services announced Amazon DataZone, a new data management service that makes it faster and easier for customers to catalog, discover, share, and govern data stored across AWS, on-premises, and third-party sources.
-
Google's Code-as-Policies Lets Robots Write Their Own Code
Researchers from Google's Robotics team have open-sourced Code-as-Policies (CaP), a robot control method that uses a large language model (LLM) to generate robot-control code that achieves a user-specified goal. CaP uses a hierarchical prompting technique for code generation that outperforms previous methods on the HumanEval code-generation benchmark.
-
Galactica: Large Language Model for Scientific Knowledge
Meta AI and Papers with Code recently released Galactica, a 120-billion-parameter scientific-language model which can search and summarize academic literature, solve math problems, and write scientific code. Galactica’s architecture is based on a transformer, an attention mechanism which draws global dependencies between input and output.
-
Neo4j 5 Graph Database Supports Automated Scaling and Deployment Interoperability
Graph NoSQL database Neo4j version 5 supports query language and performance improvements, automated scale-up and scale-out capabilities, and cross-platform deployment interoperability. Neo4j team announced last week the general availability (GA) of latest product release of the graph database.
-
Google Open-Sources Secure ML Operating System KataOS
Google's AmbiML team recently open-sourced KataOS, a provably secure operating system for embedded ML hardware. KataOS is based on the seL4 microkernel and is implemented in Rust. Along with KataOS, Google is releasing Sparrow, a reference implementation of the operating system targeted for a secure hardware platform based on the RISC-V architecture.