AI, ML & Data Engineering Content on InfoQ
-
Google Announces General Availability of Cloud TPU VMs
Last year Google introduced Cloud TPU Virtual Machines (VMs) in preview, providing direct access to TPU host machines. Today, Cloud TPU VMs are generally available, including the new TPU Embedding API, which can accelerate ML-based ranking and recommendation workloads.
-
Green Software Development: Terminology and Climate Commitments Explained by Microsoft at Devoxx UK
As a side effect of the accelerated move towards the cloud, the software industry is contributing more and more to global warming. Companies have taken on different commitments: Net-Zero, Carbon Neutral, etc. Asim Hussain, Green Cloud Advocacy Lead at Microsoft, deciphered them during the Devoxx UK keynote. Understanding them will help developers move the needle for each type of commitment.
-
DeepMind Trains AI Controller for Nuclear Fusion Research Device
Researchers at Google subsidiary DeepMind and the Swiss Plasma Center at EPFL have developed a deep reinforcement learning (RL) AI that creates control algorithms for tokamak devices used in nuclear fusion research. The system learned control policies while interacting with a simulator and, when used to control a real device, was able to achieve novel plasma configurations.
-
Amazon SageMaker Serverless Inference Now Generally Available
Amazon recently announced that SageMaker Serverless Inference is generally available. Designed for workloads with intermittent or infrequent traffic patterns, the new option provisions and scales compute capacity according to the volume of inference requests the model receives.
-
Amazon MSK Serverless Now Generally Available
AWS recently announced that Amazon MSK Serverless is now generally available. The serverless option to manage an Apache Kafka cluster removes the need to monitor capacity and automatically balances partitions within a cluster.
-
Serving Deep Networks in Production: Balancing Productivity vs Efficiency Tradeoff
A recently published paper describes an alternative approach to serving deep neural networks. It allows eager-mode model code to be used directly in production workloads by means of embedded CPython interpreters. The goal is to reduce the engineering effort needed to bring models from the research stage to the end user and to create a proof-of-concept platform for migrating future numerical libraries.
-
Microsoft Rebrands its Data Governance Service to Microsoft Purview
Recently, Microsoft announced Microsoft Purview, a new product branding bringing together the Azure Purview data governance service with various Microsoft 365 compliance solutions.
-
NVIDIA Announces Next Generation AI Hardware H100 GPU and Grace CPU Superchip
At the recent GTC conference, NVIDIA announced their next generation processors for AI computing, the H100 GPU and the Grace CPU Superchip. Based on NVIDIA's Hopper architecture, the H100 includes a Transformer engine for faster training of AI models. The Grace CPU Superchip features 144 Arm cores and outperforms NVIDIA's current dual-CPU offering on the SPECrate 2017_int_base benchmark.
-
AWS Releases its Digital Twin Service IoT TwinMaker into General Availability
During the recent AWS Summit in San Francisco, AWS announced the general availability of its digital twin service, AWS IoT TwinMaker.
-
Google Trains 540 Billion Parameter AI Language Model PaLM
Google Research recently announced the Pathways Language Model (PaLM), a 540-billion-parameter AI natural language processing (NLP) model that surpasses average human performance on the BIG-bench benchmark. PaLM outperforms other state-of-the-art systems on many evaluation tasks, and shows strong results on tasks such as logical inference and joke explanation.
-
Google Announces AI-Generated Summaries for Google Docs
Google has announced a new feature for their Docs app that will automatically generate a summary of the document content. The summarization is powered by a natural language processing (NLP) AI model based on the Transformer architecture.
-
Ten Lessons from Three Generations of Tensor Processing Units
A recent report published by Google’s TPU group highlights ten takeaways from developing three generations of tensor processing units. The authors also discuss how their previous experience will affect the development of future tensor processing units.
-
Google Launches a New Cross-Platform Data Storage Engine BigLake in Preview
At the recent Cloud Data Summit, Google announced the preview of BigLake, a new data lake storage engine that makes it easier for enterprises to analyze the data in their data warehouses and data lakes.
-
Stanford University Publishes AI Index 2022 Annual Report
Stanford University’s Institute for Human-Centered Artificial Intelligence (HAI) has published its 2022 AI Index annual report. The report identifies top trends in AI, including new technical achievements, a sharp increase in private investment, and increasing attention on ethical issues.
-
Microsoft Introduces Open Data for Social Impact Framework
Microsoft recently introduced the Open Data for Social Impact Framework, a guide to help organizations put data to work to get new insights, make better decisions, and improve efficiency while tackling pressing social issues. The framework includes a five-step roadmap that organizations can use to get started.