InfoQ Homepage AI, ML & Data Engineering Content on InfoQ
-
OpenAI Releases 1.6 Billion Parameter Multilingual Speech Recognition AI Whisper
OpenAI recently released Whisper, a 1.6 billion parameter AI model that can transcribe and translate speech audio from 97 different languages. Whisper was trained on 680,000 hours of audio data collected from the web and shows robust zero-shot performance on a wide range of automated speech recognition (ASR) tasks.
-
Microsoft Trains Two Billion Parameter Vision-Language AI Model BEiT-3
Researchers from Microsoft's Natural Language Computing (NLC) group announced the latest version of Bidirectional Encoder representation from Image Transformers: BEiT-3, a 1.9B parameter vision-language AI model. BEiT-3 models images as another language and achieves state-of-the-art performance on a wide range of downstream tasks.
-
AWS DataSync Discovery Preview Edition Supports Automated Data Collection and Storage Recommendation
Amazon is announcing the public preview of AWS DataSync Discovery. This new feature of AWS DataSync enables users to better understand on-premises storage usage through automated data collection and analysis, quickly identify data to migrate, and evaluate recommended AWS Storage services for data.
-
Google Open-Sources Natural Language Robot Control Method SayCan
Researchers from Google's Robotics team have open-sourced SayCan, a robot control method that uses a large language model (LLM) to plan a sequence of robotic actions to achieve a user-specified goal. In experiments, SayCan generated the correct action sequence 84% of the time.
-
Amazon Announced Promotion Feature in Its Personalize Service
Amazon web services has recently announced a promotion feature in personalize to explicitly recommend specific items based on business rules. Amazon Personalize enables businesses to improve customer engagement and monetization metrics by recommending personalized items to the customers.
-
Amazon Is Adding Visual Conversation Builder for Amazon Lex
Amazon is introducing the Visual Conversation Builder for Amazon Lex, a drag and drop interface to visualize and build conversation flows in a no-code environment. The Visual Conversation Builder greatly simplifies bot design.
-
Amazon SageMaker Provides New Built-in TensorFlow Image Classification Algorithms
Amazon is announcing a new built-in TensorFlow algorithm for image classification in Amazon Sagemaker. The supervised learning algorithm supports transfer learning for many pre-trained models available in TensorFlow Hub.
-
Netflix’s New Algorithm Offers Optimal Recommendation Lists for Users with Finite Time Budget
Netflix developed a new machine learning algorithm based on reinforcement learning to create an optimal list of recommendations considering a finite time budget for the user. In a recommendation use case, often the factor of finite time to make a decision is ignored.
-
MIT Researchers Develop AI Model to Solve University-Level Mathematics Problems
Researchers at MIT have developed an AI model that can solve problems used in university-level mathematics courses. The system uses the OpenAI Codex engine to generate programs that output the problem solution, including graphs and plots, achieving an accuracy of 81% on the MATH benchmark dataset as well as on real problems from MIT courses.
-
Near-Optimal Scaling of Large Deep Network Training on Public Cloud
A recently published study, MiCS, provides experimental evidence that the infrastructure used to carry out model training should be taken into account, especially for large deep neural networks trained on the public cloud. The article shows distributing the model weights unevenly between GPUs decreases inter-node communication overhead on AWS V100 and A100 instances.
-
Stability AI Open-Sources Image Generation Model Stable Diffusion
Stability AI released the pre-trained model weights for Stable Diffusion, a text-to-image AI model, to the general public. Given a text prompt, Stable Diffusion can generate photorealistic 512x512 pixel images depicting the scene described in the prompt.
-
Amazon Announces the Improvement of ML Models to Better Identify Sensitive Data on Amazon Macie
Amazon is announcing a new capability to create allow lists in Amazon Macie. Now text or text patterns not desire for Macie to report as sensitive data can be specified in allow lists. Amazon Macie is a fully managed data security and data privacy service that uses machine learning and pattern matching to discover and protect sensitive data in AWS.
-
Microsoft Releases SynapseML 0.1.0 with .NET and Cognitive Services Support
Microsoft announced the first .NET-compatible version of SynapseML, a new machine learning (ML) library for Apache Spark distributed processing platform. Version 0.1.0 of the SynapseML library adds support for .NET bindings, allowing .NET developers to write ML pipelines in their preferred language.
-
Amazon Launches What-If Analyses for Machine Learning Forecasting Service Amazon Forecast
Amazon is announcing that now its time-series machine learning based forecasting service Amazon Forecast can run what-if assessments to determine how different business scenarios can affect demand estimates. What-if analysis is an effective business technique for simulating hypothetical scenarios and stress testing on planning assumptions by recording potential outcomes.
-
AWS Announced New Feature Fine-Grained Visual Embedding for Amazon QuickSight
Recently, AWS announced a new feature, Fine-Grained Visual Embedding, for its cloud-scale business intelligence (BI) service Amazon QuickSight allowing customers to embed individual visualizations from Amazon QuickSight dashboards in high-traffic webpages and applications.