AI, ML & Data Engineering Content on InfoQ
-
Amazon SageMaker Clarify Now Supports Online Explainability for ML Predictions
Amazon is announcing that Amazon SageMaker Clarify now supports online explainability, providing explanations for a machine learning model’s individual predictions in near real time on live endpoints.
-
Apache InLong: Integration Framework for Massive Data
Apache InLong, an integration framework designed for massive data, was originally built at Tencent, where it was used in production for more than eight years to support massive data reporting services in big data scenarios. The project officially graduated as an Apache top-level project three years after entering the Apache Incubator.
-
University Researchers Publish Results of NLP Community Metasurvey
Researchers from New York University, the University of Washington, and Johns Hopkins University have published the results of the NLP Community Metasurvey, which compiles the opinions of 480 active NLP researchers on several issues in the natural language processing AI field. The survey also includes meta-questions about the perceived opinions of other researchers.
-
Grab Shared Its Experience in Designing Distributed Data Platform
GrabApp is an application through which customers select and buy their daily needs from merchants. To be scalable and manageable, the data platform and data ingestion should be designed as a distributed, fault-tolerant system. In designing this data platform, two classes of data stores are considered: OLTP and OLAP.
-
Confluent Introduces Stream Governance Advanced to Safely Extend Data Streaming Power
Confluent recently announced new enhancements to its Stream Governance product that will improve engineering teams' ability to discover, understand, and trust real-time data. Organizations can use Stream Governance Advanced to resolve issues within complex pipelines more easily with point-in-time lineage.
-
Fitbit Health Solutions and Google Cloud Introduce Device Connect for Fitbit
Fitbit Health Solutions and Google Cloud have recently announced the release of Device Connect for Fitbit, which will provide healthcare and life sciences enterprises with accelerated analytics and insights to help people live healthier lives.
-
Google AI Introduces TensorStore, a High-Performance Open-Source Library for Array Storage
Google has introduced TensorStore, an open-source C++ and Python library designed for reading and writing large multidimensional arrays.
-
Transformers Can Mock Part of Human Brain
In recent years, neuroscientists have tried many types of neural networks to model the firing of neurons in the human brain. In a recent project, two researchers, Whittington and Behrens, found that the hippocampus, a brain structure critical to memory, works as a particular kind of artificial neural network called a transformer.
-
OpenAI Releases 1.6 Billion Parameter Multilingual Speech Recognition AI Whisper
OpenAI recently released Whisper, a 1.6 billion parameter AI model that can transcribe and translate speech audio from 97 different languages. Whisper was trained on 680,000 hours of audio data collected from the web and shows robust zero-shot performance on a wide range of automated speech recognition (ASR) tasks.
-
Microsoft Trains Two Billion Parameter Vision-Language AI Model BEiT-3
Researchers from Microsoft's Natural Language Computing (NLC) group announced the latest version of Bidirectional Encoder representation from Image Transformers: BEiT-3, a 1.9B parameter vision-language AI model. BEiT-3 models images as another language and achieves state-of-the-art performance on a wide range of downstream tasks.
-
AWS DataSync Discovery Preview Edition Supports Automated Data Collection and Storage Recommendation
Amazon is announcing the public preview of AWS DataSync Discovery. This new feature of AWS DataSync enables users to better understand on-premises storage usage through automated data collection and analysis, quickly identify data to migrate, and evaluate recommended AWS Storage services for data.
-
Google Open-Sources Natural Language Robot Control Method SayCan
Researchers from Google's Robotics team have open-sourced SayCan, a robot control method that uses a large language model (LLM) to plan a sequence of robotic actions to achieve a user-specified goal. In experiments, SayCan generated the correct action sequence 84% of the time.
-
Amazon Announced Promotion Feature in Its Personalize Service
Amazon Web Services has recently announced a promotion feature in Amazon Personalize that explicitly recommends specific items based on business rules. Amazon Personalize enables businesses to improve customer engagement and monetization metrics by recommending personalized items to customers.
-
Amazon Is Adding Visual Conversation Builder for Amazon Lex
Amazon is introducing the Visual Conversation Builder for Amazon Lex, a drag-and-drop interface for visualizing and building conversation flows in a no-code environment. The Visual Conversation Builder greatly simplifies bot design.
-
Amazon SageMaker Provides New Built-in TensorFlow Image Classification Algorithms
Amazon is announcing a new built-in TensorFlow algorithm for image classification in Amazon SageMaker. The supervised learning algorithm supports transfer learning for many pre-trained models available in TensorFlow Hub.