InfoQ Homepage AI, ML & Data Engineering Content on InfoQ
-
Katharine Jarmul on Machine Learning at the Edge
At the recent QCon Plus online conference, Katharine Jarmul gave a talk on federated machine learning titled "Machine Learning at the Edge." She covered several federated ML architectures and use cases, discussed pros and cons of federated ML, and presented tips on how to decide whether federated ML is a good solution for a given problem.
-
AWS Introduces Amazon Redshift Serverless
As part of a trend towards serverless analytics options, AWS announced the public preview of Amazon Redshift Serverless. The latest version of the managed data warehouse service targets deployments where it is difficult to manage capacity due to variable workloads or unpredictable spikes.
-
Microsoft Announces the General Availability of NDm A100 v4 Series Virtual Machines
Recently, Microsoft announced the general availability (GA) of a brand-new virtual machine (VM) series in Azure, the NDm A100 v4 Series - featuring NVIDIA A100 Tensor Core 80 GB GPUs. This high-performance computing (HPC) VM is designed to deliver high performance, scalability, and cost efficiency for various real-world HPC workloads.
-
Facebook Open-Sources GHN-2 AI for Fast Initialization of Deep-Learning Models
A team from Facebook AI Research (FAIR) and the University of Guelph have open-sourced an improved Graph HyperNetworks (GHN-2) meta-model that predicts initial parameters for deep-learning neural networks. GHN-2 executes in less than a second on a CPU and predicts values for computer vision (CV) networks that achieve up to 77% top-1 accuracy on CIFAR-10 with no additional training.
-
D2iQ Releases DKP 2.0 to Run Kubernetes Apps at Scale
D2iQ recently released version 2.0 of the D2iQ Kubernetes Platform (DKP), a platform to help organizations run Kubernetes workloads at scale. The new release provides a single pane of glass for managing multi-cluster environments and running applications across any infrastructure including private cloud, public cloud, or at the network edge.
-
PyTorch 1.10 Release Includes CUDA Graphs APIs, Compiler Improvements, and Android NNAPI Support
PyTorch, Facebook's open-source deep-learning framework, announced the release of version 1.10 which includes an integration with CUDA Graphs APIs and JIT compiler updates to increase CPU performance, as well as beta support for the Android Neural Networks API (NNAPI). New versions of domain-specific libraries TorchVision and TorchAudio were also released.
-
QCon Plus ML Panel Discussion: ML in Production - What's Next?
The recent QCon Plus online conference featured a panel discussion titled "ML in Production - What's Next?" Some key takeaways were that many ML projects fail in production because of poor engineering infrastructure and a lack of intra-disciplinary communication, and that both model explainability and ML for edge computing are important technologies that are still not mature.
-
AWS Announces the Availability of EC2 Instances (G5) with NVIDIA A10G Tensor Core GPUs
Recently AWS announced the availability of new G5 instances, which feature up to eight NVIDIA A10G Tensor Core GPUs. These instances are powered by second-generation AMD EPYC processors.
-
Facebook Develops New AI Model That Can Anticipate Future Actions
Facebook unveiled its latest machine-learning process called Anticipative Video Transformer (AVT), which is able to predict future actions by using visual interpretation. AVT works as an end-to-end attention-based model for action anticipation in videos.
-
Roland Meertens on the Unreasonable Effectiveness of Zero Shot Learning
At the recent QCon Plus online conference, Roland Meertens gave a talk on developing AI-based applications titled "The Unreasonable Effectiveness of Zero Shot Learning." He demonstrated two examples of using foundation models and zero shot learning to rapidly deploy prototype applications and gain feedback without needing to gather large datasets and train models.
-
Francesca Lazzeri on What You Should Know before Deploying ML in Production
At the recent QCon Plus online conference, Dr. Francesca Lazzeri gave a talk on machine learning operations (MLOps) titled "What You Should Know before Deploying ML in Production." She covered four key topics, including MLOps capabilities, open source integrations, machine-learning pipelines, and the MLFlow platform.
-
Google Document Translation Now Generally Available
Google Cloud recently announced the general availability of Document Translation, a new feature of Translation API Advanced that allows formatting of documents to be retained throughout the translation process.
-
What Machine Learning Can Do for Security
Machine learning can be applied in various ways in security, for instance, in malware analysis, to make predictions, and for clustering security events. It can also be used to detect previously unknown attacks with no established signature.
-
BigScience Research Workshop Releases AI Language Model T0
BigScience Research Workshop released T0, a series of natural language processing (NLP) AI models specifically trained for researching zero-shot multitask learning. T0 can often outperform models 6x larger on the BIG-bench benchmark, and can outperform the 16x larger GPT-3 on several other NLP benchmarks.
-
Microsoft Releases Azure Open AI Service Including Access to Powerful GPT-3 Models
At its recent Ignite conference, Microsoft announced the new Azure OpenAI Service in preview, allowing access to OpenAI’s API through the Azure platform. This new Azure Cognitive Service will give customers access to OpenAI’s powerful GPT-3 models, along with security, reliability, compliance, data privacy, and other enterprise-grade capabilities available through the Azure platform.