InfoQ Homepage AI, ML & Data Engineering Content on InfoQ

News

RSS Feed

Newer Older

AI, ML & Data Engineering

Facebook Open-Sources GHN-2 AI for Fast Initialization of Deep-Learning Models

A team from Facebook AI Research (FAIR) and the University of Guelph have open-sourced an improved Graph HyperNetworks (GHN-2) meta-model that predicts initial parameters for deep-learning neural networks. GHN-2 executes in less than a second on a CPU and predicts values for computer vision (CV) networks that achieve up to 77% top-1 accuracy on CIFAR-10 with no additional training.

Anthony Alford
on Nov 30, 2021
DevOps

D2iQ Releases DKP 2.0 to Run Kubernetes Apps at Scale

D2iQ recently released version 2.0 of the D2iQ Kubernetes Platform (DKP), a platform to help organizations run Kubernetes workloads at scale. The new release provides a single pane of glass for managing multi-cluster environments and running applications across any infrastructure including private cloud, public cloud, or at the network edge.

Mostafa Radwan
on Nov 28, 2021
AI, ML & Data Engineering

PyTorch 1.10 Release Includes CUDA Graphs APIs, Compiler Improvements, and Android NNAPI Support

PyTorch, Facebook's open-source deep-learning framework, announced the release of version 1.10 which includes an integration with CUDA Graphs APIs and JIT compiler updates to increase CPU performance, as well as beta support for the Android Neural Networks API (NNAPI). New versions of domain-specific libraries TorchVision and TorchAudio were also released.

Anthony Alford
on Nov 26, 2021
AI, ML & Data Engineering

QCon Plus ML Panel Discussion: ML in Production - What's Next?

The recent QCon Plus online conference featured a panel discussion titled "ML in Production - What's Next?" Some key takeaways were that many ML projects fail in production because of poor engineering infrastructure and a lack of intra-disciplinary communication, and that both model explainability and ML for edge computing are important technologies that are still not mature.

Anthony Alford
on Nov 23, 2021
Cloud

AWS Announces the Availability of EC2 Instances (G5) with NVIDIA A10G Tensor Core GPUs

Recently AWS announced the availability of new G5 instances, which feature up to eight NVIDIA A10G Tensor Core GPUs. These instances are powered by second-generation AMD EPYC processors.

Steef-Jan Wiggers
on Nov 19, 2021
AI, ML & Data Engineering

Facebook Develops New AI Model That Can Anticipate Future Actions

Facebook unveiled its latest machine-learning process called Anticipative Video Transformer (AVT), which is able to predict future actions by using visual interpretation. AVT works as an end-to-end attention-based model for action anticipation in videos.

Daniel Dominguez
on Nov 18, 2021
AI, ML & Data Engineering

Roland Meertens on the Unreasonable Effectiveness of Zero Shot Learning

At the recent QCon Plus online conference, Roland Meertens gave a talk on developing AI-based applications titled "The Unreasonable Effectiveness of Zero Shot Learning." He demonstrated two examples of using foundation models and zero shot learning to rapidly deploy prototype applications and gain feedback without needing to gather large datasets and train models.

Anthony Alford
on Nov 18, 2021
AI, ML & Data Engineering

Francesca Lazzeri on What You Should Know before Deploying ML in Production

At the recent QCon Plus online conference, Dr. Francesca Lazzeri gave a talk on machine learning operations (MLOps) titled "What You Should Know before Deploying ML in Production." She covered four key topics, including MLOps capabilities, open source integrations, machine-learning pipelines, and the MLFlow platform.

Anthony Alford
on Nov 16, 2021
Cloud

Google Document Translation Now Generally Available

Google Cloud recently announced the general availability of Document Translation, a new feature of Translation API Advanced that allows formatting of documents to be retained throughout the translation process.

Renato Losio
on Nov 14, 2021
Culture & Methods

What Machine Learning Can Do for Security

Machine learning can be applied in various ways in security, for instance, in malware analysis, to make predictions, and for clustering security events. It can also be used to detect previously unknown attacks with no established signature.

Ben Linders
on Nov 11, 2021
AI, ML & Data Engineering

BigScience Research Workshop Releases AI Language Model T0

BigScience Research Workshop released T0, a series of natural language processing (NLP) AI models specifically trained for researching zero-shot multitask learning. T0 can often outperform models 6x larger on the BIG-bench benchmark, and can outperform the 16x larger GPT-3 on several other NLP benchmarks.

Anthony Alford
on Nov 09, 2021
Cloud

Microsoft Releases Azure Open AI Service Including Access to Powerful GPT-3 Models

At its recent Ignite conference, Microsoft announced the new Azure OpenAI Service in preview, allowing access to OpenAI’s API through the Azure platform. This new Azure Cognitive Service will give customers access to OpenAI’s powerful GPT-3 models, along with security, reliability, compliance, data privacy, and other enterprise-grade capabilities available through the Azure platform.

Steef-Jan Wiggers
on Nov 07, 2021
Cloud

Amazon Releases DL1 Instances Powered by Gaudi Accelerators

Amazon recently announced the general availability of the EC2 DL1 instances powered by Gaudi accelerators from Habana Labs. The new instances promise better price performances in training deep learning models for use cases such as computer vision, natural language processing, autonomous vehicle perception and recommendation engines.

Renato Losio
on Nov 07, 2021
AI, ML & Data Engineering

Apache Spark Brings Pandas API with Version 3.2

The Apache Spark team has integrated the Pandas API in the product's latest 3.2 release. With this change, dataframe processing can be scaled to multiple clusters or multiple processors in a single machine using the PySpark execution engine.

Sabri Bolkar
on Nov 04, 2021
AI, ML & Data Engineering

Baidu Announces 11 Billion Parameter Chatbot AI PLATO-XL

Baidu recently announced PLATO-XL, an AI model for dialog generation, which was trained on over a billion samples collected from social media conversations in both English and Chinese. PLATO-XL achieves state-of-the-art performance on several conversational benchmarks, outperforming currently available commercial chatbots.

Anthony Alford
on Nov 02, 2021

Newer News

Older News

InfoQ Software Architects' Newsletter

News