Natural Language Processing Content on InfoQ

News

AI, ML & Data Engineering

EleutherAI Open-Sources Six Billion Parameter GPT-3 Clone GPT-J

A team of researchers from EleutherAI have open-sourced GPT-J, a six-billion-parameter natural language processing (NLP) AI model based on GPT-3. The model was trained on an 800GB open-source text dataset and performs comparably to a GPT-3 model of similar size.

Anthony Alford
on Jul 13, 2021
AI, ML & Data Engineering

Google Open-Sources Token-Free Language Model ByT5

Google Research has open-sourced ByT5, a natural language processing (NLP) AI model that operates on raw bytes instead of abstract tokens. Compared to baseline models, ByT5 is more accurate on several benchmark tasks and is more robust to misspellings and noise.

Anthony Alford
on Jul 06, 2021
AI, ML & Data Engineering

NLP Library spaCy 3.0 Features Transformer-Based Models and Distributed Training

AI software makers Explosion announced version 3.0 of spaCy, their open-source natural-language processing (NLP) library. The new release includes state-of-the-art Transformer-based pipelines and pre-trained models for 17 languages.

Anthony Alford
on Feb 23, 2021
AI, ML & Data Engineering

Google Open-Sources Trillion-Parameter AI Language Model Switch Transformer

Researchers at Google Brain have open-sourced the Switch Transformer, a natural-language processing (NLP) AI model. The model scales up to 1.6T parameters and trains up to 7x faster than the T5 NLP model, with comparable accuracy.

Anthony Alford
on Feb 16, 2021
AI, ML & Data Engineering

OpenAI Announces GPT-3 Model for Image Generation

OpenAI has trained a 12B-parameter AI model based on GPT-3 that can generate images from a textual description. The description can specify many independent attributes, including the position of objects and the image perspective, and the model can also synthesize combinations of objects that do not exist in the real world.

Anthony Alford
on Feb 02, 2021
AI, ML & Data Engineering

Facebook Open-Sources Multilingual Speech Recognition Deep-Learning Model

Facebook AI Research (FAIR) open-sourced Cross-Lingual Speech Recognition (XLSR), a multilingual speech recognition AI model. XLSR is trained on 53 languages and outperforms existing systems when evaluated on common benchmarks.

Anthony Alford
on Jan 26, 2021
AI, ML & Data Engineering

AWS Introduces HealthLake and Redshift ML in Preview

AWS introduced preview releases of the Amazon HealthLake service and an Amazon Redshift feature called Redshift ML during re:Invent 2020 in December. Amazon HealthLake is a data lake service that uses NLP to help healthcare, health insurance, and pharmaceutical companies derive value from their data. Redshift ML gives Redshift users a gateway into SageMaker.

Kovid Rathee
on Jan 21, 2021
AI, ML & Data Engineering

AI Models from Google and Microsoft Exceed Human Performance on Language Understanding Benchmark

Research teams from Google and Microsoft have recently developed natural language processing (NLP) AI models which have scored higher than the human baseline score on the SuperGLUE benchmark. SuperGLUE measures a model's score on several natural language understanding (NLU) tasks, including question answering and reading comprehension.

Anthony Alford
on Jan 12, 2021
AI, ML & Data Engineering

Rasa Announces Open Source AI Assistant Framework 2.0

Rasa, the customizable open source machine learning framework to automate text and voice-based AI assistants, has released version 2.0 with significant improvements to dialogue management, training data format, and interactive documentation. In addition, the latest release reduces the learning curve to get started while expanding configuration options for advanced users.

Uday Tatiraju
on Nov 10, 2020
AI, ML & Data Engineering

Large-Scale Multilingual AI Models from Google, Facebook, and Microsoft

Researchers from Google, Facebook, and Microsoft have published their recent work on multilingual AI models. Google and Microsoft have released models that achieve new state-of-the-art performance on NLP tasks measured by the XTREME benchmark, while Facebook has produced a non-English-centric many-to-many translation model.

Anthony Alford
on Nov 03, 2020
AI, ML & Data Engineering

AI Training Method Exceeds GPT-3 Performance with 99.9% Fewer Parameters

A team of scientists at LMU Munich have developed Pattern-Exploiting Training (PET), a deep-learning training technique for natural language processing (NLP) models. Using PET, the team trained a Transformer NLP model with 223M parameters that outperformed the 175B-parameter GPT-3 by over 3 percentage points on the SuperGLUE benchmark.

Anthony Alford
on Oct 06, 2020
AI, ML & Data Engineering

Microsoft Obtains Exclusive License for GPT-3 AI Model

Microsoft announced an agreement with OpenAI to license OpenAI's GPT-3 deep-learning model for natural-language processing (NLP). Although Microsoft's announcement says it has "exclusively" licensed the model, OpenAI will continue to offer access to the model via its own API.

Anthony Alford
on Sep 29, 2020
AI, ML & Data Engineering

Salesforce Releases Photon Natural Language Interface for Databases

A team of scientists from Salesforce Research and the Chinese University of Hong Kong have released Photon, a natural language interface to databases (NLIDB). The team used deep learning to construct a parser that achieves 63% accuracy on a common benchmark and an error-detecting module that prompts users to clarify ambiguous questions.

Anthony Alford
on Sep 08, 2020
AI, ML & Data Engineering

Google's BigBird Model Improves Natural Language and Genomics Processing

Researchers at Google have developed a new deep-learning model called BigBird that allows Transformer neural networks to process sequences up to 8x longer than previously possible. Networks based on this model achieved new state-of-the-art performance levels on natural-language processing (NLP) and genomics tasks.

Anthony Alford
on Sep 01, 2020
AI, ML & Data Engineering

AI Conference Recap: Facebook, Google, Microsoft, and Others at ACL 2020

At the recent Annual Meeting of the Association for Computational Linguistics (ACL), research teams from several tech companies, including Facebook, Google, Microsoft, Amazon, and Salesforce, presented nearly 200 papers out of a total of 779 on a wide variety of AI topics related to natural language processing (NLP).

Anthony Alford
on Aug 04, 2020
