InfoQ Homepage Natural Language Processing Content on InfoQ
-
AI Researchers' Open-Source Model Explanation Toolkit AllenNLP Interpret
Researchers from the Allen Institute for AI and University of California, Irvine, have released AllenNLP Interpret, a toolkit for explaining the results from NLP models. The extensible toolkit includes several built-in methods for interpretation and visualization components, as well as examples using AllenNLP to explain the results of state-of-the art NLP models including BERT and RoBERTa.
-
Google Releases Two New NLP Dialog Datasets
Researchers from Google AI released two new dialog datasets for natural-language processing (NLP) development: Coached Conversational Preference Elicitation (CCPE) and Taskmaster-1. The datasets contain thousands of conversations as well as labels and annotations for training digital assistants to better determine users' preferences and intentions.
-
Facebook Open-Sources RoBERTa: an Improved Natural Language Processing Model
Facebook AI open-sourced a new deep-learning natural-language processing (NLP) model, Robustly-optimized BERT approach (RoBERTa). Based on Google's BERT pre-training model, RoBERTa includes additional pre-training improvements that achieve state-of-the-art results on several benchmarks, using only unlabeled text from the world-wide web, with minimal fine-tuning and no data augmentation.
-
Amazon Introduces Two New Features for Polly: Neural Text-to-Speech and Newscaster Style
Recently, Amazon announced the general availability of Neural Text-to-Speech (NTTS) technology in their Polly service in AWS, which turns text into lifelike speech. Furthermore, Amazon Polly now also offers a Newscaster speaking style.
-
Baidu Open-Sources ERNIE 2.0, Beats BERT in Natural Language Processing Tasks
In a recent blog post, Baidu, the Chinese search engine and e-commerce giant, announced their latest open-source, natural language understanding framework called ERNIE 2.0. They also shared recent test results including achieving state-of-the art (SOTA) results and outperforming existing frameworks, including Google’s BERT and XLNet in 16 NLP tasks in both Chinese and English.
-
Google Releases TensorFlow.Text Library for Natural Language Processing
Google released a TensorFlow.Text, a new text-processing library for their TensorFlow deep-learning platform. The library allows several common text pre-processing activities, such as tokenization, to be handled by the TensorFlow graph computation system, improving consistency and portability of deep-learning models for natural-language processing.
-
OpenAI Introduces Sparse Transformers for Deep Learning of Longer Sequences
OpenAI has developed the Sparse Transformer, a deep neural-network architecture for learning sequences of data, including text, sound, and images. The networks can achieve state-of-the-art performance on several deep-learning tasks with faster training times.
-
AWS Releases Enhancements to AI Services for NLP, Speech-to-Text Transcription, and Image Detection
Amazon Web Services (AWS) released new features for three of its AI services: Amazon Comprehend, Amazon Rekognition, and Amazon Transcribe.
-
Google Expands ML Kit, Adds Smart Reply and Language Identification
In a recent Android blog post, Google announced the release of two new Natural Language Processing (NLP) features for ML Kit, including Language Identification and Smart Reply. In both cases, Google is providing domain-independent APIs that help developers analyze and generate text, speak and other types of Natural Language text.
-
Q&A on Condé Nast's Natural Language Processor and Content Analysis
Beginning in 2015, Condé Nast created a natural-language-processing and content-analysis engine to improve the metadata around content created across their 22 brands. The new system has led to a 30% increase in click-through rates. InfoQ spoke with Antonino Rau, a software engineer and technology manager at Condé Nast US about the evolution of their NLP-as-a-service system named HAL.
-
Facebook Open-Sources PyText NLP Modeling Framework
Facebook AI Research is open-sourcing PyText, a natural-language-processing (NLP) modeling framework that is used in the Portal video-calling device and M Suggestions in Facebook Messenger.
-
Facebook Open-Sources PyText for Faster Natural Language Processing Development
In a recent blog post, Facebook announced they have open-sourced PyText, a modeling framework, used in natural language processing (NLP) systems. PyText is a library built upon PyTorch and improves the effectiveness of promoting experimentation projects to large-scale production deployments.
-
Q&A with Christoph Windheuser on AI Applications in the Industry
Increased hardware power and huge amounts of data are making existing machine learning approaches like pattern recognition, natural language processing, and reinforcement learning possible. Artificial Intelligence is impacting the development process; it’s increasing the complexity of things like version control, CI/CD and testing.
-
AWS Marketplace Offers Machine Learning Algorithms and Model Packages
Amazon Web Services is offering machine learning algorithms and model packages on their AWS Marketplace. This was announced at AWS re:Invent Conference last week.
-
Google Open-Sources Speaker Diarization AI Technology, Claims 92% Accuracy
In a recent blog post, Google announced they have open-sourced their speaker diarization technology, which is able to differentiate people’s voices at a high accuracy rate. Google is able to do this by partitioning an audio stream that includes multiple participants into homogeneous segments per participant.