InfoQ Homepage Natural Language Processing Content on InfoQ

News

RSS Feed

Newer Older

AI, ML & Data Engineering

Google Applies NLP Algorithm BERT to Search

BERT, Google's latest NLP algorithm, will power Google search and make it better at understanding user queries in a way more similar to how humans would understand them, writes Pandu Nayak, Google fellow and vice president for Search, with one in 10 queries providing a different set of results.

Sergio De Simone
on Nov 05, 2019
AI, ML & Data Engineering

Microsoft and University of Maryland Researchers Announce FreeLB Adversarial Training System

Researchers from Microsoft and the University of Maryland (UMD) announced Free Large-Batch (FreeLB), a new adversarial training technique for deep-learning natural-language processing (NLP) systems that improves accuracy, increasing RoBERTa's scores on the General Language Understanding Evaluation (GLUE) benchmark and achieving the highest score on AI2 Reasoning Challenge (ARC) benchmark.

Anthony Alford
on Oct 29, 2019
AI, ML & Data Engineering

Facebook Releases AI Code Search Datasets

Facebook AI released a dataset containing coding questions paired with code-snippet answers, intended for evaluating AI-based natural-language code search systems. The release also includes benchmark results for several of Facebook's own code-search models and a training corpus of over 4 million Java methods parsed from over 24,000 GitHub repositories.

Anthony Alford
on Oct 15, 2019
AI, ML & Data Engineering

AI Researchers' Open-Source Model Explanation Toolkit AllenNLP Interpret

Researchers from the Allen Institute for AI and University of California, Irvine, have released AllenNLP Interpret, a toolkit for explaining the results from NLP models. The extensible toolkit includes several built-in methods for interpretation and visualization components, as well as examples using AllenNLP to explain the results of state-of-the art NLP models including BERT and RoBERTa.

Anthony Alford
on Oct 08, 2019
AI, ML & Data Engineering

Google Releases Two New NLP Dialog Datasets

Researchers from Google AI released two new dialog datasets for natural-language processing (NLP) development: Coached Conversational Preference Elicitation (CCPE) and Taskmaster-1. The datasets contain thousands of conversations as well as labels and annotations for training digital assistants to better determine users' preferences and intentions.

Anthony Alford
on Oct 01, 2019
AI, ML & Data Engineering

Facebook Open-Sources RoBERTa: an Improved Natural Language Processing Model

Facebook AI open-sourced a new deep-learning natural-language processing (NLP) model, Robustly-optimized BERT approach (RoBERTa). Based on Google's BERT pre-training model, RoBERTa includes additional pre-training improvements that achieve state-of-the-art results on several benchmarks, using only unlabeled text from the world-wide web, with minimal fine-tuning and no data augmentation.

Anthony Alford
on Sep 24, 2019
Cloud

Amazon Introduces Two New Features for Polly: Neural Text-to-Speech and Newscaster Style

Recently, Amazon announced the general availability of Neural Text-to-Speech (NTTS) technology in their Polly service in AWS, which turns text into lifelike speech. Furthermore, Amazon Polly now also offers a Newscaster speaking style.

Steef-Jan Wiggers
on Aug 09, 2019
AI, ML & Data Engineering

Baidu Open-Sources ERNIE 2.0, Beats BERT in Natural Language Processing Tasks

In a recent blog post, Baidu, the Chinese search engine and e-commerce giant, announced their latest open-source, natural language understanding framework called ERNIE 2.0. They also shared recent test results including achieving state-of-the art (SOTA) results and outperforming existing frameworks, including Google’s BERT and XLNet in 16 NLP tasks in both Chinese and English.

Kent Weare
on Aug 06, 2019
AI, ML & Data Engineering

Google Releases TensorFlow.Text Library for Natural Language Processing

Google released a TensorFlow.Text, a new text-processing library for their TensorFlow deep-learning platform. The library allows several common text pre-processing activities, such as tokenization, to be handled by the TensorFlow graph computation system, improving consistency and portability of deep-learning models for natural-language processing.

Anthony Alford
on Jul 16, 2019
AI, ML & Data Engineering

OpenAI Introduces Sparse Transformers for Deep Learning of Longer Sequences

OpenAI has developed the Sparse Transformer, a deep neural-network architecture for learning sequences of data, including text, sound, and images. The networks can achieve state-of-the-art performance on several deep-learning tasks with faster training times.

Anthony Alford
on May 21, 2019
AI, ML & Data Engineering

AWS Releases Enhancements to AI Services for NLP, Speech-to-Text Transcription, and Image Detection

Amazon Web Services (AWS) released new features for three of its AI services: Amazon Comprehend, Amazon Rekognition, and Amazon Transcribe.

Anthony Alford
on Apr 23, 2019
Emerging Technologies

Google Expands ML Kit, Adds Smart Reply and Language Identification

In a recent Android blog post, Google announced the release of two new Natural Language Processing (NLP) features for ML Kit, including Language Identification and Smart Reply. In both cases, Google is providing domain-independent APIs that help developers analyze and generate text, speak and other types of Natural Language text.

Kent Weare
on Apr 22, 2019
AI, ML & Data Engineering

Q&A on Condé Nast's Natural Language Processor and Content Analysis

Beginning in 2015, Condé Nast created a natural-language-processing and content-analysis engine to improve the metadata around content created across their 22 brands. The new system has led to a 30% increase in click-through rates. InfoQ spoke with Antonino Rau, a software engineer and technology manager at Condé Nast US about the evolution of their NLP-as-a-service system named HAL.

Reda Hmeid
on Mar 29, 2019
AI, ML & Data Engineering

Facebook Open-Sources PyText NLP Modeling Framework

Facebook AI Research is open-sourcing PyText, a natural-language-processing (NLP) modeling framework that is used in the Portal video-calling device and M Suggestions in Facebook Messenger.

Anthony Alford
on Feb 15, 2019
Emerging Technologies

Facebook Open-Sources PyText for Faster Natural Language Processing Development

In a recent blog post, Facebook announced they have open-sourced PyText, a modeling framework, used in natural language processing (NLP) systems. PyText is a library built upon PyTorch and improves the effectiveness of promoting experimentation projects to large-scale production deployments.

Kent Weare
on Dec 31, 2018

Newer News

Older News

InfoQ Software Architects' Newsletter

News