BT
Data Science Follow 539 Followers

Researchers Improve State of the Art in Image Recognition Using Data Set with 300 Million Images

by Roland Meertens Follow 4 Followers on  Jul 28, 2017

Researchers improved the state of the art results on several benchmarks with models trained on a generated data set with 300 million images instead of the 1 million normally used. To test what happens with more train data, Google created an internal dataset of 300 million images. They labelled the data automatically in a noisy way. The conclusion is that more training data indeed helps.

Data Science Follow 539 Followers

Netflix Announces Genie 3

by Dylan Raithel Follow 5 Followers on  Jul 11, 2017

Netflix announced major revisions and functionality in their Big Data distributed workflow management tool, Genie 3. In its newest version, Genie 3 supports scalable, config-driven data processing executables and task pipelines.

Data Science Follow 539 Followers

Scalable Chatbot Architecture with eBay ShopBot Shopping Assistant

by Srini Penchikala Follow 22 Followers on  Jul 09, 2017

Robert Enyedi, software engineer at eBay spoke at QCon New York 2017 Conference about ShopBot personal shopping assistant application. ShopBot, launched in late 2016 based on Facebook Messenger bot, leverages AI components and the eBay user data to provide shopping options in a conversational style.

Data Science Follow 539 Followers

Enhancing Google Maps with Deep Learning and Street View

by Srini Penchikala Follow 22 Followers on  Jun 13, 2017 1

Google's Ground Truth team recently announced a new Deep Learning model for the automatic extraction of information from geo-located image files to improve Google Maps. This neural network model achieved a higher accuracy in processing the challenging French Street Name Signs (FSNS) dataset.

Data Science Follow 539 Followers

Facebook Publishes New Neural Machine Translation Algorithm

by Roland Meertens Follow 4 Followers on  May 24, 2017

Facebook’s Artificial Intelligence Research team published research results using a new approach for neural machine translation (NMT). Their algorithm scores higher than any other system on three established machine translation tasks.

Data Science Follow 539 Followers

Developing Virtual Assistant Apps with Amazon Lex and Polly Deep Learning Technologies

by Srini Penchikala Follow 22 Followers on  May 22, 2017

Greg Bulmash from Amazon spoke at the OSCON 2017 Conference last week about developing your own virtual assistant applications using Amazon's Lex and Polly technologies.

Data Science Follow 539 Followers

Apache Metron Graduates to Top-Level Project

by Dylan Raithel Follow 5 Followers on  May 18, 2017

Hortonworks and Apache announce graduation of Metron, a realtime big data security platform to top-level project at the ASF.

Architecture & Design Follow 1381 Followers

Confluent Cloud, Apache Kafka as a Service in AWS

by Alex Giamas Follow 4 Followers on  May 18, 2017

Apache Kafka is a distributed, fault-tolerant pub sub messaging soltuion, originally developed by LinkedIn and open sourced. Confluent was formed by former LinkedIn engineers in the Kafka development group and today announced Confluent Cloud, a fully hosted and managed Apache Kafka as a Service in AWS. We also take a look at Confluent's second annual Streaming Data report and its findings.

Data Science Follow 539 Followers

Dani Traphagen on Next Phase of Distributed Systems with Apache Ignite

by Srini Penchikala Follow 22 Followers on  May 15, 2017

Dani Traphagen from GridGain spoke at OSCON 2017 Conference about Apache Ignite platform. She talked about the paradigm shift in viewing the disk as a bottleneck, the decreasing costs of memory and how to optimize toward the cache, leveraging it for microservices architectures with the open source project Apache Ignite.

Java Follow 629 Followers

Emerging Technologies for the Enterprise Conference 2017: Day Two Recap

by Michael Redlich Follow 10 Followers on  Apr 30, 2017

Day Two of the 12th annual Emerging Technologies for the Enterprise Conference was held in Philadelphia. This two-day event included keynotes by Blair MacIntyre (augmented reality pioneer) and Scott Hanselman (podcaster), and featured speakers Kyle Daigle (engineering manager at GitHub), Holden Karau (principal software engineer at IBM), and Karen Kinnear (JVM technical lead at Oracle).

Java Follow 629 Followers

Emerging Technologies for the Enterprise Conference 2017: Day One Recap

by Michael Redlich Follow 10 Followers on  Apr 24, 2017

Day One of the 12th annual Emerging Technologies for the Enterprise Conference was held on Tuesday, April 18 in Philadelphia, PA. This two-day event included keynotes by Blair MacIntyre (augmented reality pioneer) and Scott Hanselman (podcaster), and featured speakers Monica Beckwith (JVM consultant at Oracle), Yehuda Katz (co-creator of Ember.js), and Jessica Kerr (lead engineer at Atomist).

Data Science Follow 539 Followers

Data Preparation Pipelines: Strategy, Options and Tools

by Srini Penchikala Follow 22 Followers on  Apr 16, 2017

Data preparation is an important aspect of data processing and analytics use cases. Business analysts and data scientists spend about 80% of their time gathering and preparing the data rather than analyzing it or developing machine learning models. Kelly Stirman spoke last week at Enterprise Data World 2017 Conference about the data preparation best practices.

Data Science Follow 539 Followers

Google Announces Cloud Machine Learning API Updates

by Srini Penchikala Follow 22 Followers on  Apr 10, 2017

Google recently announced the Cloud Machine Learning API updates at the Google Cloud Next Conference. This includes a set of APIs in the areas of vision, video intelligence, speech, natural language, translation and job search.

Data Science Follow 539 Followers

Using Deep Learning Technologies IBM Reaches a New Milestone in Speech Recognition

by Srini Penchikala Follow 22 Followers on  Mar 31, 2017

The research team at IBM recently announced they've reached a new industry record at 5.5%, using the SWITCHBOARD linguistic corpus. This brings us closer to what's considered to be the human error rate, 5.1%. They used deep learning technologies and acoustic models to accomplish this milestone.

Data Science Follow 539 Followers

Netflix Demonstrates Big Data Analytics Infrastructure

by Andrew Morgan Follow 1 Followers on  Mar 21, 2017

At QCon San Francisco, engineers at Netflix discussed their big data strategy and analytics infrastructure. This included a summary of the scale of their data, their S3 data warehouse, and Genie, their big data federated orchestration system.

Login to InfoQ to interact with what matters most to you.


Recover your password...

Follow

Follow your favorite topics and editors

Quick overview of most important highlights in the industry and on the site.

Like

More signal, less noise

Build your own feed by choosing topics you want to read about and editors you want to hear from.

Notifications

Stay up-to-date

Set up your notifications and don't miss out on content that matters to you

BT