PIG  Content on InfoQ rss

Presentations about PIG rss

AI, ML & Data Engineering Follow 1068 Followers The Mechanics of Testing Large Data Pipelines by Mathieu Bastian Follow 0 Followers Posted on Apr 24, 2016 Mathieu Bastian explores the mechanics of testing large, complex data workflows and tries to identify the most common challenges developers face. He looks at good practices to develop unit, integration, data and performance tests for data workflows. In terms of tools, he looks at what exists today for Hadoop, Pig and Spark with code examples.

Followers Big Data Platform as a Service at Netflix by Jeff Magnusson Follow 0 Followers Posted on Nov 18, 2013 Jeff Magnusson takes a deep dive into key services of Netflix’s “data platform as a service” architecture, including RESTful services that: provide comprehensive metadata management across data sources (Franklin); enable visualization and caching of results of Hadoop jobs (Sting); and visualize the execution plans produced by languages such as Pig and Hive (Lipstick).

Articles about PIG rss

Followers Interview with Alex Holmes, author of “Hadoop in Practice. Second Edition” by Boris Lublinsky Follow 1 Followers Posted on Nov 20, 2014 The new “Hadoop in Practice. Second Edition” book by Alex Holmes provides a deep insight into Hadoop ecosystem covering a wide spectrum of topics such as data organization, layouts and serialization, data processing, including MapReduce and big data patterns, special structures along with their usage to simplify big data processing, and SQL on Hadoop data.