InfoQ Homepage MapReduce Content on InfoQ

Interviews

RSS Feed

Jeremy Pollack of Ancestry.com on Test-driven Development and More

Hadoop, the distributive file system and MapReduce are just a few of the topics covered in this interview recorded live at QCon San Francisco 2013. Industry-standard Agile implementation and a lot of testing, assures the development team at Ancestry.com that they have an app that can handle the large traffic demands of the popular genealogy site.

Jeremy Pollack
on Feb 15, 2014

Icon

30:52
Cliff Click on In-Memory Processing, 0xdata H20, Efficient Low Latency Java and GCs

Cliff Click explains 0xdata's H20, a clustering and in-memory math and statistics solution (available for Hadoop and standalone), writing H20's memory representation and compression in Java, low latency Java vs GCs, and much more.

Cliff Click
on Jan 10, 2014

Icon

29:36
Eva Andreasson on Hadoop, the Hadoop Ecosystem, Impala

Eva Andreasson explains the various Hadoop technologies and how they interact, real-time queries with Impala, the Hadoop ecosystem including Hue, Oozie, YARN, and much more.

Eva Andreasson
on Nov 11, 2013

Icon

27:41
Eli Collins on Hadoop

Eli Collins discusses Cloudera's CDH4 release, which tasks are well suited for Hadoop, Hadoop and MapReduce vs SQL, the state of Hadoop, and much more.

Eli Collins
on Aug 17, 2012

Icon

15:06
Stuart Halloway on Datomic, Clojure, Reducers

Stuart Halloway explains Datomic, programming transactional behavior with Datomic, Datalog and logic programming, programming with values, Clojure Reducers and much more.

Stuart Halloway
on Aug 15, 2012

Icon

34:08
Hadoop and NoSQL in a Big Data Environment

Ron Bodkin of Big Data Analytics discusses early adoption of Hadoop, NoSQL and big data technologies. He discusses common patterns and explains how developers can write low-level primitives to optimize MapReduce function. Other topics include Hive, Pig, multi tenancy, and security.

Ron Bodkin
on Feb 03, 2012

Icon

16:04
All things Hadoop

In this interview Ted Dunning talk about Hadoop, its current usage and its future. He explains the reasons for Hadoop's success and make recommendations on how to start using it.

Ted Dunning
on Feb 02, 2012

Icon

25:55
Ville Tuulos on Big Data and Map/Reduce in Erlang and Python with Disco

Ville Tuulos talks about Disco, the Map/Reduce framework for Python and Erlang, real-world data mining with Python, the advantages of Erlang for distributed and fault tolerant software, and more.

Ville Tuulos
on Jun 24, 2011

Icon

16:28
Rob Pike on Parallelism and Concurrency in Programming Languages

Rob Pike discusses concurrency in programming languages: CSP, channels, the role of coroutines, Plan 9, MapReduce and Sawzall, processes vs threads in Unix, and more programming language history.

Rob Pike
on Feb 17, 2011

Icon

32:07
Ron Bodkin on Big Data and Analytics

Ron Bodkin discusses big data architecture, real-time analytics, batch processing, map-reduce, and data science.

Ron Bodkin
on Jan 27, 2011

Icon

22:51
Laforge and Rocher Discuss the future of Groovy, Grails and Java

In this interview, Graeme Rocher and Guillaume Laforge of SpringSource talk about the present and future of the Grails framework and the Groovy language. Rocher talks about Grails 1.4 and some of its enhancements such as improvements to GORM. And Laforge discusses Groovy 1.8, which features new DSL authoring capabilities, among other things. They look at how Java’s future impacts their projects.

Graeme Rocher Guillaume LaForge
on Dec 03, 2010

Icon

22:23

Unlock the full InfoQ experience

Don't have an InfoQ account?

Topics

Expanding Swift from Apps to Services

Engineering Speed at Scale — Architectural Lessons from Sub-100-ms APIs

Beyond the Warehouse: Why BigQuery Alone Won’t Solve Your Data Problems

Scaling to 100+ as a Director: Lessons From Growing Engineering Organizations

From Alert Fatigue to Agent-Assisted Intelligent Observability

Helpful links

Choose your language

Interviews

Jeremy Pollack of Ancestry.com on Test-driven Development and More

Cliff Click on In-Memory Processing, 0xdata H20, Efficient Low Latency Java and GCs

Eva Andreasson on Hadoop, the Hadoop Ecosystem, Impala

Eli Collins on Hadoop

Stuart Halloway on Datomic, Clojure, Reducers

Hadoop and NoSQL in a Big Data Environment

All things Hadoop

Ville Tuulos on Big Data and Map/Reduce in Erlang and Python with Disco

Rob Pike on Parallelism and Concurrency in Programming Languages

Ron Bodkin on Big Data and Analytics

Laforge and Rocher Discuss the future of Groovy, Grails and Java

How CNAME Ordering in RFC Specs Caused Cloudflare 1.1.1.1 Outage

Expanding Swift from Apps to Services

Google Pushes for gRPC Support in Model Context Protocol

LinkedIn Leverages GitHub Actions, CodeQL, and Semgrep for Code Scanning

LinkedIn Re-Architects Service Discovery: Replacing Zookeeper with Kafka and xDS at Scale

GitHub Reworks Layered Defenses after Legacy Protections Block Legitimate Traffic

Getting Feedback from Test-Driven Development and Testing in Production

Scaling to 100+ as a Director: Lessons From Growing Engineering Organizations

The Technical Founder's Path: Code, Leadership, and Balance

Cloudflare Demonstrates Moltworker, Bringing Self-Hosted AI Agents to the Edge

Google Supercharges Gemini 3 Flash with Agentic Vision

Conductor Quantum Introduces Coda, a Natural Language Interface for Quantum Computing

Datadog Integrates Google Agent Development Kit into LLM Observability Tools

From Alert Fatigue to Agent-Assisted Intelligent Observability

Etleap Launches Iceberg Pipeline Platform to Simplify Enterprise Adoption of Apache Iceberg

QCon London

QCon AI Boston

QCon San Francisco

InfoQ Software Architects' Newsletter

Interviews