BT
Older Newer rss
01:11:51

How to Build Big Data Pipelines for Hadoop Using OSS

Posted by Costin Leau  on  Feb 08, 2013

Costin Leau discusses Big Data, current available tools for dealing with it, and how Spring can be used to create Big Data pipelines.

F# Big Data Scripting

Posted by Matthew Moloney  on  Jan 18, 2013 1

Matthew Moloney shares some of the F# tools built at Microsoft Research for dealing with Big Data.

The Evolving Panorama of Data

Posted by Rebecca Parsons  on  Jan 17, 2013 1

Rebecca Parsons proposes taking a different look at data, using different approaches and tools, then looks at some of the ways social data is used these days.

Scaling Scalability: Evolving Twitter Analytics

Posted by Dmitriy Ryaboy  on  Jan 13, 2013

Dmitriy Ryaboy shares some of the lessons learned scaling Twitter’s analytics infrastructure: Data loves a schema, Make data sources discoverable, and Make costs visible.

Lean Data Architecture: Minimize Investment, Maximize Value

Posted by Manvir Singh Grewal, Brandon Byars  on  Jan 04, 2013

Manvir Singh Grewal and Brandon Byars propose a business intelligence workflow along with Lean principles and practices for implementing a data warehouse and reporting capability.

Storm: Distributed and Fault-Tolerant Real-time Computation

Posted by Nathan Marz  on  Jan 04, 2013

Nathan Marz introduces Twitter Storm, outlining its architecture and use cases, and takes a look at future features to be made available.

Extending the Enterprise Data Warehouse with Hadoop

Posted by Rob Lancaster  on  Dec 27, 2012

Rob Lancaster explains the steps made by Orbitz in order to bridge the gap between their data in the data warehouse and the data in Hadoop.

Big Data Problems in Monitoring at eBay

Posted by Bhaven Avalani, Yuri Finklestein  on  Dec 21, 2012

Bhaven Avalani and Yuri Finklestein discuss 4 aspects encountered at eBay when dealing with monitoring data: reduction of data entropy, robust data distribution, metric extraction, efficient storage.

100% Big Data, 0% Hadoop, 0% Java

Posted by Pavlo Baron  on  Dec 20, 2012

Pavlo Baron presents a big data case, a solution and the tools for collecting, mining and storing large amounts of data without using Hadoop or Java.

NoSQL: Past, Present, Future

Posted by Eric Brewer  on  Dec 20, 2012 1

Eric Brewer takes a look at NoSQL’s history and considers what should be done so the current NoSQL solutions to evolve in order to address the full range of the application needs.

Big Data, Small Computers

Posted by Cliff Click  on  Dec 20, 2012

Cliff Click discusses RAIN, H2O, JMM, Parallel Computation, Fork/Joins in the context of performing big data analysis on tons of commodity hardware.

Introducing Apache Hadoop: The Modern Data Operating System

Posted by Eli Collins  on  Dec 18, 2012 2

Eli Collins introduces Hadoop: why it came about, the benefits it produces, its history, its architecture, use cases and applications.

General Feedback
Bugs
Advertising
Editorial
InfoQ.com and all content copyright © 2006-2013 C4Media Inc. InfoQ.com hosted at Contegix, the best ISP we've ever worked with.
Privacy policy
BT