InfoQ

InfoQ

Editor Specific Content View

All of Boris Lublinsky, Mike Segel's Content on InfoQ


Latest featured content by Boris Lublinsky, Mike Segel

Integrating Lucene with HBase

Topics
HBase,
Columnar Databases,
Database,
Search,
Lucene

The article describes overall design and implementation of integrating Lucene search library with HBase back end. It describes integration architecture, implementation and HBase tables design

Articles by Boris Lublinsky, Mike Segel

Uncovering mysteries of InputFormat: Providing better control for your Map Reduce execution.

Topics
Big Data,
Clusters,
Clustering & Caching,
Database Design,
Performance & Scalability,
Infrastructure,
Database,
MapReduce

In their article authors, Boris Lublinsky and Mike Segel, show how to leverage custom InputFormat class implementation to tighter control execution strategy of Maps in Hadoop Map Reduce jobs.

Extending Oozie

Topics
Business Process Management,
Big Data,
Java,
Business,
SOA,
Enterprise Architecture,
Database Design,
Languages,
Architecture,
Database,
Programming,
Hadoop

In this article authors show how leverage Oozie extensibility to implement custom language extensions. This approach can be viewed a specializing workflow language for a given company/line of business.

Oozie by Example

Topics
Big Data,
Java,
Business Process Management,
Business,
Languages,
SOA,
Database Design,
Enterprise Architecture,
Programming,
Architecture,
Database,
Hadoop

End to end Oozie example, including process design, resource coordinator and workflow implementation