AI, ML & Data Engineering Content on InfoQ
-
Exploring Hadoop OutputFormat
As more companies adopt Hadoop, its integration with other applications is becoming more important. A key to such integration is using an appropriate OutputFormat, which lets a job produce output data in the form best suited to other applications.
-
Virtual Panel: Security Considerations in Accessing NoSQL Databases
NoSQL databases offer an alternative to traditional relational databases for storing unstructured data. Though NoSQL databases have been getting a lot of attention lately, the security aspects of storing and accessing NoSQL data haven't been given much emphasis. This article focuses on security considerations and best practices for accessing NoSQL databases.
-
Uncovering mysteries of InputFormat: Providing better control for your Map Reduce execution.
In this article, authors Boris Lublinsky and Mike Segel show how to use a custom InputFormat class implementation to control the execution strategy of maps in Hadoop Map Reduce jobs more tightly.
-
Finding the Right Data Solution for Your Application in the Data Storage Haystack
With the recent NoSQL movement, several data storage solutions are available as alternatives to traditional relational databases. In this article, author Srinath Perera discusses the various data storage options and what to consider when choosing among them.
-
Extending Oozie
In this article, the authors show how to leverage Oozie's extensibility to implement custom language extensions. This approach can be viewed as specializing the workflow language for a given company or line of business.
-
Using Entity Framework to Successfully Target Multiple Databases
Yevhen Shchyholyev discusses some of the problems a developer may face when building an application with Entity Framework that interacts with Oracle, MySQL, PostgreSQL, or SQLite as well as SQL Server. The article is intended to be useful both to developers using one of these databases for the first time and to those who regularly interact with multiple databases.
-
Oozie by Example
An end-to-end Oozie example, including process design, resource coordinator, and workflow implementation.
-
Omnipresent Synching and Availability with Mobile Couchbase
Omnipresent synchronisation becomes easy when CouchDB is not only hosted as a server-side database but also embedded in mobile applications. Couchbase has made the NoSQL document database available on mobile platforms so that developers can benefit from its replication and offline capabilities while using the convenient, uniform HTTP-based protocol to interact with it locally or remotely.
-
Book Excerpt and Interview: Amazon SimpleDB Developer Guide
Boris Lublinsky interviews Prabhakar Chaganti and Rich Helms as part of a review of their new book, Amazon SimpleDB Developer Guide. The book provides a simple step-by-step guide on how to develop applications for Amazon SimpleDB in different programming languages, including Java, PHP, and Python.
-
Architecting TekPub - Moving from ASP.NET MVC to Ruby on Rails
TekPub is a web site devoted to developers, giving them a source of focused on-line training on topics ranging from Microsoft Entity Framework to writing your own blog engine using Ruby on Rails. It is an interesting case of a company that started on ASP.NET MVC and quickly moved to Ruby on Rails. We had the opportunity to talk with them about their technology turnaround.
-
Graph Databases, NOSQL and Neo4j
Peter Neubauer introduces graph databases, how they compare to RDBMSs, and where they stand in the NoSQL movement, followed by examples of using a graph database in Java with Neo4j.
-
NoSQL in the Enterprise
In this article, Sourav Mazumder explores what NoSQL databases are, how they fit into Enterprise IT, the challenges facing enterprise adoption, how to choose the appropriate NoSQL database for a given application, a short list of NoSQL databases which are likely to be good matches for enterprise applications, and advice for how to adopt NoSQL databases within an enterprise.