InfoQ Homepage Database Content on InfoQ
-
Julien Nioche on Apache Nutch 2 Features and Product Roadmap
Open source web-search framework Apache Nutch version 2 supports large scale crawling, link-graph database and HTML parsing. InfoQ spoke with Julien Nioche, VP of Apache Nutch project, about the framework new features and its future roadmap.
-
Blueprint for a Big Data Solution
In his new article Jonathan Natkins explains how to use components of Apache Hadoop, including Flume, Hive and Oozie to implement a typical Data management system. He also gives a practical example of such architecture to measure Twitter user’s influence.
-
arrayDB, a New and Easy PHP ORM
Most ORM libraries make you write a new class for each item you want to keep in the database. Extending this and that for no apparent reason is repetitious and capricious. Using techniques that can be applied to most languages, arrayDB looks at simplifying the whole process by doing away with statically defined classes.
-
Inside the Complexity of Delivering Cloud Computing
There's a lot more to cloud computing than meets the eye. This article presents an insider's view on what really is entailed in designing and deploying a cloud-based solution.
-
Hadoop and Metadata (Removing the Impedance Mis-match)
A new Apache HCatalog project is a table and storage management layer for Hadoop that enables different data processing tools – Pig, MapReduce, and Hive – to more easily inter-operate data. HCatalog’s presents users with a relational view of data and ensures that users need not worry about where or in what format their data is stored – RCFile format, text files, or sequence files.
-
Transitioning from RDBMS to NoSQL. Interview with Couchbase’s Dipti Borkar
While relational databases have been used for decades to store data, and they still represent a viable solution for many use cases, NoSQL is being chosen today especially for scalability and performance reasons. This article contains an interview with Dipti Borkar, Director of Product Management at Couchbase, on the challenges, benefits and the process of migrating from RDBMS to NoSQL.
-
What is CouchDB and Why Should I Care?
CouchDB, a NoSQL solution, is a document-oriented database and within each document fields are stored as key-value maps. CouchDB has some unique characteristics like advanced replication. This article covers getting started, unit testing, CRUD and query operations.
-
Virtual Panel: NoSQL Database Patterns and Polyglot Persistence
NoSQL database space has different databases that support different data storage patterns. InfoQ spoke with four panelists about the current state of NoSQL adoption, architecture patterns supported by different NoSQL databases, polyglot persistence and security aspects when using NoSQL databases.
-
Implementing Aggregation Functions in MongoDB
In this article, authors Arun Viswanathan and Shruthi Kumar discuss how to implement common aggregation functions on a MongoDB document database using its MapReduce functionality. They also discuss a typical application of aggregations which includes business reporting of sales data.
-
An Interview with Rebecca Parsons - Thoughtworks CTO
Dr Rebecca Parsons is the Chief Technology Officer for Thoughtworks, and a Director of the Agile Alliance. At the Agile Australia conference in Melbourne she discussed the trends in database technology around NoSQL databases, the linking of continuous design with continuous delivery and the impact of the mainstream adoption of agile practices on the Agile Alliance.
-
Testing SQL Server Code with TST
Automated Testing (unit/integration) is an integral part of any agile development process. However a project with significant logic housed in database code creates severe constraints to writing unit level tests, especially if it is large, complex and depend on data. We will explore the TST framework and a few ideas for writing and maintaining good tests for database code.
-
Data Modeling: Sample E-Commerce System with MongoDB
The rich document capabilities and atomic operation guarantees in MongoDB makes it possible to model many different applications. Even rigorous requirements of conventional applications like e-commerce system are possible in a document database. This data model (i.e. "schema design,") is useful for developing applications around any restricted resource system, not just e-commerce systems.