InfoQ

InfoQ

Topic/Tag specific view

Data Warehouse Content on InfoQ


Latest featured content about Data Warehouse

Hadoop and NoSQL in a Big Data Environment

Topics
NoSQL,
Data Access,
Design Pattern,
Agile,
Big Data,
Database Design,
Performance & Scalability,
Data Warehousing

Ron Bodkin of Big Data Analytics discusses early adoption of Hadoop, NoSQL and big data technologies. He discusses common patterns and explains how developers can write low-level primitives to optimize MapReduce function. Other topics include Hive, Pig, multi tenancy, and security.

Data Mining in the Swamp: Taming Unruly Data With Cloud Computing

Topics
Business,
Cloud Computing,
Architecture

Matrix presents a white paper on using the open source tool, Hadoop, to implement the MapReduce strategy and a Cloud computing strategy to solve business intelligence problems.

Randy Shoup on Evolvable Systems

Topics
Operations,
Data Access,
Deployment / Datacenter,
Database Design,
Data Warehousing,
Architecture,
Data Portability,
Event Driven Architecture

Randy Shoup discusses evolvable systems: how to run different versions of a system in parallel during migrations, decoupling a system with events, schemas at eBay and much more.

News about Data Warehouse

Better Developer Experience in Version 1.5 of the Data Access Framework MetaModel

Topics
Open Source,
Data Access,
Data Warehousing,
Architecture

Eobject.org's open-source Java framework MetaModel implements a unified API for the access, exploration, and query of different datastores. Eobjects.org, both a website and an open source software organization dedicated to "the development of Open Source software related to Business Intelligence and Data Warehousing", has recently published version 1.5 of MetaModel.

Facebook on Hadoop, Hive, HBase, and A/B Testing

Topics
Operations,
Data Access,
Deployment / Datacenter,
Database Design,
Performance & Scalability,
Data Warehousing,
Architecture

The Hadoop Summit of 2010 included presentations from a number of large scale users of Hadoop and related technologies. Notably, Facebook presented a keynote and details information about their use of Hive for analytics. Mike Schroepfer, Facebook's VP of Engineering delivered a keynote describing the scale of their data processing with Hadoop.

Mahout 0.3: Open Source Machine Learning

Topics
Machine Learning,
Java,
Enterprise Architecture,
Architecture,
Data Warehousing

The need for machine-learning techniques like clustering, collaborative filtering, and categorization has steadily increased the last decade along with the number of solutions needing quick and efficient algorithms to transform vast amounts of raw data into relevant information. Apache Mount 0.3 has been announced on March, adding more functionality, stability and performance.

Event Stream Processing: Scalable Alternative to Data Warehouses?

Topics
Enterprise Architecture,
Events,
Data Warehousing,
Architecture

Dan Pritchett suggests that analyzing streams of events using Event Stream Processor could be an interesting alternative solution to data warehousing applications, which have, in his opinion, important downsides in terms of cost, scalability and reactivity.

Agile Business Intelligence

Topics
Agile in the Enterprise,
Agile,
Enterprise Architecture,
Data Warehousing

Large centrally designed BI systems often don't meet the expectations of their end users. In this article at Cutter IT journal Scott Ambler has written about using Agile methods to help meet the user's expectations and deliver business value quickly.

Michael Stonebraker: Major RDBMSes are legacy technology

Topics
Java,
.NET,
Data Access,
Architecture,
Data Warehousing,
Ruby

Michael Stonebraker, co-founder of the Ingres and Postgres relational database management systems (RDBMS) and CTO of Vertica Systems, laid the framework for a debate in the database community by declaring that most major databases should be considered legacy technology.