This post talks about the agile implementation in data warehouse projects.
Standardizing on common models for business objects that are exchanged within an enterprise, e.g. Customer, Order and Product together with the attributes and associations they have, might seem compelling but for Stefan Tilkov this creation of Canonical Data Models (CDMs) is a horrible idea which he strongly advices against.
The latest version of Oracle NoSQL Database supports tabular data model, secondary indexing, security enhancements via Oracle Wallet integration, and data center enhancements. Oracle recently announced the release of version 3.0 of the distributed key-value database.
Arun Kejariwal, from Twitter, talked at Velocity Conf London last month about forecasting algorithms used at Twitter to proactively predict system resource needs as well as business metrics such as number of users or tweets. Given the dynamic nature of their data stream, they found that a refined ARIMA model works well once data is cleansed, including removal of outliers.