AI, ML & Data Engineering

William McKnight on Data Platforms and Creating a Modern Data Architecture

by Srini Penchikala on Oct 15, 2018

William McKnight gave a keynote presentation last week at the Data Architecture Summit 2018 conference on creating a modern data architecture using different data platforms.

Big Data

Data Workflow Management Using Airbnb's Airflow

by Alex Giamas on Sep 08, 2015

Airbnb recently open-sourced Airflow, its own data workflow management framework. Airflow is used internally at Airbnb to build, monitor and adjust data pipelines. Airflow’s creator, Maxime Beauchemin, and Siddharth Anand, Agari’s Data Architect and one of the framework’s early adopters, discuss Airflow, where it can be of use, and future plans.


Snowflake Announces General Availability of their Cloud Data Warehouse Offering

by Benjamin Darfler on Jul 28, 2015

Snowflake Computing has announced the general availability of its Snowflake Elastic Data Warehouse, a software-as-a-service offering that provides a SQL data warehouse on top of Amazon Web Services.


Software Defined Data Mart In The Enterprise Using Metanautix Quest

by Alex Giamas on Jun 29, 2015

Metanautix recently announced the newest version of its product, Quest. Quest allows enterprises to build software-defined data marts that can run on virtualized servers. Designed from the ground up with security and auditability in mind, Quest can handle Big Data workloads and encapsulate them into different logical views, ready for consumption by different departments in the enterprise.


Implementing Agile in Data Warehouse Projects

by Savita Pahuja on Apr 14, 2015

This post talks about using an agile implementation for data warehouse projects.


Google unveils Mesa - Geo-Replicated Near-Realtime Scalable Data Warehouse

by Matt Kapilevich on Aug 19, 2014

Google has unveiled its new data warehouse, called Mesa. Mesa is a system that scales across multiple data centers and processes petabytes of data, while responding to queries in sub-second time and maintaining ACID properties.


Teradata Offers Data Warehouse as a Service as Part of Their Cloud Strategy

by Alex Giamas on Nov 18, 2013

Teradata has revamped its cloud offering, which now includes a Data Warehouse as a Service solution. Teradata Cloud aspires to become a worthy competitor to Amazon Redshift, with a richer set of predefined libraries and a more effective way of loading data.


Amazon Makes Compelling Case for Hosting and Processing Your Big Data

by Richard Seroter on Dec 03, 2012

The AWS team has announced a limited preview of Amazon Redshift, a cloud-hosted data warehouse whose cost and capabilities are poised to disrupt the industry. In addition, AWS revealed two new massive compute instance types, and a data integration tool called Data Pipeline.


Better Developer Experience in Version 1.5 of the Data Access Framework MetaModel

by Michael Stal on Feb 22, 2011

The open-source Java framework MetaModel implements a unified API for accessing, exploring, and querying different datastores. The organization behind it, both a website and an open source software organization dedicated to "the development of Open Source software related to Business Intelligence and Data Warehousing", has recently published version 1.5 of MetaModel.


Facebook on Hadoop, Hive, HBase, and A/B Testing

by Ron Bodkin on Jul 14, 2010

The Hadoop Summit of 2010 included presentations from a number of large-scale users of Hadoop and related technologies. Notably, Facebook presented detailed information about its use of Hive for analytics, and Mike Schroepfer, Facebook's VP of Engineering, delivered a keynote describing the scale of the company's data processing with Hadoop.


Mahout 0.3: Open Source Machine Learning

by Gilad Manor on Apr 19, 2010

The need for machine-learning techniques like clustering, collaborative filtering, and categorization has steadily increased over the last decade, along with the number of solutions needing quick and efficient algorithms to transform vast amounts of raw data into relevant information. Apache Mahout 0.3 was announced in March, adding more functionality, stability and performance.


Event Stream Processing: Scalable Alternative to Data Warehouses?

by Sadek Drobi on Oct 31, 2008

Dan Pritchett suggests that analyzing streams of events with an event stream processor could be an interesting alternative to data warehousing applications, which have, in his opinion, important downsides in terms of cost, scalability and reactivity.


Agile Business Intelligence

by Mark Levison on Jun 17, 2008

Large, centrally designed BI systems often don't meet the expectations of their end users. In an article in the Cutter IT Journal, Scott Ambler writes about using Agile methods to help meet users' expectations and deliver business value quickly.


Michael Stonebraker: Major RDBMSes are legacy technology

by Ryan Slobojan on Sep 07, 2007

Michael Stonebraker, co-founder of the Ingres and Postgres relational database management systems (RDBMS) and CTO of Vertica Systems, laid the framework for a debate in the database community by declaring that most major databases should be considered legacy technology.


ActiveWarehouse, a New Step for Enterprise Ruby

by Sebastien Auvray on Mar 28, 2007

ActiveWarehouse is a significant new plugin that makes it easier to build data warehouses in Rails.