Data Science

Data Preparation Pipelines: Strategy, Options and Tools

by Srini Penchikala on Apr 16, 2017

Data preparation is an important aspect of data processing and analytics use cases. Business analysts and data scientists spend about 80% of their time gathering and preparing data rather than analyzing it or developing machine learning models. Kelly Stirman spoke last week at the Enterprise Data World 2017 Conference about data preparation best practices.

VMware Releases SQLFire 1.0

by Kostis Kapelonis on Jan 31, 2012

VMware releases SQLFire 1.0, a distributed SQL database geared towards high availability and horizontal scalability that offers table replication, table partitioning, and parallel execution of queries.

JBoss Releases Hibernate 4.0

by Kostis Kapelonis on Jan 18, 2012

JBoss releases Hibernate 4.0, which comes with multi-tenancy support, a standard mechanism for writing Hibernate extensions, initial refactorings towards OSGi, and several other cleanups.

Windows Azure Gets Node.js, SQL Azure Federation, Increased DB Limits

by Roopesh Shenoy on Dec 14, 2011

The Windows Azure team announced major updates, including support for Node.js, better scalability for SQL Azure through Federation, higher individual database size limits (up to 150 GB), a limited preview of Hadoop, and more.

Facebook on Hadoop, Hive, HBase, and A/B Testing

by Ron Bodkin on Jul 14, 2010

The Hadoop Summit of 2010 included presentations from a number of large-scale users of Hadoop and related technologies. Notably, Facebook presented a keynote and detailed information about its use of Hive for analytics. Mike Schroepfer, Facebook's VP of Engineering, delivered the keynote, describing the scale of the company's data processing with Hadoop.

Databases Roundup: Data Sharding for ActiveRecord and Faster Postgres IO

by Mirko Stocker on Jul 21, 2008

In this databases roundup we take a look at DataFabric, FiveRuns' recently open-sourced data sharding plug-in for ActiveRecord. Also: a look at speeding up Postgres data access using the asynchronous client API and Ruby 1.9's Fibers.
