InfoQ

Data Analysis Content on InfoQ


Latest featured content about Data Analysis

Ron Bodkin on Big Data and Analytics

Topics
Map-Reduce,
Machine Learning,
Operations,
Big Data,
Architecture

Ron Bodkin discusses big data architecture, real-time analytics, batch processing, map-reduce, and data science.

News about Data Analysis

Can SAP HANA boost Real-time Data Analytics?

Topics
SaaS,
Big Data,
Platforms,
Real Time,
Cloud Computing,
Data Access

In a press release dated 13 December, SAP announced at the SAP Influencer Summit in Boston that “leading software vendors are adopting the open SAP HANA platform for their existing products and building completely new applications.” Among them are companies such as T-Mobile and TIBCO.

Making Sense of the Social Web with Microsoft Social Analytics (Vancouver)

Topics
REST,
Cloud Computing,
Architecture

Microsoft is making available a cloud service called Social Analytics for users who want to analyze Twitter, Facebook, Blogger, YouTube, and similar sites to gain insight into trends on the social web.

MIT introduces Oracle for Object-Oriented Programmers

Topics
Machine Learning,
Extensibility,
Open Source,
Programming,
Technology,
Tools,
Code Analysis

In a recent news article, the Massachusetts Institute of Technology introduced a technology for automatically remembering connections between objects. The system determines how objects in a large software project interact, so it can tell latecomers which objects they will need in order to design certain types of functions.

Column-based Storage in SQL Server 2011

Topics
.NET,
Database Design,
SQL Server,
Data Access,
Data Warehousing

Imagine ad hoc data mining queries against a single table with 1 TB of data and 1.44 billion rows coming back in roughly a second. This is the scenario Microsoft intends to support using 32-core machines and its new column-based storage engine.

Presentations about Data Analysis

NoSQL at Twitter

Topics
NoSQL,
Architecture

Kevin Weil presents how Twitter does data analysis using Scribe for logging, base analysis with Pig/Hadoop, and specialized data analysis with HBase, Cassandra, and FlockDB.

Machine Learning: A Love Story

Topics
Machine Learning,
Architecture

Hilary Mason presents the history of machine learning, covering some of the most significant developments of the last two decades, especially the fundamental math and algorithmic tools employed. She also shows how bit.ly uses machine learning to discover various statistical information about its users.

Facebook’s Petabyte Scale Data Warehouse using Hive and Hadoop

Topics
Architecture,
Data Warehousing,
Performance & Scalability

Ashish Thusoo and Namit Jain explain how Facebook handles 12 TB of new compressed data every day with Hive’s help. Hive is an open source data warehousing framework built on Hadoop that lets developers analyze large datasets using SQL.

Interviews about Data Analysis

Ilya Grigorik on Tokyo Cabinet, MySQL and Ruby HTTP Performance

Topics
Dynamic Languages,
Data Access,
Ruby,
Deployment / Datacenter,
Database Design,
Performance & Scalability,
Architecture

Ilya Grigorik discusses his company's PostRank algorithm for tracking reader engagement with content. He also covers his experience scaling MySQL, Tokyo Cabinet, Ruby HTTP libraries, Solr, Amazon EC2, and more.