Big Data Architecture at LinkedIn
In this interview at QCon London, LinkedIn's Sid Anand discusses the problems they face when serving high-traffic, high-volume data. Sid explains how they're moving some use cases from Oracle to gain headroom, and lifts the hood on their open source search and data replication projects, including Kafka, Voldemort, Espresso and Databus.
Big Data Architectures at Facebook
Ashish Thusoo presents the data scalability issues at Facebook and the data architecture evolution from EDW to Hadoop to Puma.
Future of Data Architecture: NoSQL, Big Data, Linked Data and the Cloud
In this panel, Siddharth Anand, Dwight Merriman, Ashish Thusoo, Damien Katz, Tom Wilkie and Akmal Chaudhri (moderator) answer questions on NoSQL from the audience.
With Spring Data, the ever-popular Spring Framework has cultivated a new patch of ground, bringing Big Data and NoSQL technology like Neo4j to enterprise developers. This guide introduces you to Spring Data Neo4j, using the fast, powerful and scalable graph database Neo4j to enjoy the benefits of having good relationships in your data.
NetApp Case Study
Kumar Palaniapan and Scott Fleming present how NetApp deals with big data using Hadoop, HBase, Flume, and Solr, collecting and analyzing TBs of log data with Think Big Analytics.
Eli Collins on Hadoop
Eli Collins discusses Cloudera's CDH4 release, which tasks are well suited for Hadoop, Hadoop and MapReduce vs SQL, the state of Hadoop, and much more.
eBook: Understanding Big Data
By the time you are done reading this book, you'll have a good handle on the big data opportunity that lies ahead, a better understanding of the requirements that ensure you have the right big data platform (as opposed to just technology), and a strong foundational knowledge of the business opportunities that lie ahead with big data and some of the technologies available.
Analyzing social media and structured data with InfoSphere BigInsights
This article teaches you the basics of using BigSheets to analyze social media and structured data collected through sample applications provided with BigInsights. You'll learn how to model this data in BigSheets, manipulate this data using built-in macros and functions, create charts to visualize your work, and export the results of your analysis in one of several popular output formats.
Get started with Hadoop-based data analytics on IBM SmartCloud Enterprise
The combination of cloud and Hadoop makes it possible to handle large amounts of structured and unstructured data. In this article, the author explains how to get started using Hadoop (in the form of InfoSphere® BigInsights Basic) on IBM® SmartCloud Enterprise. Learn how to set up a three-node cluster and verify that your cluster is working.
Query social media and structured data with InfoSphere BigInsights
If you're looking to get off to a quick start with big data projects involving IBM® InfoSphere® BigInsights™, learning the basics of how to query, manipulate, and analyze your data is important. This article takes you through simple query examples that show how you can read, write, filter, and refine social media and structured data. You'll even see how business analysts can visualize query results using a spreadsheet-style tool.
Starting your education in big data
Learn about current free, online courses that are available at BigDataUniversity.com. Follow the suggested path to get started and grow your knowledge on big data, and get ready to implement it to meet your own business needs.
IBM InfoSphere Streams: Assembling Continuous Insight in the Information Revolution
In this IBM® Redbooks® publication, we discuss and describe the positioning, functions, capabilities, and advanced programming techniques for IBM InfoSphere™ Streams (V2), a new paradigm and key component of the IBM Big Data platform.