InfoQ Homepage Database Content on InfoQ
-
A Research Agenda and Vision for Big Data at NASA
Chris Mattmann covers snow hydrology, regional climate modeling, climate science, and intelligence activities that need advancement to deal with the data deluge across NASA and government agencies.
-
A Call for Sanity in NoSQL
Nathan Marz discusses building NoSQL-based data systems that are scalable and easy to reason about.
-
Next Gen Hadoop
Akmal B. Chaudhri introduces Apache™ Hadoop® 2.0 and Yet Another Resource Negotiator (YARN).
-
What Can Hadoop Do for You?
Eva Andreasson presents typical categories of problems that are commonly solved using Hadoop and also some concrete examples in each category.
-
Let Me Graph That For You
Ian Robinson discusses graphs data structures, some of the queries that can extract data from them, and tools and techniques to work with graph data.
-
Design Patterns for Large-Scale Real-Time Learning
Sean Owen provides examples of operational analytics projects, presenting a reference architecture and algorithm design choices for a successful implementation based on his experience Oryx/Cloudera.
-
Revealing the Uncommonly Common with Elasticsearch
Mark Harwood shows how anomaly detection algorithms can spot card fraud, incorrectly tagged movies and the UK's most unexpected hotspot for weapon possession.
-
Haskell in the Newsroom
Erik Hinton discusses the successes and failures of making a cultural shift in the newsroom at NYT to accept Haskell and some of the projects Haskell has been used for.
-
Sync is the Future of Mobile Data
Chris Anderson provides code samples on how to build offline applications for mobile platforms based on the NoSQL document model, and how to contribute to the open source projects behind this movement
-
Excel Coding Errors Are Destroying World Economies and F# (with Tsunami) Is Here to Stop Them!
Matthew Moloney discusses using F# and .NET inside Excel, demonstrating doing big data, cloud computing, using GPGPU and compiling F# Excel UDFs.
-
An API for Distributed Computing
Cliff Click introduces a coding style & API for in-memory analytics that handles datasets from 1K to 1TB without changing a line of code and clusters with TB of RAM and hundreds of CPUs.
-
How Developers Can Treat Ovarian Cancer
Mridula Jayaraman shares from her experience developing a next generation sequencing solution used to customize cancer treatment based on patient's genetic makeup.