Apache Spark is an open source big data processing framework built around speed, ease of use, and sophisticated analytics. In this article, Srini Penchikala talks about how the Apache Spark framework helps with big data processing and analytics through its standard API. He also discusses how Spark compares with traditional MapReduce implementations such as Apache Hadoop.
Current agile practices are far too narrowly focused on delivering code to users and customers. There is no systems-wide view of other stakeholders, of databases, or of anything other than the code. This article describes what ‘Evo’ is at its core, how it differs from other Agile practices, and why ‘done’ should mean ‘value delivered to stakeholders’.
DynamoDB is a NoSQL database service that aims to be easily managed, so you don't have to worry about administrative burdens such as operating and scaling. This article shows how to use Amazon DynamoDB to create a Mars Rover application. You can use the same concepts described in this article to build your own web application.
With architectures becoming wider and more concurrent, dealing with errors in production is harder than ever. In this article we investigate five techniques for finding and fixing bugs in production.
In part 2 of InfoQ's conversation with Gil Tene of Azul Systems, Gil discusses the company's latest engineering project, aimed at bringing better low-level memory layout control to Java.
Gil Tene of Azul Systems talks about shipping their Java 8 compatible release and the need for certified builds.
Oracle's Brian Goetz (Java Language Architect) and John Rose (JVM Architect) take us on a deep-dive discussion about some of the technologies and features being discussed for Java 9 and beyond.
During the holiday season we think about our families, our children, and their future. In this article, Yakov Fain discusses his new book "Java Programming for Kids" (now available as a free download).
The new book, The Practice of Cloud System Administration: Designing and Operating Large Distributed Systems, looks at a wide range of considerations for cloud-scale systems.
In this article, Monica Beckwith starts from the core Hadoop components and investigates the design of a highly available, fault-tolerant Hadoop cluster, adding security and data-level isolation.
This article summarizes the key takeaways and highlights from QCon San Francisco 2014 as blogged and tweeted by attendees.
GridGain announced that its In-Memory Data Fabric has been accepted into the Apache Incubator program as Apache Ignite. InfoQ spoke with Nikita Ivanov about the product becoming part of Apache.