This article provides an overview of tools and libraries available for embedded data analytics and statistics, both stand-alone software packages and programming languages with statistical capabilities. The authors also discuss how to combine and integrate these embedded analytics technologies to handle big data.
In this article, authors discuss the role of big data and Hadoop in security analytics space and how to use MapReduce to process data for security analysis.
Apache Samza is a stream processor LinkedIn recently open-sourced. Chris Riccomini shares Samza's feature set, how it integrates with YARN and Kafka, how it's used at LinkedIn and more.
There has been a raging debate about PaaS and whether it is still a valuable part of a cloud portfolio, so InfoQ reached out to 4 leaders in the cloud domain for their opinions on the future of PaaS.
How to use various tools such as Apache Avro, Apache Crunch, Cloudera ML and the Cloudera Development Kit to build applications that use Hadoop.
ActiveJPA is a Java implementation of Martin Fowler’s Active Record over JPA and provides abstractions to simplify data access. In this article the primary committer illustrates ActiveJPA usage 7
Writing Java software that correctly makes use of concurrency requires careful thought. Static analysis tools can analyze source code or compiled bytecode to discover latent bugs hidden within code. 1
The new book Real World OCaml provides a deep dive into the OCaml programming language. InfoQ caught up with the authors to hear about real world Ocaml users, its unique features, and much more.
This article we focuses on the database backup tools and security policies when deploying MongoDB NoSQL databases. Cloud backup with MongoDB Management Service (MMS) is covered.