Accueil InfoQ Apache Spark sur InfoQ
Présentations
Flux RSS-
Ramping up your DevOps-fu for big data developers
Reproducible setups for test & deployment are hard. Harder on a cluster. This talk presents lessons learned making a Spark distribution.
-
Couchbase avec Kafka et Hadoop
Couchbase accommodates fast access to user profiles at scale while leveraging Kafka to stream data to Hadoop for deep analytics.
-
Anomaly Detection with Apache Spark
A Gentle Introduction to Apache Spark and Clustering for Anomaly Detection.
-
Spark Streaming As Near Real Time ETL…and beyond !
L’objectif de cette session est de présenter Spark Streaming, les gênes communs avec Spark et les cas d’utilisation possibles.
-
Apache Spark : a practical feedback after implementing a data analysis workflow
Within a few months, we have rewritten the complete workflow for a data analysis engine: eXenGine. We'll give our feedback about using Apache Spark for implementing a matrix factorization method.