BT
Vous êtes désormais en PLEIN ECRAN
QUITTER LE PLEIN ECRAN

Apache Spark : a practical feedback after implementing a data analysis workflow
Enregistré à :

| par Guillaume Pitel le 09 mai 2014 |
  • Voir la Présentation
  •  
  •  
  •  
41:43

Résumé
Within a few months, we have rewritten the complete workflow for a data analysis engine: eXenGine. We'll give our feedback about using Apache Spark for implementing a proprietary matrix factorization method and analyzing Wikipedia for textual content, links and meta-data. Focus will be on the nice things we have found about Spark.

Bio

Founder, Chief Scientist @ eXenSa : #recsys and #textmining for #BigData. #MachineLearning, Startups. http://blog.guillaume-pitel.fr Paris · wikinsights.org

Let's get together and chat about machine-learning, natural language processing, large scale data analytics using open source tools such as Hadoop MapReduce, Shark, NoSQL databases, the semantic web and linked data.

BT