Facilitating the Spread of Knowledge and Innovation in Professional Software Development

Write for InfoQ


Choose your language

InfoQ Homepage Presentations Docker Data Science Pipeline

Docker Data Science Pipeline



Lennard Cornelis explains why they chose OpenShift and Docker to connect to the Hadoop environment, and also how to set up a Docker container running a data science model using Hive, Python, and Spark.


Lennard Cornelis is a senior big data engineer who has a great passion for technology. He is a hands-on person and who loves to solve difficult and challenging problems. Knowledge sharing is very important to him as he loves the role of mentoring colleagues.

About the conference

Big Data Conference Vilnius is a three-day conference with technical talks in the fields of Big Data, High Load, Data Science, Machine Learning and AI.Conference brings together developers, IT professionals and users to share their experience, discuss best practices, describe use cases and business applications related to their successes.

Recorded at:

May 18, 2019