Facilitating the spread of knowledge and innovation in professional software development



Choose your language

InfoQ Homepage Presentations Machine Learning and End-to-End Data Analysis Processes in Spark Using Python and R

Machine Learning and End-to-End Data Analysis Processes in Spark Using Python and R



Debraj GuhaThakurta discusses ML and data analysis processes in Spark using examples written in Python and R.


Debraj GuhaThakurta is a Senior Data Scientist in Microsoft’s Azure Machine Learning group, focusing on platforms and toolkits, such as Microsoft’s Cortana Analytics suite, R Server, SQL Server, Hadoop and Spark clusters, for creating scalable and operationalized analytical processes for various business problems. He has published more than 25 peer-reviewed papers, book-chapters and patents.

About the conference

Managing Big Data has become a major competitive advantage for many organizations and hence maintaining a proper analytics platform is vital for an organization's survival. This conference provides insights and potential solutions to address Big Data issues from well known experts and thought leaders through panel sessions and open Q&A sessions.

Recorded at:

Feb 05, 2017

Hello stranger!

You need to Register an InfoQ account or or login to post comments. But there's so much more behind being registered.

Get the most out of the InfoQ experience.

Allowed html: a,b,br,blockquote,i,li,pre,u,ul,p

Community comments

Allowed html: a,b,br,blockquote,i,li,pre,u,ul,p

Allowed html: a,b,br,blockquote,i,li,pre,u,ul,p


Is your profile up-to-date? Please take a moment to review and update.

Note: If updating/changing your email, a validation request will be sent

Company name:
Company role:
Company size:
You will be sent an email to validate the new email address. This pop-up will close itself in a few moments.