You are now in FULL VIEW

Exploring Wikipedia with Apache Spark: A Live Coding Demo
Recorded at:

| by Sameer Farooqui Follow 0 Followers on Aug 23, 2016 |

Sameer Farooqui demos connecting to the live stream of Wikipedia edits, building a dashboard showing what’s happening with Wikipedia datasets and how people are using them in real time.

Sponsored Content


Sameer Farooqui is a Technology Evangelist at Databricks where he focuses on enabling Spark deployments via tech support, consulting and training. Before that, Sameer was a Systems Architect at Hortonworks and an Enterprise Solutions Specialist at Symantec. He is also a regular speaker at various big data conferences such as Strata + Hadoop World, Cassandra Summit and Big Data Tech Con.

Chariot Solutions is a software development consulting firm. We build and integrate the critical software applications that run our clients’ businesses. We are successful because we attract the most talented and collaborative software architects in the region. They are leaders in Java, open source and emerging technologies. We work in small, agile teams. We solve hard problems with a practical approach centered on communication, common sense and continual learning. We believe it is important to give back to our community through shared learning.

Login to InfoQ to interact with what matters most to you.

Recover your password...


Follow your favorite topics and editors

Quick overview of most important highlights in the industry and on the site.


More signal, less noise

Build your own feed by choosing topics you want to read about and editors you want to hear from.


Stay up-to-date

Set up your notifications and don't miss out on content that matters to you