Using NLP to Categorize and Find Similar Web Pages



Thomas Levi shows how to categorize web pages by building a system that exploits techniques in natural language processing and topic modeling.


Thomas Levi started out with a PhD in Theoretical Physics and String Theory before moving into industry, first as Senior Data Scientist at PlentyOfFish and then, in 2015, as Director of Data Science at Unbounce. He has worked on behavior analysis, social network analysis, scam detection, bot detection, matching algorithms, topic modeling, and semantic analysis.

About the conference

Managing Big Data has become a major competitive advantage for many organizations, so maintaining a proper analytics platform is vital to an organization's survival. This conference offers insights and potential solutions to Big Data problems from well-known experts and thought leaders through panel sessions and open Q&A sessions.

Recorded at:

Oct 30, 2016
