InfoQ Homepage Presentations Samza: Real-time Stream Processing at LinkedIn
Samza: Real-time Stream Processing at LinkedIn
Summary
Chris Riccomini discusses: Samza's feature set, how Samza integrates with YARN and Kafka, how it's used at LinkedIn, and what's next on the roadmap.
Bio
Chris Riccomini is a Staff Software Engineer at LinkedIn, where he's is currently working as a committer and PMC member for Apache Samza. He's been involved in a wide range of projects at LinkedIn, including, "People You May Know", REST.li, Hadoop, engineering tooling, and OLAP systems. Prior to LinkedIn, he worked on data visualization and fraud modeling at PayPal.
About the conference
Software is Changing the World. QCon empowers software development by facilitating the spread of knowledge and innovation in the developer community. A practitioner-driven conference, QCon is designed for technical team leads, architects, engineering directors, and project managers who influence innovation in their teams.
Community comments
Re-processing
by peter lin,
Re-processing
by peter lin,
Your message is awaiting moderation. Thank you for participating in the discussion.
The term in the stream processing space is "back testing", where you take historical data and run it against a new model. There's hundreds of papers in this space, people should study the domain to see what's been done to avoid making common mistakes. One common challenge with stream and event processing is temporal data and temporal logic.