-
AI, ML and Data Engineering
Apache Beam Interview with Frances Perry
InfoQ interviews Apache Beam's Frances Perry about the impetus for using Beam and the future of the top-level open source project. The interview also covers the thinking behind the programming model and some of the integration touch-points with other data engineering tools such as Apache Spark and Flink.
-
From Alibaba to Apache: RocketMQ’s Past, Present, and Future
Feng Jia and Wang Xiaorui share the core distributed systems principles behind RocketMQ, Alibaba's distributed messaging and data streaming platform, now open-sourced through the Apache Foundation.
-
Article Series: Getting a Handle on Data Science
In this series we explore ways of making sense of data science - understanding where it’s needed and where it’s not, and how to make it an asset for you, from people who’ve been there and done it.
Case Study: Selecting Big Data and Data Science Technologies at a large Financial Organisation
Adopting Big Data and Data Science technologies in an organisation is a transformative project, similar to an agile transformation and with many of the same challenges.
Q&A: Relevant Search with Elasticsearch and Solr
Bridging Microsoft Word and the Browser
HTML editors work fine for general formatting, but they don’t have all the capabilities that some businesses require. In this article, Prasadu Babu Dandu shows how to convert Word documents to HTML.
What is Apache Tez?
Bikas Saha and Arun Murthy discuss Tez’s design, highlight some of its features and share some of the initial results obtained by making Hive use Tez instead of MapReduce.
How LinkedIn Uses Apache Samza
Apache Samza is a stream processor LinkedIn recently open-sourced. Chris Riccomini shares Samza's feature set, how it integrates with YARN and Kafka, how it's used at LinkedIn and more.