InfoQ Homepage Infrastructure Content on InfoQ
-
Predicting Movie Ratings: NLP Tools is What Film Studios Need
In this article, the author discusses how to use Natural Language Processing (NLP) techniques to predict the movie ratings using the data shared on social media platforms. Sentiment analysis of movie reviews can also be used to classify movies into different genres and to improve the movie recommendation systems.
-
Serverless Takes DevOps to the Next Level
Serverless doesn’t only supplement DevOps, but it goes beyond the current thinking on how IT organisations can achieve greater business agility. It’s geared towards the rapid delivery of business value and continuous improvement and learning, and as such has clear potential to drive substantial cultural change, even in organisations that have adopted DevOps culture and practices already.
-
From Alibaba to Apache: RocketMQ’s Past, Present, and Future
Feng Jia and Wang Xiaorui share the core distributed systems principals behind RocketMQ, Alibaba's distributed messaging and data streaming platform now open sourced through the Apache Foundation.
-
Building Pipelines for Heterogeneous Execution Environments for Big Data Processing
The Pipeline61 framework supports the building of data pipelines involving heterogeneous execution environments. It reuses the existing code of the deployed jobs in different environments and provides version control and dependency management that deals with typical software engineering issues. A real-world case study shows its effectiveness.
-
Introducing Reladomo - Enterprise Open Source Java ORM, Batteries Included!
Goldman Sachs is widely known as a leader in investment banking, but they are very much a leading technology firm as well. Reladomo is the primary Java ORM used at GS, and it is now open source. In this article GS Technology Fellow, Mohammad Rezaei, takes us on a deep dive into Reladomo.
-
Big Data Processing Using Apache Spark - Part 6: Graph Data Analytics with Spark GraphX
In this article, author Srini Penchikala discusses Apache Spark GraphX library used for graph data processing and analytics. The article includes sample code for graph algorithms like PageRank, Connected Components and Triangle Counting.
-
Three Experts on Big Data Engineering
Clemens Szyperski (Microsoft), Martin Petitclerc (IBM), and Roger Barga (Amazon Web Services) answer three questions: What major challenges do you face when building scalable, big data systems? How do you address these challenges? Where should the research community focus its efforts to create tools and approaches for building highly reliable, scalable, big data systems?
-
Learning Paths: QCon London Expert Recommendations
Advice on the best talks to attend at QCon London 2017 from London Thought Leaders.
-
Q&A with Immuta on the Implications of EU’s General Data Protection Regulation (GDPR)
InfoQ talked with Immuta’s Andrew Burt and Steve Touw, to better understand the implications and challenges of the EU's Global Data Protection Regulation, which will come into effect in May 2018.
-
Cassandra: The Definitive Guide, 2nd Edition Book Review and Interview
Cassandra: The Definitive Guide, 2nd Edition book authored by Jeff Carpenter and Eben Hewitt covers the Cassandra NoSQL database version 3.0. Authors discuss several different important topics related to this popular database, including data modeling and Cassandra architecture. InfoQ spoke with Jeff Carpenter about the book and Cassandra database current features and future roadmap.
-
Article Series: Getting a Handle on Data Science as a Software Developer
Software developers and managers are realizing that they need data science among their skills, to be able to tackle pressing problems. In this series, field experts provide guidance to help us navigate among the available data analysis options. They explore ways of understanding where data science is needed and where it’s not, and how to turn it into an asset.
-
The Container Landscape: Docker Alternatives, Orchestration, and Implications for Microservices
The orchestration of containers is key for success, and various technologies are competing for market share. This article examines the current tooling and how this relates to deploying microservices. A key takeaway is that developers should create business logic of their microservices using a vendor -and platform- agnostic approach.