AI, ML & Data Engineering Content on InfoQ
-
JVMs Across the Data Center and Twitter's JDK
The Twitter Sponsored Solutions track at QCon SF 2016 features an engineering talk on JVMs Across the Data Center and unveils an in-house OpenJDK fork, the Twitter-JDK, which Twitter notes could be open-sourced or released to the broader public.
-
Google Details Allo Recommendation Graph Processing Algorithm
Google details a graph streaming algorithm designed for constant runtime over large graphs of varying complexity and predictor outputs.
-
Microsoft Releases Data Science Tools for Interactive Data Exploration and Modeling
Microsoft recently released two new data science tools for interactive data exploration, modeling, and reporting. Data science teams can reuse these tools for data-specific tasks in their projects. The goal is to ensure that data science tasks are performed consistently and completely across different projects in the organization.
-
Microservices and Stream Processing Architecture at Zalando Using Apache Flink
Javier Lopez and Mihail Vieru spoke at the Reactive Summit 2016 Conference about a cloud-based data integration and distribution platform used for stream processing in business intelligence use cases. Their solution is based on technologies such as Flink, Kafka and Elasticsearch.
-
Google Machine Learning Models for Image Captioning Ported to TensorFlow and Open-Sourced
As TensorFlow becomes more widely adopted in the machine learning and data science domains, machine learning models and engines are being ported to it from other frameworks for improved performance, furthering the adoption and success of the open-source project.
-
Wolfram Wants to Deliver “Computation Everywhere” with New Private Cloud
Wolfram, the software company behind computation-centric products like Mathematica and Wolfram|Alpha, shipped a new private cloud appliance targeting companies that want to centralize their computational efforts.
-
QCon Awarded 10 Diversity Scholarships for QCon SF 2016
QCon San Francisco has provided diversity scholarships to members of underrepresented groups in the technology community. The conference is committed to encouraging diversity.
-
Ocado Uses TensorFlow and Google Cloud Platform for Novel Customer Service Approach
Ocado Technology uses TensorFlow to automatically categorize and prioritize customer emails in its support queue. The goals are quick response times and avoiding the impersonal support bots often used when customer volumes are large and support resources are finite.
-
CommAI, a Training and Testing AI System by Facebook
Facebook recently announced CommAI-env, a platform for training and evaluating an AI system. Inspired by the paper “A Roadmap towards Machine Intelligence”, the system aims to teach intelligent agents general learning capabilities that would serve as the groundwork for further, more specialized training through human- or machine-level interaction. The article provides a high-level overview of current state and..
-
Stream Processing and Lambda Architecture Challenges
Lambda architecture has been a popular solution that combines batch and stream processing. Kartik Paramasivam at LinkedIn wrote about how his team addressed stream processing and Lambda architecture challenges using Apache Samza for data processing. The challenges described are the late arrival of events and the processing of duplicated messages.
-
Jay Kreps on Distributed Stream Processing with Apache Kafka and Kafka Streams
The Apache Kafka and Kafka Streams frameworks help with developing stream-centric architectures and distributed stream processing applications. Jay Kreps, CEO of Confluent, gave the keynote presentation on stream processing and microservices at the Reactive Summit 2016 Conference last week.
-
Reactive Summit 2016 Conference: Reactive Microservices and Staging Data Pipelines
Reactive microservices, the data center scale operating system (DCOS), and staging reactive data pipelines were the highlighted topics at the Reactive Summit 2016 Conference held this week. The InfoQ team attended the conference, and this post summarizes the first day's events.
-
Confluent Announces Kafka for the Enterprise with Multi-Datacenter Replication
The latest version of Confluent Enterprise supports multi-datacenter replication, automatic data balancing, and cloud migration. Confluent, provider of the Apache Kafka-based streaming platform, announced the new features last week to help teams build streaming data pipelines and develop stream processing applications.
-
Twitter Open Sources Stream Processing Engine Heron
InfoQ's Rags Srinivas caught up with Karthik Ramasamy, co-creator and engineering manager at Twitter, about the open-sourcing of the stream processing engine Heron, a successor to Apache Storm.
-
How YouTube's Recommendation Algorithm Works
In a recent paper published by Google, YouTube engineers analyzed in greater detail the inner workings of YouTube’s recommendation algorithm. The paper was presented at the 10th ACM Conference on Recommender Systems last week in Boston. In this news item we analyze how YouTube uses deep learning to operate one of the largest and most complex recommendation systems in industry.