Big Data Content on InfoQ

News

Cloud

Netflix Keystone Real-Time Stream Processing Platform

Netflix recently published a post on its tech blog discussing the design considerations and insights behind Keystone, its real-time stream processing platform. Keystone has been operational since December 2015 and has grown significantly as the Netflix subscriber base grew from 65 million to over 130 million in the past three years. This article follows up on the latest state of the Keystone platform...

Alex Giamas
on Sep 30, 2018
AI, ML & Data Engineering

California Creates Consumer Privacy Act

California has enacted the California Consumer Privacy Act (CCPA) of 2018, which, starting on January 1, 2020, will grant consumers several rights with respect to the information about them that businesses collect, store, sell, and share. This is the first legislation of its kind in the United States.

Michael Stiefel
on Aug 31, 2018
AI, ML & Data Engineering

New York Creates Task Force to Examine Automated Decision Making

New York City has created an Automated Decision Systems Task Force to demand accountability and transparency in how algorithms are used in city government. The final report of the task force is due in December 2019. This task force is the first in the United States to study this issue.

Michael Stiefel
on Jul 31, 2018
AI, ML & Data Engineering

A Team's Transformation from Software Development to ML: Golestan Radwan at QCon NY

As companies start to add Big Data and Machine Learning initiatives to their project portfolios, they face several challenges, including their teams' transition from software engineering to data engineering and machine learning. Golestan "Sally" Radwan spoke at the QCon New York 2018 conference about her experience leading a traditional software engineering team on a machine learning/AI journey.

Srini Penchikala
on Jul 12, 2018
AI, ML & Data Engineering

Distributed Messaging Framework Apache Pulsar 2.0 Supports Schema Registry and Topic Compaction

The latest version of the open-source distributed pub-sub messaging framework Apache Pulsar enables companies to move “beyond batch” by acting on data in motion. Streamlio recently announced the availability of the Apache Pulsar 2.0 streaming messaging solution. The new version supports Pulsar Functions, Schema Registry and Topic Compaction.

Srini Penchikala
on Jun 25, 2018
AI, ML & Data Engineering

eBay's Accelerator Data Processing Framework Provides Parallel Execution and Live Recommendations

eBay's Accelerator data processing framework provides parallel execution and automatic organization of source code, input data, and results. It can be used for data analysis and algorithm development, as well as for a live recommendation system.

Srini Penchikala
on May 31, 2018
AI, ML & Data Engineering

PayPal's Gimel Analytics Platform Provides Unified Data API and GSQL

Romit Mehta and Deepak Chandramouli from PayPal spoke at the recent QCon.ai conference about the Gimel data analytics platform and how it can be used to commoditize data access. InfoQ spoke with Mehta and Chandramouli about the data platform and its support in the areas of security,

Srini Penchikala
on Apr 17, 2018
Architecture & Design

Chile’s Energy Regulator to Adopt Blockchain

PV magazine, a publication that reports on photovoltaics (solar power generation), has announced that Chile's energy regulator is set to adopt blockchain in March 2018. The regulator plans to use blockchain technology to transparently record market prices, marginal costs, fuel prices and compliance documentation.

Kent Weare
on Mar 28, 2018
Cloud

Oral Arguments before Supreme Court in Microsoft Cloud Computing Case Focus on Legal Issues

On February 27, 2018, the Supreme Court of the United States heard oral arguments in the Microsoft cloud computing case. A ruling against Microsoft could require companies based in the United States to hand over to law enforcement data stored on foreign servers. U.S.-based organizations might then be unable to provide cloud computing services to foreign countries.

Michael Stiefel
on Mar 15, 2018
Cloud

Managing and Operating Kafka Clusters in Kubernetes

Nenad Bogojevic, platform solutions architect at Amadeus, spoke at the KubeCon + CloudNativeCon North America 2017 conference on how to run and manage Kafka clusters in a Kubernetes environment. He talked about provisioning Kafka clusters and configuring them using Kubernetes custom resources or ConfigMaps.

Srini Penchikala
on Feb 06, 2018
AI, ML & Data Engineering

Modern Big Data Pipelines over Kubernetes

Container management technologies like Kubernetes make it possible to implement modern big data pipelines. Eliran Bivas, senior big data architect at Iguazio, spoke at the recent KubeCon + CloudNativeCon North America 2017 conference about big data pipelines and how Kubernetes can help develop them.

Srini Penchikala
on Jan 08, 2018
AI, ML & Data Engineering

TensorFlow Lite Supports On-Device Conversational Modeling

TensorFlow Lite, the lightweight version of the open source deep learning framework TensorFlow, supports on-device conversational modeling to plug conversational intelligence features into chat applications. The TensorFlow team recently announced the release of TensorFlow Lite, which can be used on mobile and embedded devices.

Srini Penchikala
on Nov 29, 2017
Culture & Methods

Leslie Miley on Bias in Big Data/ML and AI - QCon San Francisco

At QCon San Francisco, Leslie Miley gave a keynote talk in which he explained how inherent bias in data sets has affected everything from the 2016 Presidential race to criminal sentencing in the United States.

Shane Hastie
on Nov 20, 2017
AI, ML & Data Engineering

Confluent Releases KSQL, a Distributed Streaming SQL Engine for Apache Kafka

Confluent released KSQL, an interactive, distributed streaming SQL engine for Apache Kafka. KSQL supports stream processing operations like aggregations, joins, windowing, and sessionization on topics in Apache Kafka. Confluent announced the open source streaming SQL engine at the recent Kafka Summit conference.

Srini Penchikala
on Oct 25, 2017
AI, ML & Data Engineering

Q&A with Andrew Brust of Datameer Regarding Big Data's Role in AI

Rags Srinivas talks to Datameer's Andrew Brust about the larger role of Big Data in AI and how it's operationalized with SmartAI.

Rags Srinivas
on Jul 31, 2017
