
Data Science

Julien Le Dem on the Future of Column-Oriented Data Processing with Apache Arrow

by Alexandre Rodrigues on Dec 08, 2016

Julien Le Dem, the PMC chair of the Apache Arrow project, presented at Data Eng Conf NY on the future of column-oriented data processing. Apache Arrow is an open-source standard for columnar in-memory execution. InfoQ interviewed Le Dem to find out the differences between Arrow and Parquet.

Data Science

Couchbase 4.6 Developer Preview Released, Adds Real-Time Connectors for Apache Spark 2.0 and Kafka

by Alexandre Rodrigues on Nov 28, 2016

Couchbase 4.6 Developer Preview features full-text search improvements, cross data center replication with globally ordered conflict resolution, and connectors for real-time analytics technologies: one for Spark 2.0 and the other for Kafka.

Data Science

Spark Summit EU Highlights: TensorFlow, Structured Streaming and GPU Hardware Acceleration

by Alexandre Rodrigues on Nov 13, 2016

Apache Spark integration with deep learning library TensorFlow, online learning using Structured Streaming and GPU hardware acceleration were the highlights of Spark Summit EU 2016 held last week in Brussels.

Data Science

Microsoft Releases Data Science Tools for Interactive Data Exploration and Modeling

by Srini Penchikala on Nov 07, 2016

Microsoft recently released two new data science tools for interactive data exploration, modeling, and reporting. These tools can be reused by data science teams for data-specific tasks in their projects. The goal is to ensure consistency and completeness of data science tasks across different projects in the organization.

Data Science

Microservices and Stream Processing Architecture at Zalando Using Apache Flink

by Srini Penchikala on Oct 31, 2016

Javier Lopez and Mihail Vieru spoke at the Reactive Summit 2016 Conference about a cloud-based data integration and distribution platform used for stream processing in business intelligence use cases. Their solution is based on technologies such as Flink, Kafka and Elasticsearch.

Cloud

Wolfram Wants to Deliver “Computation Everywhere” with New Private Cloud

by Richard Seroter on Oct 26, 2016

Wolfram, the software company behind computation-centric products like Mathematica and Wolfram|Alpha, shipped a new private cloud appliance targeting companies that want to centralize their computational efforts.

Data Science

Stream Processing and Lambda Architecture Challenges

by Alexandre Rodrigues on Oct 19, 2016

Lambda architecture has been a popular solution that combines batch and stream processing. Kartik Paramasivam at LinkedIn wrote about how his team addressed stream processing and Lambda architecture challenges using Apache Samza for data processing. The challenges described are the late arrival of events and the processing of duplicated messages.

Data Science

Jay Kreps on Distributed Stream Processing with Apache Kafka and Kafka Streams

by Srini Penchikala on Oct 16, 2016

The Apache Kafka and Kafka Streams frameworks help with developing stream-centric architectures and distributed stream processing applications. Jay Kreps, CEO of Confluent, gave the keynote presentation on stream processing and microservices at the Reactive Summit 2016 Conference last week.

Data Science

Reactive Summit 2016 Conference: Reactive Microservices and Staging Data Pipelines

by Srini Penchikala on Oct 08, 2016

Reactive microservices, data center scale operating system (DCOS), and staging reactive data pipelines were the highlighted topics at the Reactive Summit 2016 Conference held this week. The InfoQ team attended the conference, and this post summarizes the first day's events.

Data Science

Confluent Announces Kafka for the Enterprise with Multi-Datacenter Replication

by Srini Penchikala on Oct 05, 2016

The latest version of Confluent Enterprise supports multi-datacenter replication, automatic data balancing, and cloud migration capability. Confluent, provider of the Apache Kafka based streaming platform, announced the new features last week to help build streaming data pipelines and develop stream processing applications.

Cloud

Amazon Kinesis Analytics is Like SaaS for Big Data Analysis

by Elton Stoneman on Sep 14, 2016

Real-time analysis of event streams is a new focus for Big Data platforms, both on-premises and in the cloud. AWS has released Amazon Kinesis Analytics, a rival to Azure Stream Analytics. Both platforms use a simple SQL language for complex querying and move Big Data analysis into a SaaS-like space.

Data Science

IBM Creates Artificial Neurons from Phase Change Memory for Cognitive Computing

by Srini Penchikala on Sep 13, 2016

A team of scientists at IBM Research in Zurich has created an artificial version of neurons using phase-change materials to store and process data. These phase-change-based artificial neurons can be used to detect patterns and discover correlations in Big Data (real-time streams of event-based data) and to perform unsupervised machine learning at high speeds using very little energy.

Culture & Methods

Getting the Data Needed for Data Science

by Ben Linders on Sep 02, 2016

Data science is about the data that you need; deciding which data to collect, create, or keep is fundamental, argues Lukas Vermeer, an experienced data science professional and Product Owner for Experimentation at Booking.com. True innovation starts with asking big questions; only then does it become apparent which data is needed to find the answers you seek.

Cloud

Azure Premium Messaging Service Reaches General Availability

by Kent Weare on Jul 31, 2016

On July 15th, Microsoft announced that the Azure Premium Messaging service has reached General Availability (GA). Premium Messaging targets customers who would like more predictable messaging performance. InfoQ reached out to Dan Rosanova, Principal Program Manager on the Azure Service Bus team, for additional insight into this milestone.

Development

Basho Open Sources Time Series Database Riak TS 1.3

by Rags Srinivas on Jul 15, 2016

InfoQ's Rags Srinivas talks to Basho's CTO Dave McCrory about the open sourcing of Riak TS 1.3, which is geared toward handling time series data.
