BT

Speedment Releases Stream ORM Version 3.0.1

by Michael Redlich on  Dec 16, 2016

Speedment released version 3.0.1 of their stream object-relational mapping Java toolkit and runtime application, featuring a new declarative Java 8 stream API, an improved user interface, and better code generation. InfoQ spoke to Per-Åke Minborg, co-founder and CTO of Speedment, about this latest release.

Julien Nioche on StormCrawler, Open-Source Crawler Pipelines Backed by Apache Storm

by Alexandre Rodrigues on  Dec 15, 2016

Julien Nioche, director of DigitalPebble, PMC member and committer of the Apache Nutch web crawler project, talks about StormCrawler, a collection of reusable components to build distributed web crawlers based on the streaming framework Apache Storm. InfoQ interviewed Nioche, main contributor of the project, to find out more about StormCrawler and how it compares to other similar technologies.

Facebook's Comparison of Apache Giraph and Spark GraphX for Graph Data Processing

by Srini Penchikala on  Dec 09, 2016

A Facebook team has recently published a comparison of the performance of their existing Giraph-based graph processing system with the newer GraphX which is part of the popular Spark framework. Their conclusion is that GraphX is neither sufficiently scalable or performant to support their graph processing workloads.

Julien Le Dem on the Future of Column-Oriented Data Processing with Apache Arrow

by Alexandre Rodrigues on  Dec 08, 2016 1

Julien Le Dem, the PMC chair of the Apache Arrow project, presented on Data Eng Conf NY on the future of column-oriented data processing. Apache Arrow is an open-source standard for columnar in-memory execution. InfoQ interviewed Le Dem to find out the differences between Arrow and Parquet.

AWS re:Invent Recap

by Richard Seroter on  Dec 04, 2016

At their annual re:Invent conference in Las Vegas, AWS unleashed a flurry of announcements about upcoming cloud services. Amazon outlined over two dozen new capabilities coming to the public cloud, including directly querying data in S3 object storage, building code as part of deployment pipelines, provisioning cheap virtual private servers, and moving data in bulk, ETL-style.

Technologies for the Future of Software Engineering

by Ben Linders on  Dec 01, 2016 2

The Cloud, infrastructure as code, federated architectures with APIs, and anti-fragile systems: these are technologies for developing software systems that are rapidly coming into focus, claimed Mary Poppendieck. Systems are moving towards the cloud, and APIs are replacing central shared databases and enable the internet of things. We need to develop anti-fragile systems which embrace failure.

Couchbase 4.6 Developer Preview Released, Adds Real-Time Connectors for Apache Spark 2.0 and Kafka

by Alexandre Rodrigues on  Nov 28, 2016

Couchbase 4.6 Developer Preview features full text search improvements, cross data center replication with globally-ordered conflict resolution and connectors for real-time analytics technologies: one for Spark 2.0 and the other for Kafka.

Realm Releases Object Database for Node.js

by James Chesters on  Nov 22, 2016

Realm has launched an open source object database for Node.js, allowing mobile developers to create and send pre-populated Realms to clients.

Spark Summit EU Highlights: TensorFlow, Structured Streaming and GPU Hardware Acceleration

by Alexandre Rodrigues on  Nov 13, 2016

Apache Spark integration with deep learning library TensorFlow, online learning using Structured Streaming and GPU hardware acceleration were the highlights of Spark Summit EU 2016 held last week in Brussels.

Microsoft Releases Data Science Tools for Interactive Data Exploration and Modeling

by Srini Penchikala on  Nov 07, 2016

Microsoft recently released two new data science tools for interactive data exploration: modeling and reporting. These tools can be reused by data science teams with data specific tasks in their projects. The goal is to ensure consistency and completeness of data science tasks across different projects in the organization.

Microservices and Stream Processing Architecture at Zalando Using Apache Flink

by Srini Penchikala on  Oct 31, 2016 1

Javier Lopez and Mihail Vieru spoke at Reactive Summit 2016 Conference about cloud-based data integration and distribution platform used for stream processing in business intelligence use cases. Their solution is based on technologies such as Flink, Kafka and Elasticsearch.

Wolfram Wants to Deliver “Computation Everywhere” with New Private Cloud

by Richard Seroter on  Oct 26, 2016

Wolfram, the software company behind computation-centric products like Mathematica and Wolfram|Alpha, shipped a new private cloud appliance targeting companies that want to centralize their computational efforts.

Stream Processing and Lambda Architecture Challenges

by Alexandre Rodrigues on  Oct 19, 2016 4

Lambda architecture has been a popular solution that combines batch and stream processing. Kartik Paramasivam at LinkedIn wrote about how his team addressed stream processing and Lambda architecture challenges using Apache Samza for data processing. The challenges described are the late arrival of events and the processing of duplicated messages.

Jay Kreps on Distributed Stream Processing with Apache Kafka and Kafka Streams

by Srini Penchikala on  Oct 16, 2016

Apache Kafka and Kafka Streams frameworks help with developing stream-centric architectures and distributed stream processing applications. Jay Kreps, CEO of Confluent, gave the keynote presentation on stream processing and microservices at Reactive Summit 2016 Conference last week.

Reactive Summit 2016 Conference: Reactive Microservices and Staging Data Pipelines

by Srini Penchikala on  Oct 08, 2016

Reactive microservices, data center scale operating system (DCOS), and staging reactive data pipelines were the highlighted topics at Reactive Summit 2016 Conference held this week. InfoQ team attended the conference and this post is a summary of the first day's events at the conference.

General Feedback
Bugs
Advertising
Editorial
Marketing
InfoQ.com and all content copyright © 2006-2016 C4Media Inc. InfoQ.com hosted at Contegix, the best ISP we've ever worked with.
Privacy policy
BT

We notice you're using an ad blocker

We understand why you use ad blockers. However to keep InfoQ free we need your support. InfoQ will not provide your data to third parties without individual opt-in consent. We only work with advertisers relevant to our readers. Please consider whitelisting us.