BT

Apache HBase 1.3 Ships with Multiple Performance Improvements

by Alexandre Rodrigues on  Jan 30, 2017

Apache HBase 1.3.0 was released mid-January 2017 and ships with support for date-based tiered compaction and improvements in multiple areas, like write-ahead log (WAL), and a new RPC scheduler, among others. The release includes almost 1,700 resolved issues in total.

The Infrastructure Behind Twitter: Scaling Networking, Storage and Provisioning

by Daniel Bryant on  Jan 30, 2017

The Twitter Engineering team has recently provided an insight into the evolution and scaling of the core technologies behind their in-house infrastructure that powers the social media service. Core lessons shared included: Architect beyond the original specifications; there is no such a thing as a “temporary change or workaround”; and documenting best practices has been a “force multiplier”.

Q&A with Matt Klein on Creating Envoy at Lyft

by Thomas Betts on  Jan 25, 2017

At the upcoming Microservices.com Practitioners Summit on Jan 31, Matt Klein, a senior software engineer at Lyft, will be presenting his work on Envoy, a Layer 7 communications bus used throughout Lyft’s service-oriented architecture. InfoQ met with Klein to discuss the benefits of creating a custom tool for Lyft’s networking needs, and how it could benefit other microservices architectures.

Apache Eagle, Originally from eBay, Graduates to top-level project

by Alexandre Rodrigues on  Jan 24, 2017

Apache Eagle, an open-source solution for identifying security and performance issues on big data platforms, graduates to Apache top level project on January 10, 2017. Firstly open-sourced by eBay on October 2015, Eagle was created to instantly detect access to sensitive data or malicious activities and, to take actions in a timely fashion.

Docker 1.13 Released with Improved CLI, Compose-File Support for Swarm Mode, and Secrets API

by Daniel Bryant on  Jan 23, 2017 1

Docker Inc, has released version 1.13 of its open source Docker container engine project. This release includes significant restructuring of the Docker CLI, and the introduction of ‘clean-up’ commands to reclaim disk space. Alongside the launch of Docker 1.13, new releases of the supporting toolchain were also made, including: Docker Compose 1.10, Docker Machine 0.9.0, and Notary 0.4.3.

Improving Azure SQL Database Performance Using In-Memory Technologies

by Kent Weare on  Jan 21, 2017 4

In late 2016, Microsoft announced the general availability of Azure SQL Database In-Memory technologies. In-Memory processing is only available in Azure Premium database tiers and provides performance improvements for On-line Analytical Processing (OLTP), Clustered Columnstore Indexes and Non-clustered Columnstore Indexes for Hybrid Transactional and Analytical Processing (HTAP) scenarios.

Multiple DNS Providers to Mitigate DDoS Attacks

by Hrishikesh Barua on  Jan 07, 2017

Distributed Denial of Service (DDoS) attacks against Domain Name System (DNS) providers are increasing in number and scale with the proliferation of insecure IoT devices. While DNS providers have various methods of protecting themselves against such attacks, one of the ways for a website to protect itself is to use multiple DNS providers.

Mathieu Ripert on Instacart's Machine Learning Optimizations

by Alexandre Rodrigues on  Jan 05, 2017

Instacart is an online delivery service for groceries under one hour. Customers order the items on the website or using the mobile app, and a group of Instacart’s shoppers go to local stores, purchase the items and deliver them to the customer. InfoQ interviewed Mathieu Ripert, data scientist at Instacart, to find out how machine learning is leveraged to guarantee a better customer experience.

Google BigQuery Adds New Public Datasets

by Alex Giamas on  Jan 05, 2017

Stack Overflow recently announced making its dataset available through Google’s BigQuery. Using regular SQL statements, developers can query the full set of Stack Overflow data including posts, votes, tags, and badges. In this article we explore datasets that are available through Google's BigQuery platform.

Neo4j 3.1 Supports Causal Clustering and Security Enhancements

by Srini Penchikala on  Dec 31, 2016

The latest version of Graph NoSQL database Neo4j introduces causal clustering and new security architecture. Neo4j team recently released version 3.1 of the graph database. Other new features include database kernel improvements and a Schema Viewer.

Netflix Conductor, an Orchestration Engine for Microservices

by Abel Avram on  Dec 20, 2016 3

Netflix has developed an orchestration engine called “Conductor”, and has used it internally in production for the last year . During this time they executed some 2.6 million process workflows, starting with linear ones and ending with dynamic ones running over multiple days. Now they have open sourced Conductor, making it available to all those interested in workflow orchestration.

Julien Nioche on StormCrawler, Open-Source Crawler Pipelines Backed by Apache Storm

by Alexandre Rodrigues on  Dec 15, 2016

Julien Nioche, director of DigitalPebble, PMC member and committer of the Apache Nutch web crawler project, talks about StormCrawler, a collection of reusable components to build distributed web crawlers based on the streaming framework Apache Storm. InfoQ interviewed Nioche, main contributor of the project, to find out more about StormCrawler and how it compares to other similar technologies.

Q&A with Drew Koszewnik on a Disseminated Cache, Netflix Hollow

by Rags Srinivas on  Dec 14, 2016

Drew Koszewnik of Netflix talks to Rags Srinivas about a disseminated cache called Hollow.

Google Pushing for HTTPS

by Manuel Pais on  Dec 11, 2016

Google wants to push for HTTPS everywhere with a combination of deprecating existing Chrome features in non-secure sites, as well as new features only supported in HTTPS.

Facebook's Comparison of Apache Giraph and Spark GraphX for Graph Data Processing

by Srini Penchikala on  Dec 09, 2016

A Facebook team has recently published a comparison of the performance of their existing Giraph-based graph processing system with the newer GraphX which is part of the popular Spark framework. Their conclusion is that GraphX is neither sufficiently scalable or performant to support their graph processing workloads.

BT