At QCon San Francisco, engineers at Netflix discussed their big data strategy and analytics infrastructure. This included a summary of the scale of their data, their S3 data warehouse, and Genie, their big data federated orchestration system.
In a recent blog post, Amazon announced changes to its NoSQL database service, DynamoDB, that includes support for Time to Live (TTL) on data stored in the service. Deleting data, based upon time-based thresholds, provides opportunities for organizations to reduce storage costs for data that decreases in value over time.
Apache Ranger, a security management framework for Apache Hadoop ecosystem, graduated to top level. Ranger is used as a centralized component to define and administer security policies that are enforced across supported Hadoop components such as Apache HBase, Hadoop (HDFS and YARN), Apache Hive, Apache Kafka, Apache Solr, among others.
Google has launched the public beta for Cloud Spanner, its globally distributed relational database service. Part of Google Cloud Platform, it delivers both ACID transactions and high availability, appearing to violate CAP theorem.
Nine months after acquiring BoldRadius, Lightbend announced their acquisition of OpsClarity, a company specializing in monitoring reactive applications. InfoQ interviewed Mark Brewer, president and CEO at Lightbend and Alan Ngai, co-founder of OpsClarity and now VP of cloud services at Lightbend to learn more about this new partnership.
Beam exits incubation period and graduates to top-level Apache project, Google support and contribution to open source integration for various data processing backends and more.
GitLab's postmortem into the root cause of their 18 hour site outage is a detailed look at how the incident began, how it got worse before it got better, and how they plan to learn from the mistakes and improve the service.
Deep Learning is a rapidly evolving subfield of Machine Learning originating from Neural Networks. Recent algorithmic advances and utilization of GPU parallelization have resulted in Deep Learning based algorithms mastering the game of Go as well as several practical applications. The fashion industry is one of the target sectors for Deep Learning. Gilt is using Deep Learning for real world apps
Microsoft has developed and open sourced AirSim, a tool that can be used to simulate the flight of drones around the world. The simulator is built on the Unreal Engine and Microsoft will soon add support for robots and other types of vehicles.
Apache Flink 1.2 was announced and features dynamic rescaling, security, queryable state, and more. The release resolved 650 issues, maintains compatibility with all public APIs and ships with Apache Kafka 0.10 and Apache Mesos support. Flink’s dynamic rescaling allows one to change the parallelism of a streaming job or of an operator within the job.
Data Geekery released version 3.9.0 of jOOQ, their object-relational mapping (ORM) Java toolkit, featuring an experimental parser, additional type safety, better integration with Oracle, and improved transactions. InfoQ spoke to Lukas Eder, founder and CEO of Data Geekery GmbH, about this latest release.
Your SQL Server database is slowly grinding to a halt, your DBA is on vacation, and you don’t know where to start. This is when you break out the SQL Server First Responder Kit. This open source project consists of a set of scripts to help the DBA, or accidental DBA, fix and tune a SQL Server instance.
Google recently announced increased support for Microsoft Windows Server and SQL Server, in the Google Cloud Platform, by providing pre-configured images for Windows Server Core and SQL Server Enterprise Edition. Google has also added support for High Availability and Disaster Recovery scenarios using SQL Server AlwaysOn Availability Groups.
MindMeld, a conversational AI company, has published The Conversational AI Playbook, a guide outlining the challenges and the steps to be made to create conversational applications.
Apache HBase 1.3.0 was released mid-January 2017 and ships with support for date-based tiered compaction and improvements in multiple areas, like write-ahead log (WAL), and a new RPC scheduler, among others. The release includes almost 1,700 resolved issues in total.