InfoQ Homepage AI, ML & Data Engineering Content on InfoQ
-
The Evolution of Uber’s 100+ Petabyte Big Data Platform
Uber’s engineering team wrote about how their big data platform evolved from traditional ETL jobs with relational databases to one based on Hadoop and Spark. A scalable ingestion model, standard transfer format and a custom library for incremental updates are the key components of the platform.
-
Data Lakes and Modern Data Architecture in Clinical Research and Healthcare
Dr. Prakriteswar Santikary, chief data officer at ERT, spoke at Data Architecture Summit 2018 Conference last month about data lake architecture his team developed at their clinical research organization. He discussed the data platform deployed in the cloud to streamline data collection, aggregation and clinical reporting and analytics, using concepts like serverless computing and data services.
-
JetBrains Introduces Datalore 1.0, an Intelligent Web Application for Data Analysis
JetBrains recently introduced Datalore 1.0, an intelligent web application for data analysis and visualization in Python. Datalore 1.0 brings an improved smart code editor, user-controlled code execution, professional subscription, and more.
-
New Updates to Firebase: Enterprise-Grade Support, ML Kit Face Contours, Management API, and More
Firebase is a service available on the Google infrastructure, enabling developers to build apps for Android, iOS, and the web. Recently, Google updated Firebase with paid enterprise-grade support, ML Kit Face Contours, a Firebase Management API, Test Lab for iOS, Performance Monitoring improvements, and Firebase Predictions.
-
Event Sourcing to the Cloud at HomeAway
Adam Haines, Data Architect at HomeAway, recently spoke at the Data Architecture Summit 2018 Conference about how his team leverages event sourcing cloud design pattern to accelerate the big data initiatives in their organization.
-
Cloudera and Hortonworks Merge with Goal to Increase Competition with Cloud Offerings
Earlier this month, Cloudera and Hortonworks announced an all-stock merger at a combined value of around $5.2 billion. Analysts have argued that this merger is aimed at increased competition that both companies are facing from cloud vendors like Amazon, Google and Microsoft. In this article we log reactions from analysts and the industry, and the implications for current customers.
-
Introducing EmoPy: An Open Source Toolkit for Facial Expression Recognition
In a recent blog post, Angelica Perez shared information about a new open source project for an interactive film experience. The project is called EmoPy and focuses on Facial Expression Recognition (FER) by providing a toolkit that allows developers to accurately predict emotions based upon images passed to the service.
-
Agile Data Modeling for NoSQL Databases
Pascal Desmarets recently spoke at Data Architecture Summit 2018 Conference about agile modeling and best practices for NoSQL databases.
-
QCon San Francisco 2018 Preview: What You Can Expect from This Year’s Conference
QCon, the series of global software conferences organized by InfoQ, is returning to the Bay Area for the 12th annual event, November 5th - 7th, and we’re expecting around 1,600 senior individual contributors and architects to join us. As in years past, the conference runs over three days with two bonus workshop days.
-
Q&A with Microsoft's Arindam Chatterjee Discussing Azure HDInsight 4.0
InfoQ caught up with Arindam Chatterjee, principal group manager at Microsoft, regarding the announcements about HDInsight at Microsoft Ignite.
-
Concept and Object Modeling Notation for Data Modeling NoSQL Databases
Ted Hills hosted a workshop at the recent Data Architecture Summit 2018 Conference about data modeling for relational and NoSQL databases. He said that the NoSQL movement helped the database community realize two things. First, not every application needs ACID properties. Second, the tabular data organization is still a good choice for much data, although not for all datasets.
-
William McKnight on Data Platforms and Creating a Modern Data Architecture
William McKnight gave a keynote presentation last week at Data Architecture Summit 2018 Conference on creating a modern data architecture using different data platforms.
-
New Git Submodule Vulnerability Patched
The Git community has disclosed a security vulnerability affecting the clone and submodule commands that could enable remote code execution when vulnerable machines access malicious repositories. The vulnerability, which has been assigned CVE–2018–17456 by Mitre, has been fixed in Git 2.19.1.
-
NVIDIA Announces RAPIDS, Medical Image Application, and a Driving Simulator for Autonomous Vehicles.
Today Jensen Huang, CEO of NVIDIA, gave a keynote at the GPU Technology Conference 2018 in Munich. He announced RAPIDS, an open-source CUDA accelerated toolkit that can help data scientists to faster process their data. They announced a partnership to work on medical imaging. They announced a self-driving car simulator that car manufacturers can use for verification of autonomous vehicles.
-
QCon SF 2018 (Nov 5-7) International Software Conference: Tracks, Workshops and Highlights
QCon San Francisco, the 12th annual software conference that attracts attendees from all over the world, returns to the Silicon Valley area November 5-7, 2018. With 32 days before the conference, 75% of the individually curated talks have been confirmed. The remaining talks are expected to be finalized over the next few weeks.