InfoQ Homepage ETL Content on InfoQ
-
Data Lakes and Modern Data Architecture in Clinical Research and Healthcare
Dr. Prakriteswar Santikary, chief data officer at ERT, spoke at Data Architecture Summit 2018 Conference last month about data lake architecture his team developed at their clinical research organization. He discussed the data platform deployed in the cloud to streamline data collection, aggregation and clinical reporting and analytics, using concepts like serverless computing and data services.
-
AWS re:Invent Recap
At their annual re:Invent conference in Las Vegas, AWS unleashed a flurry of announcements about upcoming cloud services. Amazon outlined over two dozen new capabilities coming to the public cloud, including directly querying data in S3 object storage, building code as part of deployment pipelines, provisioning cheap virtual private servers, and moving data in bulk, ETL-style.
-
AWS Launches Relational Database Migration Service
After a brief beta period that saw customers migrate more than 1,000 on-premises databases to the cloud, AWS formally released their Database Migration Service. This on-demand cloud service supports live migration scenarios, and customers who wish to switch their database platform as part of the migration can do so, thanks to a free schema conversion tool.
-
Introducing Reactive Streams
Modern software increasingly operates on data in near real-time. There is business value in sub-second responses to changing information and stream processing is one way to help turn data into knowledge as fast as possible, Kevin Webber explains in an introduction to Reactive Streams.
-
Google's Cloud Dataflow Enters General Availability
On August 12, Google announced that its big data processing service has reached general availability. This managed service allows customers to build pipelines that manipulate data prior to being processed by big data solutions. Cloud Dataflow supports both streaming and batch programming in a unified model.
-
Implementing Agile in Data Warehouse Projects
This post talks about using an agile implementation for data warehouse projects.
-
SvcPerf - Trace Analysis Tool for WCF Released
Microsoft recently released SvcPerf which is an end to end ETW based trace viewer for manifest based traces.
-
Amazon Makes Compelling Case for Hosting and Processing Your Big Data
The AWS team has announced a limited preview of Amazon Redshift, a cloud-hosted data warehouse whose cost and capabilities are poised to disrupt the industry. In addition, AWS revealed two new massive compute instance types, and a data integration tool called Data Pipeline.
-
Is Enterprise Data Management the Third Face of the SOA/BPM Coin?
Fred Cummins, an EDS fellow, and SOA veteran, wrote an essay last week on "Data Management for SOA". He is looking at how some of the key tenets of service design ("loose coupling" and "autonomy") relate to enterprise data in the context of achieving reuse and enabling change.
-
ActiveWarehouse, a New Step for Enterprise Ruby
ActiveWarehouse, is a significant new plugin that makes it easier to build data warehouses in Rails.