InfoQ Homepage Big Data Content on InfoQ

News

RSS Feed

Newer Older

Cloud

Amazon Elastic MapReduce Now Generally Available as a Serverless Offering

AWS recently announced that Amazon Elastic MapReduce (EMR) Serverless is generally available (GA). The offering is a serverless deployment option for customers to run big data analytics applications using open-source frameworks like Apache Spark and Hive without configuring, managing, and scaling clusters or servers.

Steef-Jan Wiggers
on Jun 07, 2022
Cloud

Google Introduces Autoscaling for Cloud Bigtable for Optimizing Costs

Cloud Bigtable is a fully-managed, scalable NoSQL database service for large operational and analytical workloads on the Google Cloud Platform (GCP). And recently, the public cloud provider announced the general availability of Bigtable Autoscaling, which automatically adds or removes capacity in response to the changing demand for applications allowing cost optimizations.

Steef-Jan Wiggers
on Jan 31, 2022
Cloud

Amazon OpenSearch Adds Anomaly Detection for Historical Data

Amazon OpenSearch recently introduced the support of anomaly detection for historical data. The machine learning based feature helps identifying trends, patterns, and seasonality in OpenSearch data.

Renato Losio
on Jan 29, 2022
Cloud

AWS Announces the Public Preview of AWS Data Exchange for Amazon Redshift

Recently AWS announced the public preview of AWS Data Exchange for Amazon Redshift. This new feature enables customers to find and subscribe to third-party data in AWS Data Exchange to query in an Amazon Redshift data warehouse.

Steef-Jan Wiggers
on Oct 27, 2021
Cloud

AWS Announces the General Availability and Open Sourcing of the Amazon Genomics CLI

Amazon Genomics CLI is a tool that makes it easier to process genomics data at a petabyte-scale on AWS. Earlier this year, the public cloud vendor shared a preview of the tool, and it is now open source and generally available.

Steef-Jan Wiggers
on Oct 06, 2021
Cloud

Hazelcast Jet 4.4 Released - the Four-Year Anniversary Release as Seen by Scott McMahon

Hazelcast Jet recently celebrated its four-year anniversary with the release of version 4.4. Besides the normal bug fixes and performance enhancements, this new version ships with new features such as the unified file connector and the first beta version of the SQL interface. InfoQ spoke to Scott McMahon, technical director of field engineering at Hazelcast, about this new release.

Olimpiu Pop
on Mar 19, 2021
Culture & Methods

Using Machine Learning in Testing and Maintenance

With machine learning, we can reduce maintenance efforts and improve the quality of products. It can be used in various stages of the software testing life-cycle, including bug management, which is an important part of the chain. We can analyze large amounts of data for classifying, triaging, and prioritizing bugs in a more efficient way by means of machine learning algorithms.

Ben Linders
on Mar 18, 2021
AI, ML & Data Engineering

DataStax Announces Astra Serverless Database-as-a-Service

DataStax , the company behind the Cassandra database, announced last week the general availability of Astra serverless, the open, multi-cloud serverless database-as-a-service (DBaaS).

Srini Penchikala
on Mar 15, 2021
Architecture & Design

Designing for Failure in the BBC's Analytics Platform

Last week at InfoQ Live, Blanca Garcia-Gil, principal systems engineer at BBC, gave a session on Evolving Analytics in the Data Platform. During this session, Garcia-Gil focused on how her team prepared and designed for two types of failure - "known unknowns" and "unknown unknowns."

Eran Stiller
on Feb 24, 2021
Cloud

Google Brings Databricks to Its Cloud Platform

Recently Google announced a partnership with Databricks to bring their fully-managed Apache Spark offering and data lake capabilities to Google Cloud. The offering will become available as Databricks on Google Cloud.

Steef-Jan Wiggers
on Feb 23, 2021
Architecture & Design

PayPal Standardizes on Apache Airflow and Apache Gobblin for Its Next-Gen Data Movement Platform

PayPal recently described how it standardized on Apache Airflow and Apache Gobblin for implementing its next-gen data movement platform. In a recent blog post, PayPal engineers detail how the existing data movement platform evolved into many tools & platforms in a complex and unmanageable ecosystem and their shift towards a new implementation.

Eran Stiller
on Feb 10, 2021
Culture & Methods

Analyzing Large Amounts of Feedback to Learn from Users

Making it easy for users to give feedback and automating the collection of feedback helps to get more feedback faster. Using artificial intelligence, you can analyze large amounts of feedback to get insights and visualize trends. Sharing this information widely supports taking action to enhance your product and solve issues that users are having.

Ben Linders
on Dec 24, 2020
Cloud

Google Announces a New, More Services-Based Architecture Called Runner V2 to Dataflow

Google Cloud Dataflow is a fully-managed service for executing Apache Beam pipelines within the Google Cloud Platform(GCP). In a recent blog post, Google announced a new, more services-based architecture called Runner v2 to Dataflow – which will include multi-language support for all of its language SDKs.

Steef-Jan Wiggers
on Aug 30, 2020
AI, ML & Data Engineering

Spark AI Summit 2020 Highlights: Innovations to Improve Spark 3.0 Performance

At the recent Spark AI Summit 2020, held online for the first time, the highlights of the event were innovations to improve Apache Spark 3.0 performance, including optimizations for Spark SQL, and GPU acceleration.

Carol McDonald
on Jul 03, 2020
DevOps

Splunk Launches New Release of SignalFx APM

Splunk, a platform for searching, monitoring, and examining machine-generated big data, has launched a new release of application monitoring tool SignalFx Microservices APM™. The new release combines NoSample™ tracing, open standards based instrumentation and artificial intelligence (AI)-driven directed troubleshooting from SignalFx and Omnition into a single solution.

Helen Beal
on Apr 30, 2020

Newer News

Older News

InfoQ Software Architects' Newsletter

News