InfoQ Homepage Big Data Content on InfoQ

News

RSS Feed

Newer Older

Leveraging Data Science to Improve Monitoring

At the recent devopsdays Amsterdam 2015, Patrick Roelke contended that monitoring still has lots of issues. Roelke believes that data science can help by eliminating static thresholds and coalescing information from various data sources into a single metric. The talk included a quick overview of monitoring tools that leverage data science: Kale, Bosun and AnomalyDetection.

João Miranda
on Jun 30, 2015
Software Defined Data Mart In The Enterprise Using Metanautix Quest

Metanautix recently announced the newest version of its product, Quest. Quest allows enterprises to build software defined data marts that can run in virtualized servers. Designed from the ground up with security and auditability in mind, Quest can deal with Big Data workloads and encapsulate it into different logical views, ready for consumption by different departments in enterprise.

Alex Giamas
on Jun 29, 2015
Developments in IT Project Management

The demand for IT project managers is increasing. Agile methodologies support collaboration with distributed teams for creative problem solving. The Internet of Things, cloud, big data, and cyber security will continue to dominate the IT landscape. Project managers have to pioneer IOT initiatives, be prepared for the influx of data and ensure that deliverables from their projects are secure.

Ben Linders
on Jun 25, 2015
Twitter Has Replaced Storm with Heron

Twitter has replaced Storm with Heron which provides up to 14 times more throughput and up to 10 times less latency on a word count topology, and helped them reduce the needed hardware to a third.

Abel Avram
on Jun 12, 2015
Parquet Becomes Top-Level Apache Project

Apache Parquet, the open-source columnar storage format for Hadoop, recently graduated from the Apache Software Foundation Incubator and became a top-level project. Initially created by Cloudera and Twitter in 2012 to speed up analytical processing, Parquet is now openly available for Apache Spark, Apache Hive, Apache Pig, Impala, native MapReduce, and other key components of the Hadoop ecosystem.

Jérôme Serrano
on Jun 11, 2015
Capgemini Apollo: An Open Source Microservice and Big Data Platform

Capgemini are currently working on Apollo, an open source application platform built on top of the Apache Mesos cluster manager and Docker, which is designed to power next generation web services, microservices and big data platforms running at scale.

Daniel Bryant
on Jun 06, 2015
MemSQL 4 Database Supports Community Edition, Geospatial Intelligence and Spark Integration

Latest version of MemSQL, in-memory database with support for transactions and analytics, includes a new Community Edition for free use by organizations. MemSQL 4, released last week, also supports integration with Apache Spark, Hadoop Distributed File System (HDFS), and Amazon S3.

Srini Penchikala
on May 30, 2015
Deep Convolutional Networks for Super-Resolution Image Reconstruction at Flipboard

Flipboard recently reported on an in-house application of deep learning to scale up low-resolution images that illustrates the power and flexibility of this class of learning algorithms.

Mikio Braun
on May 26, 2015
Glenn Tamkin on Applying Apache Hadoop to NASA's Big Climate Data

NASA Center for Climate Simulation (NCCS) is using Apache Hadoop for high-performance data analytics. Glenn Tamkin from NASA team, recently spoke at ApacheCon Conference and shared the details of the platform they built for climate data analysis with Hadoop.

Srini Penchikala
on May 06, 2015
Google Offers Bigtable in the Cloud

Google is making available to customers Cloud Bigtable, their own database used for more than a decade for services such as Search, GMail, Maps or YouTube. While they are not open sourcing Bigtable as they did with other products, the new cloud service is accessible through an open source interface, the Apache HBase 1.0.1 API.

Abel Avram
on May 06, 2015
Adatao Launches Full Stack Data Intelligence Platform

Adatao recently announced the general availability of its Data Intelligence platform. Its platform aims to make data analysis and predictive analytics available to everyone in large organizations. Adatao had secured an investment of $13 million last year from a group of investors including Bloomberg Beta, Lightspeed Venture Partners and Andreessen Horowitz.

Alex Giamas
on May 04, 2015
Fabian Hueske on Apache Flink Framework

Apache Flink is a distributed data flow processing system for performing analytics on large data sets. It can be used for real time data streams as well as batch data processing. It supports APIs in Java and Scala programming languages. Fabian Hueske, PMC member of Apache Flink, spoke about the data processing framework at the recent ApacheCon Conference.

Srini Penchikala
on Apr 28, 2015
Hortonworks, IBM and Pivotal to Support Open Data Platform in Their Big Data Solutions

Big data vendors Hortonworks, IBM, and Pivotal recently announced that their Hadoop based platform products will use the common Open Data Platform (ODP). They made the announcement at the recent HadoopSummit Europe Conference of the open platform which includes Apache Hadoop 2.6 (HDFS, YARN, and MapReduce) and Apache Ambari software.

Srini Penchikala
on Apr 24, 2015
Google Enhances Data and Network Services for its Cloud Platform

Google announced the general availability of Cloud DNS, expanded locations for load balancing, additional carrier providers for peering, beta availability of Cloud Dataflow and VPN services

Janakiram MSV
on Apr 18, 2015
Amazon Web Services launches Machine Learning Service

Amazon Web Services have recently launched their Amazon Machine Learning service that allows users to learn predictive models in the cloud. After Google with Prediction API, and Microsoft with Azure Machine Learning, Amazon is the latest major cloud service provider to launch a similar service.

Mikio Braun
on Apr 17, 2015

Newer News

Older News

InfoQ Software Architects' Newsletter

News