AI, ML & Data Engineering Content on InfoQ

  • Adatao Launches Full Stack Data Intelligence Platform

    Adatao recently announced the general availability of its Data Intelligence platform, which aims to make data analysis and predictive analytics available to everyone in large organizations. Adatao secured an investment of $13 million last year from a group of investors including Bloomberg Beta, Lightspeed Venture Partners and Andreessen Horowitz.

  • Fabian Hueske on Apache Flink Framework

    Apache Flink is a distributed data flow processing system for performing analytics on large data sets. It can be used for real-time data streams as well as batch data processing, and it provides APIs in Java and Scala. Fabian Hueske, PMC member of Apache Flink, spoke about the data processing framework at the recent ApacheCon conference.

  • Hortonworks, IBM and Pivotal to Support Open Data Platform in Their Big Data Solutions

    Big data vendors Hortonworks, IBM, and Pivotal recently announced that their Hadoop-based platform products will use the common Open Data Platform (ODP). The announcement was made at the recent Hadoop Summit Europe conference. The open platform includes Apache Hadoop 2.6 (HDFS, YARN, and MapReduce) and Apache Ambari software.

  • Google Enhances Data and Network Services for its Cloud Platform

    Google announced the general availability of Cloud DNS, expanded locations for load balancing, additional carrier providers for peering, and beta availability of Cloud Dataflow and VPN services.

  • Amazon Web Services launches Machine Learning Service

    Amazon Web Services has recently launched its Amazon Machine Learning service, which allows users to train predictive models in the cloud. After Google with Prediction API and Microsoft with Azure Machine Learning, Amazon is the latest major cloud service provider to launch such a service.

  • Twitter Cuts Off Firehose Partner DataSift

    Twitter recently announced that it has cut off its firehose data distributor DataSift. This move echoes Twitter's controversial 2012 API changes, which restricted the Twitter client ecosystem. There is much speculation as to whether this latest announcement is an attempt to control the tweet analytics space and whether this is behaviour befitting a platform provider.

  • Microsoft Announces General Availability of Azure DocumentDB

    Azure DocumentDB, Microsoft’s NoSQL cloud database service, is now generally available.

  • Meteor 1.1: Now with Support for Windows and MongoDB 3.0

    Matt DeBergalis has released version 1.1 of Meteor, announcing support for both Windows and MongoDB 3.0. The 1.1 release "adds first-class official support" for developing applications on Microsoft Windows, DeBergalis said, adding that it is the start of "a commitment to developers on the Microsoft platform."

  • Apache HBase Hits 1.0

    After three developer previews, six release candidates, and over 1500 closed tickets, the Apache Software Foundation has announced version 1.0 of Apache HBase, a NoSQL database in the Hadoop ecosystem. After more than 7 years of active development, the team behind HBase felt that the project had matured and stabilized enough to warrant a 1.0 version.

  • Real-time Data Analytics at Pinterest using MemSQL and Spark Streaming

    Pinterest, the company behind the visual bookmarking tool that helps you discover and save creative ideas, is using real-time data analytics for data-driven decision making. It is experimenting with MemSQL and Spark technologies to analyze user engagement across the globe in real time.

  • Microsoft Acquires Revolution Analytics

    Microsoft increased its foothold in the data science community last winter by acquiring Revolution Analytics, a major provider of software and services based on the open-source R project for computational statistics. The deal is expected to bring R capabilities to the Microsoft suite of products and facilitate the adoption of R-based solutions in the enterprise environment.

  • Apache Spark 1.3 Released, Data Frames, Spark SQL, and MLlib Improvements

    The Apache Spark project has released version 1.3. The main improvements are the addition of the DataFrames API, a more mature Spark SQL, a number of new methods in the machine learning library MLlib, and better integration of Spark Streaming with Apache Kafka.

  • MongoDB 3.0 - WiredTiger Storage Engine and Updated MMS

    Some time ago, when MongoDB 2.6 was released, Kelly Stirman, Director of Products at MongoDB, answered our questions regarding that release. Now, with MongoDB 3.0 announced for March and MongoDB 3.0 RC-8 already available, it’s time to see in more detail what the WiredTiger storage engine, the new and improved MMS, and storage compression can bring to NoSQL users.

  • Advancing The Realtime Web With RethinkDB

    RethinkDB is an open-source distributed database built to store JSON and scale with very little effort. Comparing itself with MongoDB, RethinkDB aims to be developer-friendly while maintaining an operations-oriented focus on high availability and high scale ... a way to subscribe to change notifications from the database. A client can subscribe to changes in a table and get notified

  • Pivotal Open Sources Their Big Data Suite

    Pivotal has decided to open source the core components of its Big Data Suite and has announced the Open Data Platform, an initiative promoting open source and standardization for Big Data.
