InfoQ Homepage AI, ML & Data Engineering Content on InfoQ
-
Amazon re:invent roundup
Amazon announced a number of new services at the recent re:invent conference in Las Vegas: Amazon WorkSpaces - Desktop Computing in the Cloud, Identity and Access Management using SAML, Amazon AppStream - Delivering Streaming Applications from the Cloud, Amazon Kinesis - Streaming Big Data, CloudTrail - Capturing AWS API Activity, Postgres support in RDS and new EC2 instance types
-
Increasing Pace of Change Drives Agile In Enterprise Applications
The pace of organizational change and technology adoption is increasing which means that enterprise software development needs to find ways to keep pace with these changes. The rise of big data is also driving the need to undertake many experiment and adapt rapidly. Blogger Matt Asay recently wrote about this in a post titled "Hey, Enterprise Developers! Get Agile Or Get Steamrollered"
-
Streaming Big Data With Amazon Kinesis
Amazon recently announced Kinesis, a service that allows developers to stream large amounts of data from different sources and process it. The service is currently in limited preview.
-
Cascading 2.5 Supports Hadoop 2
New version of Cascading released this week incorporates Hadoop 2 support and includes Cascading Lingual - an open source project that provides a comprehensive ANSI SQL interface for accessing Hadoop-based data
-
Presto: Facebook’s Distributed SQL Query Engine
Facebook has open-sourced Presto, their distributed SQL query engine. Presto uses a pipelined architecture rather than the Map/Reduce design found elsewhere. In production since early this year, Facebook has since “deployed in multiple geographical regions and [they] have successfully scaled a single cluster to 1,000 nodes”.
-
SQL Makes a Comeback through NewSQL
New database developments indicate a return to SQL, but not by running the traditional relational stores on bigger and better hardware, not even on sharded architectures, but through NewSQL solutions.
-
Current Trends in NoSQL - Q&A with Peter Bell
Peter Bell shares insights on the latest trends in NoSQL, a rapidly evolving category of database storage that covers a wide variety of solutions.
-
AnyPresence Soups up Enterprise MBaaS Platform- Part 1 of 2
Mobile Backend as a Service provider AnyPresence continues to hone their chops. Launching the fifth update to their self-titled platform geared for the enterprise. Co-founder Rich Mendis provides some insights for InfoQ readers…
-
NuoDB Blackbirds 2.0 Gets Geo-distributed Capability
NuoDB has announced version 2.0 of their NewSQL database, now a globally distributed database that can run in the cloud or on premises with real-time replication.
-
Java In-Memory Data Grid Hazelcast 3.0 Supports Continuous Queries and Entry Processing
Latest version of open source Java In-Memory Data Grid Hazelcast supports entry processing, multi-thread execution, continuous queries and lazy indexing. They have also re-implemented all of the existing distributed objects like map, queue, executor service using Service Provider Interface (SPI).
-
Introducing SQL Server 2014's New Clustered Columnstore Indexes
SQL Server 2014 will offer Clustered Columnstore Indexes. These will offer the performance and compression benefits of column-oriented storage without the need to restrict the underlying table to read-only access.
-
Java In-Memory Grid Hazelcast gets VC Funding from Bain Capital
Open source in-memory data grid company Hazelcast received funding from Bain Capital Ventures. Joining the board of the company are Rod Johnson (founder of SpringSource), Salil Deshpande (a prominent venture capitalist who focuses heavily on open source and software infrastructure) and Ali Kutay (former CEO of WebLogic).
-
RavenDB 2.5 with Dynamic Aggregation and Query Streaming
A stable version 2.5 of the document database RavenDB has been released with dynamic aggregation allowing for complex queries and an Unbounded results API using query streaming to retrieve large result sets in a single request.
-
QuantCell Research Announces First Public Beta of their Java-Aware Big-Data Spreadsheet
Big Data analytics startup QuantCell Research has announced the release of the first public beta of what they are positioning as their "Big Data" spreadsheet.
-
Best Practices for Amazon EMR
In his new whitepaper, Best Practices for Amazon EMR, Parviz Deyhim outlines the best practices in using AWS EMR including moving data to AWS, strategies for collecting, compressing, aggregating the data, and common architectural patterns for setting up and configuring Amazon EMR clusters for processing.