Facilitating the Spread of Knowledge and Innovation in Professional Software Development

AI, ML & Data Engineering Content on InfoQ

  • MLConf NYC 2014 Highlights

    MLConf took place in NYC on April 11th, a full day packed with talks on Machine Learning and Big Data, featuring speakers from many prominent companies.

  • NoSQL, JSON, and Time Series Data Management: Interview with Anuj Sahni

    Time series data management is gaining more attention lately because the data is coming at us from all directions: sensors, mobile devices, Web tracking, financial events, factory automation, and utilities. InfoQ spoke with Anuj Sahni, Principal Product Manager at Oracle, about time series data and how to model this type of data.

  • SQL Server 2014: NoSQL Speeds with Relational Capabilities

    For the last four years Microsoft has been working on the first rewrite of SQL Server’s query execution since 1998. The goal is to offer NoSQL-like speeds without sacrificing the capabilities of a relational database. At the heart of this is Hekaton, their memory optimized tables. While still accessible via traditional T-SQL operations, internally they are a fundamentally different technology.

  • Lambda Architecture: Design Simpler, Resilient, Maintainable and Scalable Big Data Solutions

    Lambda Architecture proposes a simpler, elegant paradigm designed to store and process large amounts of data. In this article, author Daniel Jebaraj presents the motivation behind the Lambda Architecture and reviews its structure with the help of a sample Java application.

  • Embedded Analytics and Statistics for Big Data

    This article provides an overview of tools and libraries available for embedded data analytics and statistics, both stand-alone software packages and programming languages with statistical capabilities. The authors also discuss how to combine and integrate these embedded analytics technologies to handle big data.

  • Big Data Analytics for Security

    In this article, the authors discuss the role of big data and Hadoop in the security analytics space and how to use MapReduce to process data efficiently for security analysis, in use cases such as Security Information and Event Management (SIEM) and fraud detection.

  • Building Applications With Hadoop

    When building applications using Hadoop, it is common to have input data from various sources arriving in various formats. In his presentation, “New Tools for Building Applications on Apache Hadoop”, Eli Collins gives an overview of how to build better products with Hadoop and of tools that can help, such as Apache Avro, Apache Crunch, Cloudera ML and the Cloudera Development Kit.

  • Preparing for Your First MongoDB Deployment: Backup and Security

    This article focuses on database backup tools and security policies for deploying MongoDB NoSQL databases. It covers topics such as cloud backups with MongoDB Management Service (MMS), authentication, and authorization.

  • Building a Real-time, Personalized Recommendation System with Kiji

    Jon Natkins explains in this article how to create a personalized recommendation system fed with large amounts of real-time data using Kiji, which leverages HBase, Avro, Map-Reduce and Scalding.

  • Cassandra CLI Internals Using JArchitect

    Cassandra CLI is a useful tool for Cassandra administrators. It is a good example of how to implement a Cassandra client, and understanding its internals helps us develop custom Cassandra clients or even extend the CLI tool. In this article, the author explores the Cassandra CLI architecture model, using the JArchitect tool and the CQLinq language to analyze its code base.

  • Don’t jump the SQL ship just yet

    The SQL language has been evolving steadily over the last two decades. At the same time, the verbosity of the JDBC API in Java client code and the lack of first-class SQL support within the Java language have led to the introduction of ORMs such as Hibernate, which was later standardised into JPA and the Criteria API. If SQL and JPA are diverging, where will our data interaction patterns go?

  • Building a RESTful Web Service with Spring Boot to Access Data in an Aerospike Cluster

    Spring Boot allows you to build Spring based applications with little effort on your part. Aerospike is a distributed and replicated in-memory database that is ACID compliant. This article will take you through creating a simple RESTful web service with Spring Boot and Aerospike.
