InfoQ Homepage Infrastructure Content on InfoQ
-
No Starch Press' Python Playground Review and Q&A with the Author
No Starch Press’ Python Playground provides a fairly large and varied collection of projects that aim to show how Python can be used in such different contexts as creating ASCII art, birds simulation, interfacing to Raspberry Pi, and more. InfoQ has spoken to the book’s author, Mahesh Venkitachalam.
-
Health Informatics and Survival Prediction of Cancer with Apache Spark Machine Learning Library
In this article, author discusses the survival prediction of colorectal cancer as a multi-class classification problem and how to solve that problem using the Apache Spark's MLlib Java API.
-
Key Takeaway Points and Lessons Learned from QCon San Francisco 2015
This article summarizes the key takeaways and highlights from QCon San Francisco 2015 as blogged and tweeted by QCon's 1,300 attendees. Over the course of the next 4 months, InfoQ will be publishing most of the conference sessions online, including 10 video interviews that were recorded by the InfoQ editorial team.
-
Data Lake-as-a-Service: Big Data Processing and Analytics in the Cloud
Data Lake-as-a-Service solutions provide big data processing in the cloud for faster business outcomes in a very cost effective way. InfoQ spoke with Lovan Chetty and Hannah Smalltree from Cazena team about how Data Lake as a Service works.
-
Real-time Data Processing in AWS Cloud
In this article, author Oleksii Tymchenko discusses a bio-informatic software as a service (SaaS) product called Chorus, which was built as a public data warehousing and analytical platform for mass spectrometry data. Other features of the product include real-time visualization of raw mass-spec data.
-
Scaling Docker with Kubernetes V1
Find out what's new in Kubernetes V1 with a Jenkins example in Google Container Engine (and locally with Vagrant). Kubernetes V1 brings enterprise-level capabilities such as self healing, service discovery, dynamic DNS, resource quotas, centralized logging, network isolation, resource quotas. In short, Kubernetes V1 makes management of Docker fleets easier.
-
Oozie Plugin for Eclipse
Oozie Eclipse plugin is a new tool for editing Apache Oozie workflows graphically inside Eclipse. Usage of this plugin allows to skip hard to develop and maintain process definition in HPDL. Instead a process graph is defined graphically by placing process actions on pallet and connecting them. An article introduces Eclipse Oozie plugin and provides an example of its usage.
-
Big Data Solutions with MS SQL ColumnStore Index
Columnar data storage can offer significant performance improvements over the way database tables are traditionally stored, but they aren’t always faster. Aleksandr Shavlyuga explores the power, and limitations of SQL Server’s ColumnStore Indexes.
-
Packet Inspection for Unauthorized OS Detection in Enterprises
The authors discuss an approach that uses TCP SYN packets for OS fingerprinting to detect the presence of unauthorized OSs in an enterprise.
-
The Estimation Game - Techniques for Informed Guessing
In this article, author Carlos Bueno discusses the strategies for estimating the server capacity for big data projects and initiatives, with the help of two case studies.
-
Machine Learning and Cognitive Computing
Based on a webinar on analytics, this article covers the topics of machine learning and cognitive computing, and how these fields are related to artificial intelligence (AI). Panelists discuss how this technology is being applied in digital marketing space and what concerns organizations have in providing machine learning services.
-
Article Series: Patterns of DevOps Culture
Healthy organizations exhibit similar patterns of behavior, organization and improvement efforts. In this series we explore some of those patterns through testimonies from their practitioners and through analysis by consultants in the field who have been exposed to multiple DevOps adoption initiatives.