InfoQ Homepage AI, ML & Data Engineering Content on InfoQ
-
IBM Brings Watson to IoT
IBM has inaugurated the IoT Global Headquarters and will use the Watson technology to analyze and interpret IoT data.
-
Eight Dragons of Agile Measurement
Larry Maccherone, Director of Analytics and Research at AgileCraft and frequent speaker at agile conferences like QCon, gave a webinar in which he discusses the major risks and challenges when introducing metrics in an agile environment. Risks are referred as "dragons" and the techniques to get rid of them as "slayers".
-
Haskell Can Now Do Strict Evaluation by Default
A new Strict language extension to Haskell aims to make it easier to use Haskell for code that is meant to be mostly strict, i.e., evaluated in a non-lazy manner. The feature was recently merged into GHC’s git HEAD and will be included in GHC’s next release.
-
MongoDB Hits 3.2 and Becomes Enterprise Ready
MongoDB recently announced the newest version of its NoSQL database synonymous product. Building upon the new features introduced in 3.0 release, 3.2 is expanding and solidifying MongoDB’s interest towards the corporate world.
-
IBM Commits to Advance Apache Spark
Earlier last month in Las Vegas, at IBM Insight 2015, IBM announced a major commitment to the Apache Spark project. Referring to it as “potentially the most significant open source project of the next decade” tells a lot about how important IBM believes Apache Spark is. With IDC reporting that 80% of cloud applications in the future will be data intensive, Apache Spark can unlock previously...
-
What Is New on ThoughtWorks Radar Nov 2015
ThoughtWorks has published their radar for the end of 2015, covering technologies in four areas: Languages & Frameworks, Platforms, Techniques, and Tools.
-
DMTK, a Machine Learning Toolkit from Microsoft
About the same time Google announced open sourcing TensorFlow, Microsoft has pushed to GitHub DMTK, a Distributed Machine Learning Toolkit. While Google has released a one-machine version of TensorFlow, DMTK runs on a cluster of machines.
-
TensorFlow: Google Open Sources Their Machine Learning Tool
TensorFlow is a machine learning library created by the Brain Team researchers at Google and now open sourced under the Apache License 2.0. TensorFlow is detailed in the whitepaper TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems. The source code can be found on Google Git.
-
Teradata Announces New Software for Real-Time Analysis of Internet of Things Data
At its 2015 Partners User Group Conference, Teradata announced two new software capabilities for real-time ingestion and analysis of massive streams of IoT data. While the Teradata Listener software enables "listening" to multiple, diverse IoT data streams in real time, the new Teradata Aster Analytics on Hadoop software provides scalable analysis of massive IoT data streams.
-
Neo4j Launches Open Source Graph Query Language openCypher
Neo4j Graph NoSQL database team launches open source graph query language called openCypher. Neo Technology, the company behind the graph database, announced last week at GraphConnect Conference, the launch of the open source project that will be available to technology providers as a common language for querying graph data.
-
Neo4j 2.3 Supports In-Memory Page Cache, Docker Tools and IBM POWER8 Integration
Latest version of Graph NoSQL database Neo4j supports in-memory page cache, Docker tools, enhanced query planner and IBM POWER8 integration. Neo4j team announced last week the release of version 2.3 which also supports query development with graph and text string search.
-
Basho Technologies Releases Riak TS NoSQL Database to Store and Analyze Time Series Data
Basho Technologies releases Riak TS distributed NoSQL database to store and analyze time series data. Basho team recently announced at AWS re:invent event, the availability of Riak TS which is optimized for reads and writes of time series data.
-
LinkedIn Open Sources PalDB, a Read-only Key-value Store
LinkedIn has open sourced PalDB, an embeddable read-only key value store, 8 times faster than LevelDB and taking several times less memory than a hashset.
-
Couchbase 4.0 Supports Multi-dimensional Scaling, Geospatial Indexes and New Query Language N1QL
Latest version of Document NoSQL database Couchbase supports multi-dimensional scaling, geospatial indexes and new query language called N1QL. Couchbase team announced earlier this month the general availability of Couchbase Server 4.0 version which also supports new filtering capabilities on Cross Datacenter Replication (XDCR) and enhanced security.
-
DistributedLog at Twitter for High Performance Logging
Twitter is using replicated logs for high performance data collection and analysis of its systems. DistributedLog is the system developed at Twitter for this purpose. Twitter has developed a distributed key-value database, Manhattan. Manhattan can trade consistency for latency in reads following the eventually consistent data model. We examine Twitter's design and tradeoffs for DistributedLog.