AI, ML & Data Engineering Content on InfoQ
-
Databricks Open Sources Delta Lake to Make Data Lakes More Reliable
Databricks recently announced that it is open sourcing Delta Lake, its proprietary storage layer, to bring ACID transactions to Apache Spark and big data workloads. Databricks is the company founded by the creators of Apache Spark, and Delta Lake is already used at several companies, including McAfee and Upwork. Delta Lake addresses the heterogeneous data problem that data lakes often have...
-
Microsoft Open-Sources Approximate Nearest Neighbor Search Algorithm Powering Bing
Microsoft's latest contribution to open source, Space Partition Tree And Graph (SPTAG), is an implementation of the approximate nearest neighbor search (NNS) algorithm used in the Microsoft Bing search engine.
-
QCon San Francisco 2019: Registrations Open & Top Videos from QCon SF 2018
QCon San Francisco, the software conference that attracts attendees from all over the world, returns November 11-13, 2019, for the 13th year. QCon is organized by the people behind InfoQ & is dedicated to providing a platform for innovators & early adopters to share their story in the global epicenters of software development: Beijing, London, New York, Sao Paulo, Shanghai, and San Francisco.
-
Xipeng Shen on a New Technique to Reduce Deep-Learning Training Time
Researchers at North Carolina State University recently presented a paper at the 35th IEEE International Conference on Data Engineering (ICDE 2019) on a new technique that can reduce training time for deep neural networks by up to 69%.
-
Micronaut 1.1 Features Enhanced Support for Building Cloud-Native Applications
During the recent Google Cloud Next conference, Object Computing, Inc. (OCI) announced the release of Micronaut 1.1 featuring support for gRPC, GraphQL, Google Cloud Platform (GCP), RabbitMQ and Amazon Web Services (AWS). There is also a new Bean Introspection API that replaces the JDK Introspector class and new templates for the Micronaut Test project.
-
Investigating Near Misses to Prevent Disasters: QCon London Q&A
Investigating near misses by gathering data from the field and exploring anything that looks wrong or is a bit odd can help to prevent disasters, said Ed Holland, software development manager at Metaswitch Networks. At QCon London 2019 he gave a talk about avoiding being in the news by investigating near misses.
-
Google Releases Google-Landmarks-V2, a Large-Scale Dataset for Landmark Recognition & Retrieval
Google has released Google-Landmarks-v2, an improved dataset for Landmark Recognition & Retrieval, along with Detect-to-Retrieve, a TensorFlow codebase for large-scale instance-level image recognition. Two companion Kaggle competitions based on Google-Landmarks-v2 were also launched. With over 200,000 landmarks in 5 million images, it is the largest landmark dataset ever published.
-
PyTorch 1.1 Release Improves Performance, Adds New APIs and Tools
Facebook AI Research announced the release of PyTorch 1.1. The latest version of the open-source deep learning framework includes improved performance via distributed training, new APIs, and new visualization tools including native support for TensorBoard.
-
Amazon Updates SageMaker Ground Truth with New Labeling Features, Vendor Support and Availability
Amazon announced that SageMaker Ground Truth now offers simplified labeling workflows and support for additional labeling vendors, and is available in the Asia Pacific (Sydney) AWS region, bringing the total to six supported AWS regions in the Americas, Europe, and Asia.
-
Google Launches AI Platform - an End-to-End Platform to Build, Run, and Manage ML Projects
Google has recently launched AI Platform, an end-to-end platform to build, test, and deploy machine learning models. It brings together a host of products and services to help businesses solve complex challenges using AI in a way that is easier and more collaborative.
-
QCon NY (Jun 24-28): New Talks, a Focus on the Skills That Matter & Why You Should Join Us This Year
In Stack Overflow's recent ninth annual survey of over 90,000 software developers, we learned that non-development work remains a productivity challenge for software managers and leaders. At QCon New York, the conference for senior software developers, we have many sessions to help you learn how others have overcome those challenges.
-
Google Scales Weak Supervision to Overcome Labeled Dataset Problem
Google recognizes that the need for labeled data in machine learning (ML) is a significant bottleneck and recently adapted the open-source Snorkel framework to overcome the problem at scale. Google enhanced Snorkel by integrating it with TensorFlow, using the file system for sharing data instead of a database, and creating separate executables for labeling functions.
-
Teaching the Computer to Play the Chrome Dinosaur Game with TensorFlow.js Machine Learning Library
A simple yet entertaining application of machine learning, also useful for educational purposes, was recently published on Fritz's HeartBeat Medium publication. It uses Google's TensorFlow.js machine learning library in the browser to teach the computer to play the Chrome Dinosaur Game.
-
Microsoft Releases High-Performance C# and F# Support for Apache Spark
Microsoft announced the release of .NET for Apache Spark, adding new high-performance C# and F# bindings to the big-data computation engine.
-
Salesforce Adds Intelligence to its Einstein Services Offering
In a recent press release, Salesforce announced additions to its Einstein platform that aim to bring AI to Salesforce developers and admins through a low-code, point-and-click, configuration-based approach. The recent additions to the platform include Einstein Translation and Einstein Optical Character Recognition (OCR).