InfoQ Homepage AI, ML & Data Engineering Content on InfoQ
-
Apple Adds Core ML Support for Stable Diffusion on Apple Silicon
Apple released a set of optimizations to Core ML to enable running the Stable Diffusion text-to-image model on Apple Silicon-powered devices running the latest iOS or macOS versions, respectively iOS 16.2 and macOS 13.1.
-
Open Source SkyPilot Targets Cloud Cost Optimization for ML and Data Science
A team of researchers at the RISELab at UC Berkeley recently released Skypilot, an open-source framework for running machine learning workloads on the major cloud providers through a unified interface. The project focuses on cost optimization automatically finding the cheapest availability zone, region, and provider for the requested resources.
-
AWS Announces DataZone, a New Data Management Service to Govern Data
At AWS re:Invent, Amazon Web Services announced Amazon DataZone, a new data management service that makes it faster and easier for customers to catalog, discover, share, and govern data stored across AWS, on-premises, and third-party sources.
-
Google's Code-as-Policies Lets Robots Write Their Own Code
Researchers from Google's Robotics team have open-sourced Code-as-Policies (CaP), a robot control method that uses a large language model (LLM) to generate robot-control code that achieves a user-specified goal. CaP uses a hierarchical prompting technique for code generation that outperforms previous methods on the HumanEval code-generation benchmark.
-
Galactica: Large Language Model for Scientific Knowledge
Meta AI and Papers with Code recently released Galactica, a 120-billion-parameter scientific-language model which can search and summarize academic literature, solve math problems, and write scientific code. Galactica’s architecture is based on a transformer, an attention mechanism which draws global dependencies between input and output.
-
Neo4j 5 Graph Database Supports Automated Scaling and Deployment Interoperability
Graph NoSQL database Neo4j version 5 supports query language and performance improvements, automated scale-up and scale-out capabilities, and cross-platform deployment interoperability. Neo4j team announced last week the general availability (GA) of latest product release of the graph database.
-
Google Open-Sources Secure ML Operating System KataOS
Google's AmbiML team recently open-sourced KataOS, a provably secure operating system for embedded ML hardware. KataOS is based on the seL4 microkernel and is implemented in Rust. Along with KataOS, Google is releasing Sparrow, a reference implementation of the operating system targeted for a secure hardware platform based on the RISC-V architecture.
-
Instagram Notification Management Now Uses Causal Inference Instead of Click-through Rate
Instagram has changed the way of creating notifications, moving from CTR to causal inference and ML models to identify highly active users who are likely to receive the notifications.
-
AWS Researchers Propose a Method That Predicts Bias in Face Recognition Models Using Unlabeled Data
AWS researchers presented a novel method for evaluating bias in face recognition algorithms, which does not require data with identity annotations. The tests show that, despite the method's limitation to estimating a model's performance on data from various demographic groups, those estimates are reliable enough to identify performance discrepancies that are indicative of bias.
-
First Open Source Copyright Lawsuit Challenges GitHub Copilot
A class-action lawsuit has been filed in a US federal court challenging the legality of GitHub Copilot and the related OpenAI Codex. The suit against GitHub, Microsoft, and OpenAI claims violation of open-source licenses and could have a wide impact in the world of artificial intelligence.
-
Applying Machine Learning for Business Outcomes at Travelopia
Travelopia changed its focus from a technology approach to business outcomes, and adapted agile and lean for delivering machine learning solutions. This enabled them to deliver machine-learning business models faster and better.
-
Salesforce Open-Sources Language-Vision AI Toolkit LAVIS
Salesforce Research recently open-sourced LAnguage-VISion (LAVIS), a unified library for deep-learning language-vision research. LAVIS supports more than 10 language-vision tasks on 20 public datasets and includes pre-trained model weights for over 30 fine-tuned models.
-
Meta Announces Next Generation AI Hardware Platform Grand Teton
Meta recently announced Grand Teton, their next-generation hardware platform for AI training. Grand Teton features several improvements over the previous generation, including 2x the network bandwidth and 4x the host-to-GPU bandwidth.
-
Uber Freight Near-Real-Time Analytics Architecture
Uber Freight is the Uber platform dedicated to connecting shippers with carriers. Providing reliable service to shippers is crucial for Uber Freight. This is why the Carrier Scorecard was developed, with several metrics including on-time pickup/delivery, tracking automation, and late cancellations.
-
Unraveling Techno-Solutionism: How I Fell out of Love with “Ethical” Machine Learning
At the recent QCon San Francisco conference, Katherine Jarmul gave a talk on unravelling techno-solutionism, in which she explored the inherent bias in AI training datasets, the bias that assumes there will be a technical solution to almost any problem and that those technical solutions will be beneficial for mankind. She posed questions for technologists to consider when building products.