AI, ML & Data Engineering Content on InfoQ
-
QCon SF: Large Scale Search and Ranking Systems at Netflix
Moumita Bhattacharya spoke at QCon SF 2024 about state-of-the-art search and ranking systems. She gave an overview of the typical structure of these systems and followed with a deep dive into how Netflix created a single combined model to handle both tasks.
-
QCon SF: Using Metaflow to Support Diverse ML Systems at Netflix
At QCon SF 2024, David Berg and Romain Cledat gave a talk about how Netflix uses Metaflow, an open-source framework, to support a variety of ML systems. The pair gave an overview of Metaflow's design principles and illustrated several of Netflix's use cases, including media processing, content demand modeling, and meta-models for explaining models.
-
How Uber Sped up SQL-based Data Analytics with Presto and Express Queries
Uber uses Presto, an open-source distributed SQL query engine, to provide analytics across several data sources, including Apache Hive, Apache Pinot, MySQL, and Apache Kafka. To improve Presto's performance, Uber engineers explored giving quick queries, known as express queries, dedicated handling and found they could improve both Presto utilization and response times.
-
Google Introduces Gemini AI Features to Android Studio
Google has released a set of updates to Gemini in Android Studio, aiming to enhance developer productivity through AI-powered features. This release is designed to bring AI to every stage of the development lifecycle, with features for AI-assisted coding, refactoring, generating documentation, analyzing and testing code, and suggesting fixes.
-
Meta Releases NotebookLlama: Open-Source PDF to Podcast Toolkit
Meta has released NotebookLlama, an open-source toolkit designed to convert PDF documents into podcasts, providing developers with a structured, accessible PDF-to-audio workflow. As an open-source alternative to Google’s NotebookLM, NotebookLlama guides users through a four-step process that converts PDF text into audio content.
-
GitHub Universe 2024 Unveils AI Innovations and Developer-Centric Tools
GitHub Universe 2024 unveiled groundbreaking updates emphasizing developer autonomy and AI capabilities. With multi-model support for Copilot, the introduction of AI-driven GitHub Spark, enhanced security features, and improved workflows in popular IDEs, GitHub aims to democratize coding and empower developers, regardless of skill level, to harness the full potential of artificial intelligence.
-
Native Vector Support in Azure SQL Database in Public Preview
Azure SQL Database now supports native vector storage and processing, streamlining AI development by integrating vector search with SQL queries. This update simplifies database management, enhances data analysis, and boosts performance by eliminating data movement. Ideal for diverse applications, it empowers sectors like e-commerce and healthcare with advanced, context-aware functionalities.
-
Amazon Q Adds Inline Chat Support for Visual Studio Code and JetBrains IDEs
AWS recently announced that Amazon Q Developer now supports inline chat, enabling developers to discuss and resolve code issues directly within the editor. The feature combines the benefits of in-IDE chat with the ability to update code directly. It is powered by Claude 3.5 and available in Visual Studio Code and JetBrains IDEs.
-
Hugging Face Launches SmolTools: Practical AI Apps Powered by SmolLM2 Model
Hugging Face has introduced SmolTools, a set of applications built on the recently launched SmolLM2 model, a compact 1.7-billion parameter language model. SmolTools includes specialized tools for summarization, rewriting, and task automation, bringing efficient AI functionality to a broader range of users.
-
Google Debuts OpenAI-compatible API for Gemini
Google has launched a new endpoint for its Gemini API that is compatible with the OpenAI API, making it easier for developers who adopted OpenAI for their LLM-based solutions to switch to Gemini. The new endpoint is still in beta and provides only partial coverage of OpenAI capabilities.
-
Anthropic Releases New Claude Models and Computer Use Feature
Anthropic released two new models: Claude 3.5 Haiku and an improved version of Claude 3.5 Sonnet. They also released a new feature for Claude 3.5 Sonnet that allows the model to interact with a computer's GUI the same way a human user does.
-
Decart and Etched Release Oasis, a New AI Model Transforming Gaming Worlds
Decart.ai and Etched.ai recently introduced Oasis, an AI-driven model that generates a fully interactive, real-time open-world experience inspired by Minecraft.
-
OSI Releases New Definition for Open Source AI, Setting Standards for Transparency and Accessibility
The Open Source Initiative (OSI) released Version 1.0 of its Open Source AI Definition (OSAID) after two years of development with contributions from global experts. The OSAID sets criteria defining open-source AI, aiming to bring clarity to the concept and establish benchmarks for transparency and accessibility in AI.
-
Grab Employs LLMs for Conversational Data Discovery with GPT-4, Glean and Slack
Grab responded to the challenge of finding valuable datasets among 200k+ tables by enhancing Hubble, its data discovery tool, with new capabilities that leverage GenAI technologies. The company shortened the data discovery process by using LLMs to generate dataset documentation and created a Slack bot to bring effective data discovery to data consumers.
-
Amazon SageMaker JumpStart Expands Portfolio with Bria AI's Text-to-Image Models
Amazon Web Services has integrated Bria AI's latest text-to-image foundation models into Amazon SageMaker JumpStart, marking a significant expansion of its enterprise-grade generative AI capabilities. The addition includes three variants: Bria 2.3, Bria 2.2 HD, and Bria 2.3 Fast, each designed to address specific enterprise needs in visual content generation.