InfoQ Homepage AI, ML & Data Engineering Content on InfoQ
-
Google Debuts OpenAI-compatible API for Gemini
In an effort to make it easier for developers who adopted OpenAI for their LLM-based solutions to switch to Gemini, Google has launched a new endpoint for its Gemini API that allows them to easily switch from one service to the other. The new endpoint is still in beta and provides only partial coverage of OpenAI capabilities.
-
Anthropic Releases New Claude Models and Computer Use Feature
Anthropic released two new models: Claude 3.5 Haiku and an improved version of Claude 3.5 Sonnet. They also released a new feature for Claude 3.5 Sonnet that allows the model to interact with a computer's GUI the same way a human user does.
-
Decart and Etched Release Oasis, a New AI Model Transforming Gaming Worlds
Decart.ai and Etched.ai recently introduced Oasis, an AI-driven model that generates a fully interactive, real-time open-world experience inspired by Minecraft.
-
OSI Releases New Definition for Open Source AI, Setting Standards for Transparency and Accessibility
The Open Source Initiative (OSI) released Version 1.0 of its Open Source AI Definition (OSAID) after two years of development with contributions from global experts. The OSAID sets criteria defining open-source AI, aiming to bring clarity to the concept and establish benchmarks for transparency and accessibility in AI.
-
Grab Employs LLMs for Conversational Data Discovery with GPT-4, Glean and Slack
Grab responded to the challenges of finding valuable datasets among 200k+ tables by enhancing Hubble, the data discovery tool, with new capabilities leveraging GenAI technologies. The company reduced the data discovery process by incorporating LLMs to generate dataset documentation and created a Slack bot to bring effective data discovery to data consumers.
-
Amazon SageMaker JumpStart Expands Portfolio with Bria AI's Text-to-Image Models
Amazon Web Services has integrated Bria AI's latest text-to-image foundation models into Amazon SageMaker JumpStart, marking a significant expansion of its enterprise-grade generative AI capabilities. The addition includes three variants - Bria 2.3, Bria 2.2 HD, and Bria 2.3 Fast, each designed to address specific enterprise needs in visual content generation.
-
GitHub and Google Cloud Collaborate to Bring Gemini 1.5 Pro to GitHub Copilot
GitHub's partnership with Google Cloud brings Gemini 1.5 Pro to GitHub Copilot, revolutionizing development with AI that processes two million tokens. This natively multimodal tool excels at code generation, analysis, and optimization, empowering developers to effortlessly manage extensive codebases in platforms like Visual Studio Code.
-
xAI Unveils a New API Service for Grok Models
Elon Musk’s xAI has launched a public beta for its API service, enabling developers to integrate xAI's large language models (LLMs) into their applications.
-
OpenAI Releases ChatGPT Search Feature
OpenAI recently released ChatGPT Search which allows ChatGPT to search the web when answering user questions. Instead of being limited to knowledge available at the time of training, ChatGPT can now incorporate current information from the web and include links to its sources.
-
Meta MobileLLM Advances LLM Design for On-Device Use Cases
With MobileLLM, Meta researchers aim to show that, for smaller models, quality is not a direct product of how many billions parameters they have; rather, it is the result of carefully designing their architecture. To prove their point, they coupled deep and thin architectures with embedding sharing and grouped-query attention mechanisms to improve accuracy over prior state-of-the-art models.
-
Microsoft Introduces Vector Data Abstractions Library for .NET
On October 29th 2024, Microsoft released Microsoft.Extensions.VectorData.Abstractions library for .NET in preview. It makes it easier to integrate .NET solutions with the AI Semantic Kernel SDK, using abstractions over concrete AI implementations and models.
-
Meta AI Introduces Thought Preference Optimization Enabling AI Models to Think before Responding
Researchers from Meta FAIR, the University of California, Berkeley, and New York University have introduced Thought Preference Optimization (TPO), a new method aimed at improving the response quality of instruction-fine tuned LLMs.
-
PostgreSQL 17 Released with Improved Vacuum Process and Performance Gains
The PostgreSQL Global Development Group recently announced the general availability of PostgreSQL 17, the latest version of the popular open-source database. This release focuses on performance improvements, including a new memory management implementation for vacuum, storage access optimizations, and enhancements for high-concurrency workloads.
-
Meta Spirit LM Integrates Speech and Text in New Multimodal GenAI Model
Presented in a recent paper, Spirit LM enables the creation of pipelines that mixes spoken and written text to integrate speech and text in the same multimodal model. According to Meta, their novel approach, based on interleaving text and speech tokens, makes it possible to circumvent the inherent limitations of prior solutions that use distinct pipelines for speech and text.
-
PyTorch 2.5 Release Includes Support for Intel GPUs
The PyTorch Foundation recently released PyTorch version 2.5, which contains support for Intel GPUs. The release also includes several performance enhancements, such as the FlexAttention API, TorchInductor CPU backend optimizations, and a regional compilation feature which reduces compilation time. Overall, the release contains 4095 commits since PyTorch 2.4.