InfoQ Homepage AI, ML & Data Engineering Content on InfoQ

News

RSS Feed

Newer Older

AI, ML & Data Engineering

Google Debuts OpenAI-compatible API for Gemini

In an effort to make it easier for developers who adopted OpenAI for their LLM-based solutions to switch to Gemini, Google has launched a new endpoint for its Gemini API that allows them to easily switch from one service to the other. The new endpoint is still in beta and provides only partial coverage of OpenAI capabilities.

Sergio De Simone
on Nov 12, 2024
AI, ML & Data Engineering

Anthropic Releases New Claude Models and Computer Use Feature

Anthropic released two new models: Claude 3.5 Haiku and an improved version of Claude 3.5 Sonnet. They also released a new feature for Claude 3.5 Sonnet that allows the model to interact with a computer's GUI the same way a human user does.

Anthony Alford
on Nov 12, 2024
AI, ML & Data Engineering

Decart and Etched Release Oasis, a New AI Model Transforming Gaming Worlds

Decart.ai and Etched.ai recently introduced Oasis, an AI-driven model that generates a fully interactive, real-time open-world experience inspired by Minecraft.

Daniel Dominguez
on Nov 10, 2024
AI, ML & Data Engineering

OSI Releases New Definition for Open Source AI, Setting Standards for Transparency and Accessibility

The Open Source Initiative (OSI) released Version 1.0 of its Open Source AI Definition (OSAID) after two years of development with contributions from global experts. The OSAID sets criteria defining open-source AI, aiming to bring clarity to the concept and establish benchmarks for transparency and accessibility in AI.

Robert Krzaczyński
on Nov 07, 2024
Architecture & Design

Grab Employs LLMs for Conversational Data Discovery with GPT-4, Glean and Slack

Grab responded to the challenges of finding valuable datasets among 200k+ tables by enhancing Hubble, the data discovery tool, with new capabilities leveraging GenAI technologies. The company reduced the data discovery process by incorporating LLMs to generate dataset documentation and created a Slack bot to bring effective data discovery to data consumers.

Rafal Gancarz
on Nov 07, 2024
AI, ML & Data Engineering

Amazon SageMaker JumpStart Expands Portfolio with Bria AI's Text-to-Image Models

Amazon Web Services has integrated Bria AI's latest text-to-image foundation models into Amazon SageMaker JumpStart, marking a significant expansion of its enterprise-grade generative AI capabilities. The addition includes three variants - Bria 2.3, Bria 2.2 HD, and Bria 2.3 Fast, each designed to address specific enterprise needs in visual content generation.

Vinod Goje
on Nov 06, 2024
AI, ML & Data Engineering

GitHub and Google Cloud Collaborate to Bring Gemini 1.5 Pro to GitHub Copilot

GitHub's partnership with Google Cloud brings Gemini 1.5 Pro to GitHub Copilot, revolutionizing development with AI that processes two million tokens. This natively multimodal tool excels at code generation, analysis, and optimization, empowering developers to effortlessly manage extensive codebases in platforms like Visual Studio Code.

Robert Krzaczyński
on Nov 06, 2024
AI, ML & Data Engineering

xAI Unveils a New API Service for Grok Models

Elon Musk’s xAI has launched a public beta for its API service, enabling developers to integrate xAI's large language models (LLMs) into their applications.

Daniel Dominguez
on Nov 05, 2024
AI, ML & Data Engineering

OpenAI Releases ChatGPT Search Feature

OpenAI recently released ChatGPT Search which allows ChatGPT to search the web when answering user questions. Instead of being limited to knowledge available at the time of training, ChatGPT can now incorporate current information from the web and include links to its sources.

Anthony Alford
on Nov 05, 2024
AI, ML & Data Engineering

Meta MobileLLM Advances LLM Design for On-Device Use Cases

With MobileLLM, Meta researchers aim to show that, for smaller models, quality is not a direct product of how many billions parameters they have; rather, it is the result of carefully designing their architecture. To prove their point, they coupled deep and thin architectures with embedding sharing and grouped-query attention mechanisms to improve accuracy over prior state-of-the-art models.

Sergio De Simone
on Nov 05, 2024
.NET

Microsoft Introduces Vector Data Abstractions Library for .NET

On October 29th 2024, Microsoft released Microsoft.Extensions.VectorData.Abstractions library for .NET in preview. It makes it easier to integrate .NET solutions with the AI Semantic Kernel SDK, using abstractions over concrete AI implementations and models.

Edin Kapić
on Nov 04, 2024
AI, ML & Data Engineering

Meta AI Introduces Thought Preference Optimization Enabling AI Models to Think before Responding

Researchers from Meta FAIR, the University of California, Berkeley, and New York University have introduced Thought Preference Optimization (TPO), a new method aimed at improving the response quality of instruction-fine tuned LLMs.

Daniel Dominguez
on Nov 04, 2024
AI, ML & Data Engineering

PostgreSQL 17 Released with Improved Vacuum Process and Performance Gains

The PostgreSQL Global Development Group recently announced the general availability of PostgreSQL 17, the latest version of the popular open-source database. This release focuses on performance improvements, including a new memory management implementation for vacuum, storage access optimizations, and enhancements for high-concurrency workloads.

Renato Losio
on Nov 01, 2024
AI, ML & Data Engineering

Meta Spirit LM Integrates Speech and Text in New Multimodal GenAI Model

Presented in a recent paper, Spirit LM enables the creation of pipelines that mixes spoken and written text to integrate speech and text in the same multimodal model. According to Meta, their novel approach, based on interleaving text and speech tokens, makes it possible to circumvent the inherent limitations of prior solutions that use distinct pipelines for speech and text.

Sergio De Simone
on Oct 31, 2024
AI, ML & Data Engineering

PyTorch 2.5 Release Includes Support for Intel GPUs

The PyTorch Foundation recently released PyTorch version 2.5, which contains support for Intel GPUs. The release also includes several performance enhancements, such as the FlexAttention API, TorchInductor CPU backend optimizations, and a regional compilation feature which reduces compilation time. Overall, the release contains 4095 commits since PyTorch 2.4.

Anthony Alford
on Oct 29, 2024

Newer News

Older News

InfoQ Software Architects' Newsletter

News