InfoQ Homepage Large language models Content on InfoQ

News

RSS Feed

Newer Older

AI, ML & Data Engineering

Meta Releases NotebookLlama: Open-Source PDF to Podcast Toolkit

Meta has released NotebookLlama, an open-source toolkit designed to convert PDF documents into podcasts, providing developers with a structured, accessible PDF-to-audio workflow. As an open-source alternative to Google’s NotebookLM, NotebookLlama guides users through a four-step process that converts PDF text into audio content.

Robert Krzaczyński
on Nov 17, 2024
AI, ML & Data Engineering

Google Debuts OpenAI-compatible API for Gemini

In an effort to make it easier for developers who adopted OpenAI for their LLM-based solutions to switch to Gemini, Google has launched a new endpoint for its Gemini API that allows them to easily switch from one service to the other. The new endpoint is still in beta and provides only partial coverage of OpenAI capabilities.

Sergio De Simone
on Nov 12, 2024
AI, ML & Data Engineering

Anthropic Releases New Claude Models and Computer Use Feature

Anthropic released two new models: Claude 3.5 Haiku and an improved version of Claude 3.5 Sonnet. They also released a new feature for Claude 3.5 Sonnet that allows the model to interact with a computer's GUI the same way a human user does.

Anthony Alford
on Nov 12, 2024
AI, ML & Data Engineering

Decart and Etched Release Oasis, a New AI Model Transforming Gaming Worlds

Decart.ai and Etched.ai recently introduced Oasis, an AI-driven model that generates a fully interactive, real-time open-world experience inspired by Minecraft.

Daniel Dominguez
on Nov 10, 2024
Architecture & Design

Grab Employs LLMs for Conversational Data Discovery with GPT-4, Glean and Slack

Grab responded to the challenges of finding valuable datasets among 200k+ tables by enhancing Hubble, the data discovery tool, with new capabilities leveraging GenAI technologies. The company reduced the data discovery process by incorporating LLMs to generate dataset documentation and created a Slack bot to bring effective data discovery to data consumers.

Rafal Gancarz
on Nov 07, 2024
AI, ML & Data Engineering

Amazon SageMaker JumpStart Expands Portfolio with Bria AI's Text-to-Image Models

Amazon Web Services has integrated Bria AI's latest text-to-image foundation models into Amazon SageMaker JumpStart, marking a significant expansion of its enterprise-grade generative AI capabilities. The addition includes three variants - Bria 2.3, Bria 2.2 HD, and Bria 2.3 Fast, each designed to address specific enterprise needs in visual content generation.

Vinod Goje
on Nov 06, 2024
AI, ML & Data Engineering

GitHub and Google Cloud Collaborate to Bring Gemini 1.5 Pro to GitHub Copilot

GitHub's partnership with Google Cloud brings Gemini 1.5 Pro to GitHub Copilot, revolutionizing development with AI that processes two million tokens. This natively multimodal tool excels at code generation, analysis, and optimization, empowering developers to effortlessly manage extensive codebases in platforms like Visual Studio Code.

Robert Krzaczyński
on Nov 06, 2024
AI, ML & Data Engineering

xAI Unveils a New API Service for Grok Models

Elon Musk’s xAI has launched a public beta for its API service, enabling developers to integrate xAI's large language models (LLMs) into their applications.

Daniel Dominguez
on Nov 05, 2024
AI, ML & Data Engineering

OpenAI Releases ChatGPT Search Feature

OpenAI recently released ChatGPT Search which allows ChatGPT to search the web when answering user questions. Instead of being limited to knowledge available at the time of training, ChatGPT can now incorporate current information from the web and include links to its sources.

Anthony Alford
on Nov 05, 2024
AI, ML & Data Engineering

Meta MobileLLM Advances LLM Design for On-Device Use Cases

With MobileLLM, Meta researchers aim to show that, for smaller models, quality is not a direct product of how many billions parameters they have; rather, it is the result of carefully designing their architecture. To prove their point, they coupled deep and thin architectures with embedding sharing and grouped-query attention mechanisms to improve accuracy over prior state-of-the-art models.

Sergio De Simone
on Nov 05, 2024
.NET

Microsoft Introduces Vector Data Abstractions Library for .NET

On October 29th 2024, Microsoft released Microsoft.Extensions.VectorData.Abstractions library for .NET in preview. It makes it easier to integrate .NET solutions with the AI Semantic Kernel SDK, using abstractions over concrete AI implementations and models.

Edin Kapić
on Nov 04, 2024
AI, ML & Data Engineering

Meta AI Introduces Thought Preference Optimization Enabling AI Models to Think before Responding

Researchers from Meta FAIR, the University of California, Berkeley, and New York University have introduced Thought Preference Optimization (TPO), a new method aimed at improving the response quality of instruction-fine tuned LLMs.

Daniel Dominguez
on Nov 04, 2024
AI, ML & Data Engineering

Meta Spirit LM Integrates Speech and Text in New Multimodal GenAI Model

Presented in a recent paper, Spirit LM enables the creation of pipelines that mixes spoken and written text to integrate speech and text in the same multimodal model. According to Meta, their novel approach, based on interleaving text and speech tokens, makes it possible to circumvent the inherent limitations of prior solutions that use distinct pipelines for speech and text.

Sergio De Simone
on Oct 31, 2024
AI, ML & Data Engineering

Stable Diffusion 3.5 Improves Text Rendering, Image Quality, Consistency, and More

Stability AI has released Stable Diffusion 3.5 Large, its most powerful text-to-image generation model to date, and Stable Diffusion 3.5 Large Turbo, with special emphasis on customizability, efficiency, and flexibility. Both models come with a free licensing model for non commercial and limited commercial use.

Sergio De Simone
on Oct 25, 2024
AI, ML & Data Engineering

AI and ML Tracks at QCon San Francisco 2024 – a Deep Dive into GenAI & Practical Applications

At QCon San Francisco 2024, explore two AI/ML-focused tracks highlighting real-world applications and innovations. Learn from industry experts on deploying LLMs, GenAI, and recommendation systems, gaining practical strategies for integrating AI into software development.

Artenisa Chatziou
on Oct 25, 2024

Newer News

Older News

InfoQ Software Architects' Newsletter

News