InfoQ Homepage AI, ML & Data Engineering Content on InfoQ
-
Meta Releases NotebookLlama: Open-Source PDF to Podcast Toolkit
Meta has released NotebookLlama, an open-source toolkit designed to convert PDF documents into podcasts, providing developers with a structured, accessible PDF-to-audio workflow. As an open-source alternative to Google’s NotebookLM, NotebookLlama guides users through a four-step process that converts PDF text into audio content.
-
GitHub Universe 2024 Unveils AI Innovations and Developer-Centric Tools
GitHub Universe 2024 unveiled groundbreaking updates emphasizing developer autonomy and AI capabilities. With multi-model support for Copilot, the introduction of AI-driven GitHub Spark, enhanced security features, and improved workflows in popular IDEs, GitHub aims to democratize coding and empower developers, regardless of skill level, to harness the full potential of artificial intelligence.
-
Native Vector Support in Azure SQL Database in Public Preview
Azure SQL Database now supports native vector storage and processing, streamlining AI development by integrating vector search with SQL queries. This update simplifies database management, enhances data analysis, and boosts performance by eliminating data movement. Ideal for diverse applications, it empowers sectors like e-commerce and healthcare with advanced, context-aware functionalities.
-
Amazon Q Adds Inline Chat Support for Visual Studio Code and JetBrains IDEs
AWS recently announced that Amazon Q Developer now supports inline chat, enabling developers to discuss and resolve code issues directly within the editor. Combining the benefits of in-IDE chat with the ability to directly update codes, this feature is powered by Claude 3.5 and available in Visual Studio Code and JetBrains IDEs.
-
Hugging Face Launches SmolTools: Practical AI Apps Powered by SmolLM2 Model
Hugging Face has introduced SmolTools, a set of applications built on the recently launched SmolLM2 model, a compact 1.7-billion parameter language model. SmolTools includes specialized tools for summarization, rewriting, and task automation, bringing efficient AI functionality to a broader range of users.
-
Google Debuts OpenAI-compatible API for Gemini
In an effort to make it easier for developers who adopted OpenAI for their LLM-based solutions to switch to Gemini, Google has launched a new endpoint for its Gemini API that allows them to easily switch from one service to the other. The new endpoint is still in beta and provides only partial coverage of OpenAI capabilities.
-
Anthropic Releases New Claude Models and Computer Use Feature
Anthropic released two new models: Claude 3.5 Haiku and an improved version of Claude 3.5 Sonnet. They also released a new feature for Claude 3.5 Sonnet that allows the model to interact with a computer's GUI the same way a human user does.
-
Decart and Etched Release Oasis, a New AI Model Transforming Gaming Worlds
Decart.ai and Etched.ai recently introduced Oasis, an AI-driven model that generates a fully interactive, real-time open-world experience inspired by Minecraft.
-
OSI Releases New Definition for Open Source AI, Setting Standards for Transparency and Accessibility
The Open Source Initiative (OSI) released Version 1.0 of its Open Source AI Definition (OSAID) after two years of development with contributions from global experts. The OSAID sets criteria defining open-source AI, aiming to bring clarity to the concept and establish benchmarks for transparency and accessibility in AI.
-
Grab Employs LLMs for Conversational Data Discovery with GPT-4, Glean and Slack
Grab responded to the challenges of finding valuable datasets among 200k+ tables by enhancing Hubble, the data discovery tool, with new capabilities leveraging GenAI technologies. The company reduced the data discovery process by incorporating LLMs to generate dataset documentation and created a Slack bot to bring effective data discovery to data consumers.
-
Amazon SageMaker JumpStart Expands Portfolio with Bria AI's Text-to-Image Models
Amazon Web Services has integrated Bria AI's latest text-to-image foundation models into Amazon SageMaker JumpStart, marking a significant expansion of its enterprise-grade generative AI capabilities. The addition includes three variants - Bria 2.3, Bria 2.2 HD, and Bria 2.3 Fast, each designed to address specific enterprise needs in visual content generation.
-
GitHub and Google Cloud Collaborate to Bring Gemini 1.5 Pro to GitHub Copilot
GitHub's partnership with Google Cloud brings Gemini 1.5 Pro to GitHub Copilot, revolutionizing development with AI that processes two million tokens. This natively multimodal tool excels at code generation, analysis, and optimization, empowering developers to effortlessly manage extensive codebases in platforms like Visual Studio Code.
-
xAI Unveils a New API Service for Grok Models
Elon Musk’s xAI has launched a public beta for its API service, enabling developers to integrate xAI's large language models (LLMs) into their applications.
-
OpenAI Releases ChatGPT Search Feature
OpenAI recently released ChatGPT Search which allows ChatGPT to search the web when answering user questions. Instead of being limited to knowledge available at the time of training, ChatGPT can now incorporate current information from the web and include links to its sources.
-
Meta MobileLLM Advances LLM Design for On-Device Use Cases
With MobileLLM, Meta researchers aim to show that, for smaller models, quality is not a direct product of how many billions parameters they have; rather, it is the result of carefully designing their architecture. To prove their point, they coupled deep and thin architectures with embedding sharing and grouped-query attention mechanisms to improve accuracy over prior state-of-the-art models.