InfoQ Homepage AI, ML & Data Engineering Content on InfoQ
-
PlanetScale Vectors Now GA: MySQL's Missing Feature?
PlanetScale has recently announced that vector support is now generally available. Created as a fork of MySQL, this new feature allows storing vector data alongside an application's relational MySQL data, removing the need for a separate specialized vector database.
-
QCon London 2025: Achieving AI Precision through Intelligent Data Retrieval
Adi Polak, a Confluent expert, addressed AI precision challenges at QCOn London 2025, introducing agentic RAG to enhance data retrieval accuracy. With insights on the limitations of current systems and actionable strategies for implementation, Polak emphasized precision as a crucial factor in operationalizing AI, building trust, and improving business outcomes.
-
Radical AI Releases TorchSim: a PyTorch-Native Engine for Next-Generation Atomistic Simulations
Radical AI has announced the release of TorchSim, a next-generation atomistic simulation engine built natively in PyTorch and designed for the MLIP (machine-learned interatomic potentials) era.
-
Optimize AI Workloads: Google Cloud’s Tips and Tricks
Google Cloud has announced a suite of new tools and features designed to help organizations reduce costs and improve efficiency of AI workloads across their cloud infrastructure. The announcement comes as enterprises increasingly seek ways to optimize spending on AI initiatives while maintaining performance and scalability.
-
AMD’s Gaia Framework Brings Local LLM Inference to Consumer Hardware
AMD has released Gaia, an open-source project allowing developers to run large language models (LLMs) locally on Windows machines with AMD hardware acceleration. The framework supports retrieval-augmented generation (RAG) and includes tools for indexing local data sources. Gaia is designed to offer an alternative to LLMs hosted on a cloud service provider (CSP).
-
Meta AI Releases Llama 4: Early Impressions and Community Feedback
Meta has officially released the first models in its new Llama 4 family—Scout and Maverick—marking a step forward in its open-weight large language model ecosystem. Designed with a native multimodal architecture and a mixture-of-experts (MoE) framework, these models aim to support a broader range of applications, from image understanding to long-context reasoning.
-
Announcing QCon AI: Focusing on Practical, Scalable AI Implementation for Engineering Teams
QCon AI focuses on practical, real-world AI for senior developers, architects, and engineering leaders. Join us Dec 16-17, 2025, in NYC to learn how teams are building and scaling AI in production—covering MLOps, system reliability, cost optimization, and more. No hype, just actionable insights from those doing the work.
-
Recap of Cloudflare Security Week 2025: From Quantum Cryptography to AI Labyrinth
During the recent Cloudflare Security Week 2025, the cloud provider announced various improvements to its cybersecurity services and multiple reports analyzing trends and challenges in security threats. Additionally, they announced AI Labyrinth, a new version of honeypots against unauthorized crawlers, and Cloudflare for AI, a suite of tools aimed at helping the adoption of secure AI technologies.
-
How Observability Can Improve the UX of LLM Based Systems: Insights of Honeycomb's CEO at KubeCon EU
During her KubeCon Europe keynote, Christine Yen, CEO and co-founder of Honeycomb, provided insights on how observability can help cope with the rapid shifts introduced by the integration of LLMs in software systems, which transformed not only the way we develop software but also the release methodology. She explained how to adapt your development feedback loop based on production observations.
-
Amazon Unveils Nova Act SDK and Expands Access to Advanced AI Models
Amazon has announced an expansion of its generative AI capabilities with the introduction of nova.amazon.com, a platform designed to give developers easier access to its foundation models. This includes the newly unveiled Amazon Nova Act, an AI model specifically trained to execute actions within web browsers.
-
GitLab Introduces GitLab Duo with Amazon Q
The integration of Amazon Q Developer with GitLab, introduced as GitLab Duo with Amazon Q, embeds generative AI capabilities directly into GitLab, enabling developers to receive AI-driven assistance for tasks such as feature development, code upgrades, reviews, and unit testing.
-
OpenAI Releases Improved Image Generation in GPT-4o
OpenAI released a new version of GPT-4o with native image generation capability. The model can modify uploaded images or create new ones from prompts and exhibits multi-turn consistency when refining images and improved generation of text in images.
-
OpenAI Introduces New Speech Models for Transcription and Voice Generation
OpenAI has introduced new speech-to-text and text-to-speech models in its API, focusing on improving transcription accuracy and offering more control over AI-generated voices. These updates aim to enhance automated speech applications, making them more adaptable to different environments and use cases.
-
Mistral AI Launches API for LLM-Based OCR of Multimodal Documents
Now available on Mistral AI's la Plateforme SaaS, Mistral OCR aims to provide an OCR solution for digitizing complex documents that interleave text and images, tables, mathematical expressions, and advanced layouts. This makes it particularly suitable for digitizing scientific research, historical documents and artifacts, user manuals, and more, the company says.
-
Google DeepMind Launches TxGemma: Advancing AI-Driven Drug Discovery and Development
Designed to enhance the efficiency of drug discovery and clinical trial predictions. Built on the Gemma model family, TxGemma aims to streamline the drug development process and accelerate the discovery of new treatments.