InfoQ Homepage Large language models Content on InfoQ

News

RSS Feed

Newer Older

AI, ML & Data Engineering

Roblox Releases Cube 3D, an AI Open-Source Model for 3D Model Generation

Roblox has introduced Cube 3D, a generative AI system designed for creating 3D and 4D objects and environments.

Daniel Dominguez
on Mar 22, 2025
Architecture & Design

Dapr Agents: Scalable AI Workflows with LLMs, Kubernetes & Multi-Agent Coordination

Introducing Dapr Agents—a groundbreaking framework for creating scalable AI agents using Large Language Models (LLMs). With robust workflows, multi-agent coordination, and cloud-neutral architecture, it enables enterprises to deploy thousands of resilient agents. Built on Dapr’s proven infrastructure, Dapr Agents ensures reliability and observability in AI-driven applications.

Eran Stiller
on Mar 20, 2025
Mobile

Google Launches Gemma 3 1B for Mobile and Web Apps

Requiring a "mere" 529MB, Gemma 3 1B is a small language model (SLM) specifically meant for distribution across mobile and Web apps, where models must download quickly and be responsive to keep user engagement high.

Sergio De Simone
on Mar 17, 2025
AI, ML & Data Engineering

Google Report Reveals How Threat Actors Are Currently Using Generative AI

Google's Threat Intelligence Group (GTIG) recently released a report on the adversarial misuse of generative AI. The team investigated prompts used by advanced persistent threat (APT) and coordinated information operations (IO) actors, finding that they have so far achieved productivity gains but have not yet developed novel capabilities.

Renato Losio
on Mar 15, 2025
AI, ML & Data Engineering

Google Introduces AI Co-Scientist System to Aid Scientific Research

Google has announced the development of an AI co-scientist system designed to assist scientists in generating hypotheses and research proposals. Built using Gemini 2.0, the system aims to accelerate scientific and biomedical discoveries by emulating the scientific method and fostering collaboration between humans and AI.

Daniel Dominguez
on Mar 12, 2025
AI, ML & Data Engineering

OpenAI Introduces Software Engineering Benchmark

OpenAI has introduced the SWE-Lancer benchmark, to evaluate the capabilities of advanced AI language models in real-world freelance software engineering tasks.

Daniel Dominguez
on Mar 08, 2025
Mobile

Google's Image Generation Model Imagen 3 Now Available in Vertex AI in Firebase as a Preview

Google's most advanced GenAI image generation model, Imagen 3, is now available in preview through Vertex AI in Firebase enabling seamless integration into Android and iOS apps through its Kotlin and Swift SDKs.

Sergio De Simone
on Mar 08, 2025
AI, ML & Data Engineering

instructlab.ai Uses Synthetic Data to Reduce Complexity of Fine-Tuning LLMs

InstructLab.ai implements the large-scale alignment for the chatbots concept(LAB), which intends to overcome the scalability challenges in the instruction-tuning phase of a large language model (LLM). Its approach leverages a synthetic data-based alignment tuning method for LLMs. Crafted taxonomies deliver the synthesization seeds for training data, reducing the need for human-annotated data.

Olimpiu Pop
on Mar 07, 2025
AI, ML & Data Engineering

Mistral AI Introduces Saba: Regional Language Model for Arabic and South Indian Language

Mistral AI has introduced Mistral Saba, a 24-billion-parameter language model designed to improve AI performance in Arabic and several Indian-origin languages, particularly South Indian languages like Tamil.

Robert Krzaczyński
on Mar 06, 2025
AI, ML & Data Engineering

Hugging Face Publishes Guide on Efficient LLM Training across GPUs

Hugging Face has published the Ultra-Scale Playbook: Training LLMs on GPU Clusters, an open-source guide that provides a detailed exploration of the methodologies and technologies involved in training LLMs across GPU clusters.

Daniel Dominguez
on Mar 04, 2025
AI, ML & Data Engineering

IBM Granite 3.2 Brings New Vision Language Model, Chain of Thought Reasoning, Improved TimeSeries

IBM has introduced its new Granite 3.2 multi-modal and reasoning model. Granite 3.2 features experimental chain-of-thought reasoning capabilities that significantly improve its predecessor's performance, a new vision language model (VLM) outperforming larger models on several benchmarks, and smaller models for more efficient deployments.

Sergio De Simone
on Mar 02, 2025
AI, ML & Data Engineering

GitHub Copilot Extensions Integrate IDEs with External Services

Now generally available, GitHub Copilot Extensions allow developers to use natural language to query documentation, generate code, retrieve data, and execute actions on external services without leaving their IDEs. Besides using public extensions from companies like Docker, MongoDB, Sentry, and many more, developers can create their own extensions to work with internal libraries or APIs.

Sergio De Simone
on Feb 26, 2025
AI, ML & Data Engineering

Google DeepMind’s AlphaGeometry2 AI Achieves Gold-Medal Math Olympiad Performance

Google DeepMind's AlphaGeometry2 (AG2) AI model solved 84% of the geometry problems from the last 25 years of International Math Olympiads (IMO), outperforming the average human gold-medalist performance.

Anthony Alford
on Feb 25, 2025
AI, ML & Data Engineering

Perplexity Unveils Deep Research: AI-Powered Tool for Advanced Analysis

Perplexity has introduced Deep Research, an AI-powered tool designed for conducting in-depth analysis across various fields, including finance, marketing, and technology. The system automates the research process by performing multiple searches, analyzing extensive sources, and synthesizing findings into structured reports within minutes.

Robert Krzaczyński
on Feb 24, 2025
AI, ML & Data Engineering

Google Gemini's Long-term Memory Vulnerable to a Kind of Phishing Attack

AI security hacker Johann Rehberger described a prompt injection attack against Google Gemini able to modify its long-term memories using a technique he calls delayed tool invocation. The researcher described the attack as a sort of social engineering/phishing attack triggered by the user interacting with a malicious document.

Sergio De Simone
on Feb 21, 2025

Newer News

Older News

InfoQ Software Architects' Newsletter

News