InfoQ Homepage AI, ML & Data Engineering Content on InfoQ
-
Dapr Agents: Scalable AI Workflows with LLMs, Kubernetes & Multi-Agent Coordination
Introducing Dapr Agents—a groundbreaking framework for creating scalable AI agents using Large Language Models (LLMs). With robust workflows, multi-agent coordination, and cloud-neutral architecture, it enables enterprises to deploy thousands of resilient agents. Built on Dapr’s proven infrastructure, Dapr Agents ensures reliability and observability in AI-driven applications.
-
Gemini Code Assist Now Grants Generous Free-Usage Limits to Everyone
Born as an enterprise-focused AI-based code generation tool, Gemini Code Assist now provides a free tier to individual developers with a limit of 6,000 code completions and 240 chat requests daily.
-
Google DeepMind Unveils Gemini Robotics
Google DeepMind has introduced Gemini Robotics, an advanced AI model designed to enhance robotics by integrating vision, language, and action. This innovation, based on the Gemini 2.0 framework, aims to make robots smarter and more capable, particularly in real-world settings.
-
Google Launches Gemma 3 1B for Mobile and Web Apps
Requiring a "mere" 529MB, Gemma 3 1B is a small language model (SLM) specifically meant for distribution across mobile and Web apps, where models must download quickly and be responsive to keep user engagement high.
-
OpenAI Launches New API, SDK, and Tools to Develop Custom Agents
OpenAI has announced the new Responses API, the Agents SDK, and observability tools to address the challenges that creating production-ready agents pose, such as building custom orchestration, and handling prompt iteration across complex, multi-step tasks.
-
Google Report Reveals How Threat Actors Are Currently Using Generative AI
Google's Threat Intelligence Group (GTIG) recently released a report on the adversarial misuse of generative AI. The team investigated prompts used by advanced persistent threat (APT) and coordinated information operations (IO) actors, finding that they have so far achieved productivity gains but have not yet developed novel capabilities.
-
Microsoft Launches New AI Chat Web App Template for .NET Development
Last week, Microsoft announced a new AI Chat Web App template, available in preview, designed to simplify AI development with .NET. This template is part of Microsoft's ongoing efforts to make AI more accessible, offering scaffolding and guidance in Visual Studio, Visual Studio Code, and the .NET CLI.
-
Google Introduces AI Co-Scientist System to Aid Scientific Research
Google has announced the development of an AI co-scientist system designed to assist scientists in generating hypotheses and research proposals. Built using Gemini 2.0, the system aims to accelerate scientific and biomedical discoveries by emulating the scientific method and fostering collaboration between humans and AI.
-
Google Enhances Data Privacy with Confidential Federated Analytics
Google has announced Confidential Federated Analytics (CFA), a technique designed to increase transparency in data processing while maintaining privacy. Building on federated analytics, CFA leverages confidential computing to ensure that only predefined and inspectable computations are performed on user data without exposing raw data to servers or engineers.
-
OpenAI Introduces Software Engineering Benchmark
OpenAI has introduced the SWE-Lancer benchmark, to evaluate the capabilities of advanced AI language models in real-world freelance software engineering tasks.
-
Google DeepMind Enhances AMIE for Long-Term Disease Management
Google DeepMind has extended the capabilities of its Articulate Medical Intelligence Explorer (AMIE) beyond diagnosis to support longitudinal disease management. The system is now designed to assist clinicians in monitoring disease progression, adjusting treatments, and adhering to clinical guidelines across multiple patient visits.
-
How Engineering Teams Are Tackling AI, Platform Engineering & DevEx: InfoQ Dev Summit Boston 2025
The InfoQ Dev Summit Boston 2025 conference (June 9-10) will bring together senior software practitioners to share proven strategies for integrating AI, scaling resilient architectures, and optimizing developer experience - three key areas that will define engineering success in the next 18 months.
-
instructlab.ai Uses Synthetic Data to Reduce Complexity of Fine-Tuning LLMs
InstructLab.ai implements the large-scale alignment for the chatbots concept(LAB), which intends to overcome the scalability challenges in the instruction-tuning phase of a large language model (LLM). Its approach leverages a synthetic data-based alignment tuning method for LLMs. Crafted taxonomies deliver the synthesization seeds for training data, reducing the need for human-annotated data.
-
Mistral AI Introduces Saba: Regional Language Model for Arabic and South Indian Language
Mistral AI has introduced Mistral Saba, a 24-billion-parameter language model designed to improve AI performance in Arabic and several Indian-origin languages, particularly South Indian languages like Tamil.
-
Google Enhances AlloyDB Vector Search with Inline Filtering and Enterprise Observability
Google enhanced its AlloyDB service with inline filtering and enterprise observability for vector search. This fully-managed PostgreSQL-compatible database now allows direct filtering during queries, offering improved speed and efficiency. Enhanced monitoring features provide deep insights, addressing scaling vector search operations challenges.