-
Microsoft Introduces Vector Data Abstractions Library for .NET
On October 29th, 2024, Microsoft released the Microsoft.Extensions.VectorData.Abstractions library for .NET in preview. It makes it easier to integrate .NET solutions with the Semantic Kernel AI SDK, using abstractions over concrete AI implementations and models.
-
Meta AI Introduces Thought Preference Optimization Enabling AI Models to Think before Responding
Researchers from Meta FAIR, the University of California, Berkeley, and New York University have introduced Thought Preference Optimization (TPO), a new method aimed at improving the response quality of instruction fine-tuned LLMs.
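The core loop can be sketched as follows. This is a minimal illustration of how TPO-style preference pairs might be constructed, not Meta's implementation: `judge_score` and the `<R>` thought/response marker are invented stand-ins for a judge model and the prompt format; the key idea is that the judge scores only the response, so the hidden thought is optimized indirectly.

```python
# Sketch: build a (chosen, rejected) preference pair for TPO-style training.
# The judge sees only the response part; the thought is optimized indirectly.

def split_thought(sample: str) -> tuple[str, str]:
    """Split a sampled generation into (thought, response) on an illustrative marker."""
    thought, _, response = sample.partition("<R>")
    return thought.strip(), response.strip()

def build_preference_pair(samples: list[str], judge_score) -> tuple[str, str]:
    """Rank full samples by judging only their responses; best/worst become the pair."""
    scored = sorted(samples, key=lambda s: judge_score(split_thought(s)[1]), reverse=True)
    return scored[0], scored[-1]

# Toy judge (prefers longer responses) just to make the sketch runnable.
pair = build_preference_pair(
    ["I should list steps. <R> Short answer.",
     "Let me reason carefully. <R> A longer, more complete answer."],
    judge_score=len,
)
```

The resulting pair would then feed a standard preference-optimization step (e.g. DPO) over the full thought-plus-response texts.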
-
PostgreSQL 17 Released with Improved Vacuum Process and Performance Gains
The PostgreSQL Global Development Group recently announced the general availability of PostgreSQL 17, the latest version of the popular open-source database. This release focuses on performance improvements, including a new memory management implementation for vacuum, storage access optimizations, and enhancements for high-concurrency workloads.
-
Meta Spirit LM Integrates Speech and Text in New Multimodal GenAI Model
Presented in a recent paper, Spirit LM enables the creation of pipelines that mix spoken and written text, integrating speech and text in the same multimodal model. According to Meta, their novel approach, based on interleaving text and speech tokens, makes it possible to circumvent the inherent limitations of prior solutions that use distinct pipelines for speech and text.
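The interleaving idea can be illustrated with a tiny sketch: text tokens and discretized speech units are merged into one token stream with modality markers, so a single model attends across both. The marker strings and token names below are invented for illustration, not Meta's actual vocabulary.

```python
# Sketch: flatten (modality, tokens) segments into one interleaved token stream.

def interleave(segments: list[tuple[str, list[str]]]) -> list[str]:
    stream = []
    for modality, tokens in segments:
        stream.append(f"[{modality.upper()}]")  # modality switch marker
        stream.extend(tokens)
    return stream

seq = interleave([
    ("text", ["the", "cat"]),
    ("speech", ["unit_41", "unit_7"]),  # speech as discrete acoustic units
    ("text", ["sat"]),
])
# seq is a single sequence mixing both modalities
```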
-
PyTorch 2.5 Release Includes Support for Intel GPUs
The PyTorch Foundation recently released PyTorch version 2.5, which contains support for Intel GPUs. The release also includes several performance enhancements, such as the FlexAttention API, TorchInductor CPU backend optimizations, and a regional compilation feature which reduces compilation time. Overall, the release contains 4095 commits since PyTorch 2.4.
-
RAG-Powered Copilot Saves Uber 13,000 Engineering Hours
Uber recently detailed how it built Genie, an AI-powered on-call copilot designed to improve the efficiency of on-call support engineers. Genie leverages Retrieval-Augmented Generation (RAG) to provide accurate real-time responses and significantly enhance the speed and effectiveness of incident response. Since its launch, Genie has answered over 70,000 questions, saving 13,000 engineering hours.
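The RAG loop behind a copilot like Genie can be sketched in a few lines: embed the documents, retrieve the closest ones for a question, and place them in the prompt. This toy version uses bag-of-words cosine similarity as a stand-in for learned embeddings; a production system like Genie uses a real embedding model and an LLM, and the document strings here are invented.

```python
import math
from collections import Counter

def embed(text: str) -> Counter:
    return Counter(text.lower().split())  # bag-of-words stand-in for an embedding

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(question: str, docs: list[str], k: int = 2) -> list[str]:
    q = embed(question)
    return sorted(docs, key=lambda d: cosine(q, embed(d)), reverse=True)[:k]

docs = [
    "restart the ingestion service with the deploy tool",
    "rotate credentials when the auth token expires",
    "the cafeteria menu changes on mondays",
]
question = "how do I restart the ingestion service"
context = retrieve(question, docs, k=1)
# Augment the generation prompt with the retrieved context:
prompt = "Answer using only this context:\n" + "\n".join(context) + f"\nQ: {question}"
```

Grounding the answer in retrieved documents is what lets such a copilot stay accurate on internal, fast-changing runbook content.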
-
Rhymes AI Unveils Aria: Open-Source Multimodal Model with Development Resources
Rhymes AI has introduced Aria, an open-source multimodal native Mixture-of-Experts (MoE) model capable of processing text, images, video, and code effectively. In benchmarking tests, Aria has outperformed other open models and demonstrated competitive performance against proprietary models such as GPT-4o and Gemini-1.5.
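The Mixture-of-Experts routing that underlies a model like Aria can be illustrated numerically: a gating network scores the experts for each input, and only the top-k experts are actually evaluated. The dimensions, expert count, and top-2 choice below are illustrative, not Aria's actual configuration.

```python
import numpy as np

rng = np.random.default_rng(0)
d, n_experts, k = 8, 4, 2
experts = [rng.normal(size=(d, d)) for _ in range(n_experts)]  # toy expert weights
gate_w = rng.normal(size=(d, n_experts))                       # toy gating network

def moe_forward(x: np.ndarray) -> np.ndarray:
    logits = x @ gate_w                    # gating score per expert
    top = np.argsort(logits)[-k:]          # indices of the k highest-scoring experts
    weights = np.exp(logits[top])
    weights /= weights.sum()               # softmax over the selected experts only
    return sum(w * (x @ experts[i]) for w, i in zip(weights, top))

y = moe_forward(rng.normal(size=d))        # only k of n_experts are evaluated
```

This sparsity is why MoE models can carry a large total parameter count while keeping per-token compute close to that of a much smaller dense model.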
-
Stable Diffusion 3.5 Improves Text Rendering, Image Quality, Consistency, and More
Stability AI has released Stable Diffusion 3.5 Large, its most powerful text-to-image generation model to date, and Stable Diffusion 3.5 Large Turbo, with special emphasis on customizability, efficiency, and flexibility. Both models come with a free licensing model for non-commercial and limited commercial use.
-
AI and ML Tracks at QCon San Francisco 2024 – a Deep Dive into GenAI & Practical Applications
At QCon San Francisco 2024, explore two AI/ML-focused tracks highlighting real-world applications and innovations. Learn from industry experts on deploying LLMs, GenAI, and recommendation systems, gaining practical strategies for integrating AI into software development.
-
Meta Optimizes Data Center Sustainability with Reinforcement Learning
In a recent blog post, Meta describes how its engineers use reinforcement learning (RL) to optimize environmental controls in Meta's data centers, reducing energy consumption and water usage while addressing broader challenges such as climate change.
-
Microsoft Unveils Azure Cobalt 100-Based Virtual Machines: Enhanced Performance and Sustainability
Microsoft's Azure Cobalt 100 VMs are now generally available. They deliver up to 50% improved price performance with energy-efficient Arm architecture. Tailored for diverse workloads, these VMs offer various configurations, including general-purpose and memory-optimized options. Their release supports sustainable computing, aligning with Microsoft's commitment to lower carbon footprints.
-
Microsoft Launches Azure Confidential VMs with NVIDIA Tensor Core GPUs for Enhanced Secure Workloads
Microsoft's Azure has launched the NCC H100 v5 virtual machines, now equipped with NVIDIA Tensor Core GPUs, enhancing secure computing for high-performance workloads. These VMs leverage AMD EPYC processors for robust data protection, making them ideal for tasks like AI model training and inferencing, while ensuring a trusted execution environment for sensitive applications.
-
Distill Your LLMs and Surpass Their Performance: spaCy's Creator at InfoQ DevSummit Munich
In her presentation at the inaugural InfoQ Dev Summit Munich, Ines Montani built on her talk from QCon London earlier this year, providing the audience with practical solutions for using the latest state-of-the-art models in real-world applications and distilling their knowledge into smaller, faster components that can be run and maintained in-house.
-
University Researchers Publish Analysis of Chain-of-Thought Reasoning in LLMs
Researchers from Princeton University and Yale University published a case study of Chain-of-Thought (CoT) reasoning in LLMs which shows evidence of both memorization and true reasoning. They also found that CoT can work even when examples given in the prompt are incorrect.
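The few-shot Chain-of-Thought setup the study analyzes can be sketched as prompt construction: each exemplar pairs a question with worked reasoning before the final answer, and the model is left to continue in the same pattern. The exemplar content below is invented for illustration; the study's finding that CoT can work even with incorrect exemplars concerns what happens when the reasoning strings in such prompts contain errors.

```python
# Sketch: assemble a few-shot Chain-of-Thought prompt.

def cot_prompt(examples: list[tuple[str, str, str]], question: str) -> str:
    parts = []
    for q, reasoning, answer in examples:
        parts.append(f"Q: {q}\nA: {reasoning} The answer is {answer}.")
    parts.append(f"Q: {question}\nA:")  # the model continues with its own reasoning
    return "\n\n".join(parts)

prompt = cot_prompt(
    [("What is 3 + 4 * 2?", "4 * 2 = 8, and 3 + 8 = 11.", "11")],
    "What is 5 + 6 * 2?",
)
```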
-
Microsoft and Tsinghua University Present DIFF Transformer for LLMs
Researchers from Microsoft AI and Tsinghua University have introduced a new architecture called the Differential Transformer (DIFF Transformer), aimed at improving the performance of large language models. This model enhances attention mechanisms by refining how models handle context and minimizing distractions from irrelevant information.
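The differential attention idea can be sketched numerically: two softmax attention maps are computed from two sets of query/key projections, and their difference, scaled by a learned factor, attends to the values, canceling common-mode "noise" attention shared by both maps. The shapes and the fixed lambda below are illustrative, not the paper's exact parameterization.

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def diff_attention(q1, k1, q2, k2, v, lam=0.5):
    d = q1.shape[-1]
    a1 = softmax(q1 @ k1.T / np.sqrt(d))   # first attention map
    a2 = softmax(q2 @ k2.T / np.sqrt(d))   # second attention map
    return (a1 - lam * a2) @ v             # the difference attends to the values

rng = np.random.default_rng(1)
n, d = 4, 8
out = diff_attention(*(rng.normal(size=(n, d)) for _ in range(5)))
```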