InfoQ Homepage AI, ML & Data Engineering Content on InfoQ

News

RSS Feed

Newer Older

Cloud

Amazon Aurora PostgreSQL Adds pgvector to Support Embeddings from Generative AI

AWS recently announced that the PostgreSQL-compatible edition of Amazon Aurora now supports pgvector for vector storage and similarity search. Aurora is the latest managed PostgreSQL database supporting the open-source extension to store and search embeddings from machine learning models.

Renato Losio
on Jul 29, 2023
AI, ML & Data Engineering

Meta Open Sources New AI Model Llama 2

Meta is open-sourcing its large language model, Llama 2. The model’s code and weights are being made available free of charge for both research and commercial use. Llama 2 is the result of the expanded partnership between Meta and Microsoft, with the latter being the preferred partner for the new model.

Andrew Hoblitzell
on Jul 28, 2023
Architecture & Design

Grab Reduces Traffic Cost for Kafka Consumers on AWS to Zero

Grab took advantage of the ability of Apache Kafka consumers to connect to the broker node in the same availability zone (AZ) introduced in Kafka 2.3 and reduced the traffic cost on AWS to zero for reconfigured consumers. The change has substantially reduced overall infrastructure costs for running Apache Kafka on AWS.

Rafal Gancarz
on Jul 28, 2023
Web Development

LangChain - Working with Large Language Models, Made Easy

LangChain is a framework that simplifies working with large language models (LLMs) such as OpenAI GPT4 or Google PaLM by providing abstractions for common use cases. It supports both JavaScript and Python.

Guy Nesher
on Jul 26, 2023
Architecture & Design

Pfizer Uses Serverless Architecture on AWS to Scale Processing of Digital Biomarkers

Pfizer upgraded the serverless architecture for processing digital biomarker data at scale to make it more flexible and configurable. They created a framework that uses a file processing pipeline built with AWS Step Functions and other serverless services, as well as a custom Python package for data ingestion and processing.

Rafal Gancarz
on Jul 26, 2023
AI, ML & Data Engineering

Meta's Voicebox Outperforms State-of-the-Art Models on Speech Synthesis

Meta recently announced Voicebox, a speech generation model that can perform text-to-speech (TTS) synthesis in six languages, as well as edit and remove noise from speech recordings. Voicebox is trained on over 50k hours of audio data and outperforms previous state-of-the-art models on several TTS benchmarks.

Anthony Alford
on Jul 25, 2023
AI, ML & Data Engineering

AI, ML, Data Engineering News Round up: Claude 2, Stable Doodle, CM3leon, Llama 2, Azure and xAI

The most recent update, covering developments from July 17th, 2023, showcases significant progress and announcements in the fields of data science, machine learning, and artificial intelligence. This week's focus centers on Anthropic, Stability AI, Microsoft, Meta and xAI.

Daniel Dominguez
on Jul 25, 2023
AI, ML & Data Engineering

Grammarly Replaces its in-House Data Lake with Databricks Platform Using Medallion Architecture

Grammarly adopted the medallion architecture while migrating from their in-house data lake, storing Parquet files in AWS S3, to the Delta Lake lakehouse. The company created a new event store for over 6000 event types from 40 internal and external clients and, in the process, improved data quality and reduced the data-delivery time by 94%.

Rafal Gancarz
on Jul 24, 2023
AI, ML & Data Engineering

GitHub Details Key Prompt Engineering Practices Used to Build Copilot

Prompt engineering is key to creating effective LLM-based applications and does not require to have a PhD in machine learning or generative AI, say GitHub engineers Albert Ziegler and John Berryman, who also shared the lessons they learned developing GitHub Copilot.

Sergio De Simone
on Jul 24, 2023
Java

JetBrains Unveils AI Assistant for IntelliJ-Based IDEs and .NET Tools

JetBrains, the software development company known for creating the IntelliJ IDEA, has announced the introduction of a new AI Assistant in its Early Access Program (EAP) builds for all IntelliJ-based IDEs and .NET tools. This significant addition is aimed at transforming the landscape of software development tools by integrating generative AI and large language models into JetBrains' products.

A N M Bazlur Rahman
on Jul 24, 2023
Cloud

Google Releases Hive-BigQuery Open-Source Connector

Google recently announced the general availability of the Hive-BigQuery Connector, simplifying integration and migrations between Apache Hive and Google BigQuery. The open-source connector is a Hive storage handler that enables Hive to interact with BigQuery's storage layer.

Renato Losio
on Jul 22, 2023
Cloud

Microsoft Introduces the Public Preview of Vector Search Feature in Azure Cognitive Search

At its annual Inspire conference, Microsoft recently announced the public preview of Vector search in Azure Cognitive Search, a capability for building applications powered by large language models. It is a new capability for indexing, storing, and retrieving vector embeddings from a search index.

Steef-Jan Wiggers
on Jul 21, 2023
AI, ML & Data Engineering

Meta AI Reveals CM3leon, an Advanced Text-to-Image Generative Model

Meta AI has introduced CM3leon, a novel multimodal model combining text and image production. This model is the first of its type, using a modified formula from text-only language models to deliver remarkable outcomes with unequaled computational efficiency.

Daniel Dominguez
on Jul 20, 2023
Cloud

Microsoft Azure Managed Lustre for HPC and AI Workloads Now Generally Available

Microsoft recently announced the general availability (GA) of Azure Managed Lustre, a managed file system for high-performance computing (HPC) and AI workloads.

Steef-Jan Wiggers
on Jul 20, 2023
AI, ML & Data Engineering

Introduction to Mojo Programming Language

Mojo is a newly presented programming language that combines the simplicity of Python with the speed and memory security of Rust. It is at an early stage of development and offers users an online playground to explore its features. Mojo aims for excellence in data science and machine learning, providing a fast alternative to Python. There are gradual plans to make it available to open-source.

Robert Krzaczyński
on Jul 19, 2023

Newer News

Older News

InfoQ Software Architects' Newsletter

News