InfoQ Homepage AI, ML & Data Engineering Content on InfoQ

News

RSS Feed

Newer Older

AI, ML & Data Engineering

Cohere Unveils Advanced Embedding Model Embed v3

Cohere has unveiled Embed v3, their most advanced embedding model designed to transform semantic search and generative AI.

Daniel Dominguez
on Nov 09, 2023
AI, ML & Data Engineering

Mojo Language SDK Available: Mojo Driver, VS Code extension, and Jupyter Kernel

Mojo SDK is available for developers. It contains the mojo driver, the Visual Studio Code extension and the Jupyter kernel. For now, SDK is available for MacOS and Linux.

Robert Krzaczyński
on Nov 09, 2023
AI, ML & Data Engineering

AI Researchers Improve LLM-Based Reasoning by Mimicking Learning from Mistakes

Researchers from Microsoft, Peking University, and Xiâ€™an Jiaotong University claim to have developed a technique to improve large language models' (LLMs) ability to solve math problems by replicating how humans learn from their own mistakes.

Sergio De Simone
on Nov 08, 2023
AI, ML & Data Engineering

OpenAI Announces New Models and APIs at First Developer Day Conference

OpenAI announced additions and price reductions across its platform at its first Developer Day. The updates include the introduction of a new GPT-4 Turbo model, an Assistants API, and multimodal capabilities, among others.

Andrew Hoblitzell
on Nov 08, 2023
AI, ML & Data Engineering

Microsoft Releases DeepSpeed-FastGen for High-Throughput Text Generation

Microsoft has announced the alpha release of DeepSpeed-FastGen, a system designed to improve the deployment and serving of large language models (LLMs). DeepSpeed-FastGen is the synergistic composition of DeepSpeed-MII and DeepSpeed-Inference . DeepSpeed-FastGen is based on the Dynamic SplitFuse technique. The system currently supports several model architectures.

Andrew Hoblitzell
on Nov 07, 2023
AI, ML & Data Engineering

Jina AI's Open-Source Embedding Model Outperforms OpenAI's Ada

Multimodal AI company Jina AI recently released jina-embeddings-v2, a sentence embedding model. The model supports context lengths up to 8192 tokens and outperforms OpenAI's text-embedding-ada-002 on several embedding benchmarks.

Anthony Alford
on Nov 07, 2023
Java

Java News Roundup: JHipster 8.0, Implicit Classes and Instance Main Methods, Kotlin 1.9.20

This week's Java roundup for October 30th, 2023, features news from OpenJDK, JDK 22, GlassFish 7.0.10, Spring Boot 3.2-RC2, Spring Cloud 2023.0-RC1, Spring Cloud Stream Applications 2022.0, Spring Statemachine 4.0-M1, Spring Tools 4.20.1, Open Liberty 23.0.11-beta, Micronaut 4.1.6, Grails 6.1, TomEE 8.0.16, Infinispan 14.0.20, JHipster 8.0, JHipster Lite 0.47, JReleaser 1.9 and Kotlin 1.9.20.

Michael Redlich
on Nov 06, 2023
Java

Do Gen AI and OSS Regulation Bring Us Further Away from Exiting the Dependency Hell?

“The security of the software supply chain problem” still persists according to the yearly State Of Supply Chain report. It improved, but there is still a long way to go, given that 96% of all vulnerable downloads were avoidable. Besides the usual insights of how far from exiting the "dependency hell" we are, the novel challenges of 2023 include the legislative adoption of Gen AI-associated risks.

Olimpiu Pop
on Nov 06, 2023
AI, ML & Data Engineering

AWS Adds New Code Generation Models to Amazon SageMaker JumpStart

AWS recently announced the availability of two new foundation models in Amazon SageMaker JumpStart: Code Llama and Mistral 7B. These models can be deployed with one click to provide AWS users with private inference endpoints for code generation tasks.

Anthony Alford
on Oct 31, 2023
Architecture & Design

Goldsky’s Streaming-First Architecture for Blockchain Data with Flink, Redpanda and Kubernetes

Goldsky created a platform for the real-time processing of blockchain data. The platform allows clients to extract data from blockchains into their own databases to support product features, but without running the data pipeline infrastructure. The event-driven architecture (EDA) of Goldsky leverages Apache Flink, Redpanda, Kubernetes, and cloud provider services.

Rafal Gancarz
on Oct 30, 2023
Java

Java News Roundup: Helidon 4.0, Eclipse Serializer 1.0, JEPs for JDK 22

This week's Java roundup for October 23rd, 2023, features news from OpenJDK, JDK 22, Jakarta Data 1.0-M1, GraalVM 21.0.1, Spring 6.1-RC2, Spring Modulith 1.1-RC1, Spring Vault 3.1-RC1, Helidon 4.0, Eclipse Serializer 1.0, Quarkus 3.5, Liberica NIK 22.3.4, Hibernate ORM 6.4-CR1, Hibernate Search 7.0-CR1, Maven 4.0.0-alpha8, Camel 4.0.2, Camel Quarkus 3.5, JHipster Lite 0.46 and JDKMonitor.

Michael Redlich
on Oct 30, 2023
Cloud

Amazon MSK Replicator: Active-Passive and Active-Active Clusters for Apache Kafka Service

AWS has recently announced MSK Replicator, a new option for cross-region and same-region streaming data replication. The new feature of the Amazon Managed Streaming for Apache Kafka service provides automatic asynchronous replication across clusters, enhancing availability and ensuring business continuity.

Renato Losio
on Oct 29, 2023
AI, ML & Data Engineering

PyTorch 2.1 Release Supports Automatic Dynamic Shape Support and Distributed Training Enhancements

PyTorch Conference 2023 presented an overview of PyTorch 2.1. ExecuTorch was introduced to enhance PyTorch's performance on mobile and edge devices. The conference also had a focus on community with new members added to the PyTorch Foundation and a Docathon announced.

Andrew Hoblitzell
on Oct 25, 2023
AI, ML & Data Engineering

Google Open-Sources AI Fine-Tuning Method Distilling Step-by-Step

A team from the University of Washington and Google Research recently open-sourced Distilling Step-by-Step, a technique for fine-tuning smaller language models. Distilling Step-by-Step requires less training data than standard fine-tuning and results in smaller models that can outperform few-shot prompted large language models (LLMs) that have 700x the parameters.

Anthony Alford
on Oct 24, 2023
AI, ML & Data Engineering

Nvidia Introduces Eureka, an AI Agent Powered by GPT-4 That Can Train Robots

Nvidia Research revealed that it has created a brand-new AI agent named Eureka that is driven by OpenAI's GPT-4 and is capable of teaching robots sophisticated abilities on its own.

Daniel Dominguez
on Oct 24, 2023

Newer News

Older News

InfoQ Software Architects' Newsletter

News