Large language models Content on InfoQ

News

AI, ML & Data Engineering

Google Renames Bard to Gemini

Google announced that their Bard chatbot will now be called Gemini. The company also announced the launch of Gemini Advanced, the largest version of their Gemini language model, along with two new mobile apps for interacting with the model.

Anthony Alford
on Feb 20, 2024
AI, ML & Data Engineering

OpenAI Launches AI Text-to-Video Generator Sora

Sora is OpenAI's new generative AI model for creating videos from text prompts. Currently in preview, the model can create photorealistic videos up to 60 seconds long, drawing on its ability to understand how things exist in the real world and combining multiple shots without disrupting characters or style.

Sergio De Simone
on Feb 16, 2024
AI, ML & Data Engineering

NVIDIA Introduces Metropolis Microservices for Jetson to Run AI Apps at the Edge

NVIDIA has expanded its cloud-based Metropolis Microservices AI solution to run on the NVIDIA Jetson IoT embedded platform, including support for video streaming and AI-based perception.

Sergio De Simone
on Feb 08, 2024
AI, ML & Data Engineering

Meta Releases Code Generation Model Code Llama 70B, Nearing GPT-3.5 Performance

Code Llama 70B is Meta's new code generation AI model. Thanks to its 70 billion parameters, it is "the largest and best-performing model in the Code Llama family", Meta says.

Sergio De Simone
on Jan 31, 2024
AI, ML & Data Engineering

Stability AI Releases 1.6 Billion Parameter Language Model Stable LM 2

Stability AI released two sets of pre-trained model weights for Stable LM 2, a 1.6B parameter language model. Stable LM 2 is trained on 2 trillion tokens of text data from seven languages and can be run on common laptop computers.

Anthony Alford
on Jan 30, 2024
DevOps

LeftoverLocals May Leak LLM Responses on Apple, Qualcomm, and AMD GPUs

Security firm Trail of Bits disclosed a vulnerability that allows malicious actors to recover data from GPU local memory on Apple, Qualcomm, AMD, and Imagination GPUs. Dubbed LeftoverLocals, the vulnerability affects any application using the GPU, including large language models (LLMs) and other machine learning (ML) models.

Sergio De Simone
on Jan 25, 2024
AI, ML & Data Engineering

Mistral AI's Open-Source Mixtral 8x7B Outperforms GPT-3.5

Mistral AI recently released Mixtral 8x7B, a sparse mixture of experts (SMoE) large language model (LLM). The model contains 46.7B total parameters, but performs inference at the same speed and cost as models one-third that size. On several LLM benchmarks, it outperformed both Llama 2 70B and GPT-3.5, the model powering ChatGPT.

Anthony Alford
on Jan 23, 2024
AI, ML & Data Engineering

LLMs May Learn Deceptive Behavior and Act as Persistent Sleeper Agents

AI researchers at OpenAI competitor Anthropic trained proof-of-concept LLMs that show deceptive behavior triggered by specific hints in the prompts. Furthermore, they say, once deceptive behavior was trained into the model, standard safety training techniques could not remove it.

Sergio De Simone
on Jan 20, 2024
AI, ML & Data Engineering

Google Announces Video Generation LLM VideoPoet

Google Research recently published their work on VideoPoet, a large language model (LLM) that can generate video. VideoPoet was trained on 2 trillion tokens of text, audio, image, and video data, and in evaluations by human judges its output was preferred over that of other models.

Anthony Alford
on Jan 16, 2024
AI, ML & Data Engineering

OpenAI GPT Store is a Nascent Marketplace for Custom ChatGPTs

OpenAI has started rolling out its new GPT Store, announced a few months ago along with GPTs, to provide a mechanism for ChatGPT Plus, Team and Enterprise users to share custom ChatGPT-based chatbots they create.

Sergio De Simone
on Jan 12, 2024
AI, ML & Data Engineering

OpenAI Adopts Preparedness Framework for AI Safety

OpenAI recently published a beta version of their Preparedness Framework for mitigating AI risks. The framework lists four risk categories and definitions of risk levels for each, as well as defining OpenAI's safety governance procedures.

Anthony Alford
on Jan 09, 2024
Architecture & Design

Griffin 2.0: Instacart Revamps Its Machine Learning Platform

Instacart built its next-generation platform based on experience with the original Griffin machine-learning platform. The company wanted to improve user experience and help manage all ML workloads. The revamped platform leverages the latest developments in MLOps and introduces new capabilities for current and future applications.

Rafal Gancarz
on Jan 01, 2024
Java

Quarkus LangChain4J Extension Allows Developers to Integrate LLMs in Their Quarkus Applications

Inspired by the presentation “Java Meets AI” at Devoxx BE 2023, the Quarkus team started working on an extension based on the LangChain4J library, the Java re-implementation of the LangChain library. The extension allows developers to integrate LLMs into their Quarkus applications. The current version is 0.5. The extension was built using Quarkus' usual declarative style, resembling the REST client.

Olimpiu Pop
on Dec 29, 2023
AI, ML & Data Engineering

OpenAI Publishes GPT Prompt Engineering Guide

OpenAI recently published a guide to Prompt Engineering. The guide lists six strategies for eliciting better responses from their GPT models, with a particular focus on examples for their latest version, GPT-4.

Anthony Alford
on Dec 26, 2023
AI, ML & Data Engineering

Microsoft Announces Small Language Model Phi-2

Microsoft Research announced Phi-2, a 2.7 billion-parameter Transformer-based language model. Phi-2 is trained on 1.4T tokens of synthetic data generated by GPT-3.5 and outperforms larger models on a variety of benchmarks.

Anthony Alford
on Dec 19, 2023
