InfoQ Homepage AI, ML & Data Engineering Content on InfoQ
-
Java News Roundup: New JEP Drafts, Infinispan 15, Payara Platform, Alpaquita Containers with CRaC
This week's Java roundup for March 11th, 2024, features news highlighting: new JEP drafts, Stream Gatherers (Second Preview) and Hot Code Heap; Infinispan 15; the March 2024 edition of Payara Platform; Alpaquita Containers with CRaC; the first release candidate of JobRunr 7.0; and milestone and point releases for Spring projects, Quarkus, Helidon and Micronaut.
-
OpenAI Releases Transformer Debugger tool
OpenAI has unveiled a new tool called the Transformer Debugger (TDB), designed to provide insights into the inner workings of transformer models. The tool was developed by OpenAI's Superalignment team and combines automated interpretability techniques with sparse autoencoders.
-
Azure OpenAI's “Use Your Data” Feature Now Generally Available
Microsoft has officially made On Your Data generally available in Azure OpenAI Service. This feature enables users to harness the full power of OpenAI models, including GPT-4, and seamlessly integrate the advanced features of the RAG (Retrieval Augmented Generation) model with their data. According to the company, all this is backed by enterprise-grade security on Azure.
-
RWKV Project Open-Sources LLM Eagle 7B
The RWKV Project recently open-sourced Eagle 7B, a 7.52B parameter large language model (LLM). Eagle 7B is trained on 1.1 trillion tokens of text in over 100 languages and outperforms other similarly-sized models on multilingual benchmarks.
-
Java News Roundup: JHipster Blueprints, Jox Channel Library, Infinispan 15.0 Release Candidate
This week's Java roundup for March 4th, 2024, features news highlighting: Version 3.0 releases of Micronaut and Quarkus Blueprints for JHipster, introducing the Jox Channel library, first release candidate of Infinispan 15.0, point releases of Quarkus, Micronaut, Testcontainers for Java and JetBrains Ktor.
-
Google BigQuery Introduces Vector Search
Google recently announced that BigQuery now supports vector search. The new functionality enables vector similarity search required by data and AI use cases such as semantic search, similarity detection, and retrieval-augmented generation (RAG) with a large language model (LLM).
-
Anthropic Unveils Claude 3 Models, Highlighting Opus and Its Near-Human Capabilities
Anthropic has introduced the Claude 3 family models, surpassing other industry models such as GPT-4. The Claude 3 family consists of three distinct models: Haiku, Sonnet, and Opus, arranged in ascending order of capability, each designed to cater to diverse user needs in terms of intelligence, speed, and cost.
-
Amazon Announces One Billion Parameter Speech Model BASE TTS
Amazon Science recently published their work on Big Adaptive Streamable TTS with Emergent abilities (BASE TTS). BASE TTS supports voice-cloning and outperforms baseline TTS models when evaluated by human judges. Further, Amazon's experiments show that scaling model and data size improves the subjective quality of the model's output.
-
Mistral AI Models Are Now Available on Amazon Bedrock
Mistral AI has made its Mixtral 8x7B and Mistral 7B foundation models available on Amazon Bedrock. These models, now accessible via Amazon Bedrock's single API, aim to offer users a broader selection of high-performing models for building generative AI applications.
-
Java News Roundup: New OpenJDK JEPs, Spring Functions Catalog, Apache Kafka, Quarkus, JReleaser
This week's Java roundup for February 26th, 2024, features news highlighting: JEP 468, Derived Record Creation (Preview); JEP 467, Markdown Documentation Comments; a new Spring Functions Catalog; end-of-life planned for the Spring Framework 6.0 and 5.3 release trains; and point releases for Apache Kafka, Quarkus and JReleaser.
-
Data Solutions Framework: an Open Source Project for Building Data Solutions on AWS
AWS recently released the Data Solutions Framework (DSF), an opinionated open-source framework designed to accelerate the creation of data solutions on AWS. Built using the AWS CDK, the framework exposes abstractions and patterns as building blocks for constructing data solutions and is available in TypeScript (npm) and Python (PyPi).
-
LLM Strategies, Platform Engineering, Observability and More: InfoQ Dev Summit Boston 2024
Join us at InfoQ Dev Summit Boston, running June 24-25, to network with your peers and experience a curated agenda with topics such as Generative AI, security, and modern web applications. The conference will feature 20+ technical talks by senior software practitioners over two days, with parallel breakout sessions emphasizing the essential topics development teams should prioritize now.
-
Google Announces 200M Parameter AI Forecasting Model TimesFM
Google Research announced TimesFM, a 200M parameter Transformer-based foundation model for time-series forecasting. TimesFM is trained on nearly 100B data points and has zero-shot forecasting performance comparable to or better than supervised-learning models.
-
Google Introduces Gemma, a New Open Source AI Model for Developers
Google announced the launch of Gemma, a new open source AI model. Developed using the technology that underpinned Google's Gemini AI models, Gemma aims to provide developers with advanced tools to create AI applications conscientiously.
-
Baseline OpenAI End-to-End Chat Reference Architecture
Microsoft published the baseline OpenAI end-to-end chat reference architecture. This baseline contains information about components, flows and security. There are also details about performance, monitoring and deployment guidance. Microsoft also prepared the reference implementation to deploy and run the solution.