InfoQ Homepage AI, ML & Data Engineering Content on InfoQ
-
Google BigQuery Introduces Vector Search
Google recently announced that BigQuery now supports vector search. The new functionality enables vector similarity search required by data and AI use cases such as semantic search, similarity detection, and retrieval-augmented generation (RAG) with a large language model (LLM).
-
Anthropic Unveils Claude 3 Models, Highlighting Opus and Its Near-Human Capabilities
Anthropic has introduced the Claude 3 family models, surpassing other industry models such as GPT-4. The Claude 3 family consists of three distinct models: Haiku, Sonnet, and Opus, arranged in ascending order of capability, each designed to cater to diverse user needs in terms of intelligence, speed, and cost.
-
Amazon Announces One Billion Parameter Speech Model BASE TTS
Amazon Science recently published their work on Big Adaptive Streamable TTS with Emergent abilities (BASE TTS). BASE TTS supports voice-cloning and outperforms baseline TTS models when evaluated by human judges. Further, Amazon's experiments show that scaling model and data size improves the subjective quality of the model's output.
-
Mistral AI Models Are Now Available on Amazon Bedrock
Mistral AI has made its Mixtral 8x7B and Mistral 7B foundation models available on Amazon Bedrock. These models, now accessible via Amazon Bedrock's single API, aim to offer users a broader selection of high-performing models for building generative AI applications.
-
Java News Roundup: New OpenJDK JEPs, Spring Functions Catalog, Apache Kafka, Quarkus, JReleaser
This week's Java roundup for February 26th, 2024, features news highlighting: JEP 468, Derived Record Creation (Preview); JEP 467, Markdown Documentation Comments; a new Spring Functions Catalog; end-of-life planned for the Spring Framework 6.0 and 5.3 release trains; and point releases for Apache Kafka, Quarkus and JReleaser.
-
Data Solutions Framework: an Open Source Project for Building Data Solutions on AWS
AWS recently released the Data Solutions Framework (DSF), an opinionated open-source framework designed to accelerate the creation of data solutions on AWS. Built using the AWS CDK, the framework exposes abstractions and patterns as building blocks for constructing data solutions and is available in TypeScript (npm) and Python (PyPi).
-
LLM Strategies, Platform Engineering, Observability and More: InfoQ Dev Summit Boston 2024
Join us at InfoQ Dev Summit Boston, running June 24-25, to network with your peers and experience a curated agenda with topics such as Generative AI, security, and modern web applications. The conference will feature 20+ technical talks by senior software practitioners over two days, with parallel breakout sessions emphasizing the essential topics development teams should prioritize now.
-
Google Announces 200M Parameter AI Forecasting Model TimesFM
Google Research announced TimesFM, a 200M parameter Transformer-based foundation model for time-series forecasting. TimesFM is trained on nearly 100B data points and has zero-shot forecasting performance comparable to or better than supervised-learning models.
-
Google Introduces Gemma, a New Open Source AI Model for Developers
Google announced the launch of Gemma, a new open source AI model. Developed using the technology that underpinned Google's Gemini AI models, Gemma aims to provide developers with advanced tools to create AI applications conscientiously.
-
Baseline OpenAI End-to-End Chat Reference Architecture
Microsoft published the baseline OpenAI end-to-end chat reference architecture. This baseline contains information about components, flows and security. There are also details about performance, monitoring and deployment guidance. Microsoft also prepared the reference implementation to deploy and run the solution.
-
State of FinOps 2024: Reducing Waste and Embracing AI
In the 2024 State of FinOps survey, Engineering Enablement has been replaced by a focus on cost and waste reduction. This shows maturity of FinOps as the persona getting the most value from FinOps remains the engineer. The confluence of AI and FinOps observability aims to optimise cloud spend visibility and improve insights into early AI experimentation, as well as sustainability goals.
-
Java News Roundup: JDK 22 RC2, Multiple Releases in the Spring Ecosystem, Payara Platform
This week's Java roundup for February 19th, 2024, features news highlighting: JDK 22 in release candidate 2, the February 2024 edition of the Payara Platform, numerous milestone and point releases in the Spring ecosystem, multiple releases of Apache Tomcat and Log4j and Gradle 8.7-RC1.
-
Amazon Q Data Integration in AWS Glue Simplifies Data Transformation on AWS
Recently, AWS announced the preview of a new feature for AWS Glue, enabling customers to use natural language for authoring and troubleshooting data integration jobs. With Amazon Q data integration in AWS Glue, developers can provide a description of their data integration workload, and the service will generate an ETL script.
-
Google Introduces Firestore Multiple Databases
Google Cloud recently announced the general availability of Firestore Multiple Databases. The new feature is designed to isolate customer data and facilitate the management of microservices, as well as development, test, and staging environments.
-
Google Renames Bard to Gemini
Google announced that their Bard chatbot will now be called Gemini. The company also announced the launch of Gemini Advanced, the largest version of their Gemini language model, along with two new mobile apps for interacting with the model.