InfoQ Homepage AI, ML & Data Engineering Content on InfoQ
-
InfoQ Dev Summit Munich: Learn from German Automotive, Banking, and TelCo Software Practitioners
InfoQ Dev Summit Munich is a two-day in-person software development conference for senior software engineers, architects, and team leaders in the Bavarian capital on September 26th and 27th. The sessions will cover critical topics such as generative AI and platform engineering, with use cases from the German automotive, banking, and telecommunication industries.
-
Java News Roundup: JEPs Targeted for JDK 23, JHipster 8.5, Gradle 8.8, Spring AI 1.0-M1
This week's Java roundup for May 27th, 2024, features news highlighting: four JEPs targeted for JDK 23, namely: JEP 482, Flexible Constructor Bodies (Second Preview), JEP 481, Scoped Values (Third Preview), JEP 480, Structured Concurrency (Third Preview) and JEP 471, Deprecate the Memory-Access Methods in Unsafe for Removal; and the releases of JHipster 8.5, Gradle 8.8 and Spring AI 1.0-M1.
-
Cloudflare AI Gateway Now Generally Available
Cloudflare has recently announced that AI Gateway is now generally available. Described as a unified interface for managing and scaling generative AI workloads, AI Gateway allows developers to gain visibility and control over AI applications.
-
University of Washington AI-Powered Headphones Let Users Listen to a Single Person in a Crowd
"Target speech hearing" is a new deep-learning algorithm developed at the University of Washington to allow users to "enroll" a speaker and cancel all environmental noise surrounding their voice.
-
JLama: The First Pure Java Model Inference Engine Implemented With Vector API and Project Panama
Karpathy's 700-line llama.c inference interface demystified how developers can interact with LLMs. Even before that, JLama started its journey of becoming the first pure Java-implemented inference engine for any Hugging Face model, from Gemma to Mixtral. Leveraging the new Vector API and PanamaTensorOperations class with native fallback the library is available in Maven Central.
-
Stanford AI Index 2024 Report: Growth of AI Regulations and Generative AI Investment
Stanford University’s Institute for Human-Centered Artificial Intelligence (HAI) has published its 2024 AI Index annual report. The report identifies top trends in AI, such as 8x growth in Generative AI investment since 2022.
-
NIST Launches Program to Discriminate How Far from "Human-Quality" are Gen AI Generated Summaries
NIST launched a public Gen AI evaluation program for systems developed by the international research community. The pilot program focuses on systems that can generate human-like summaries from multiple documents, or discriminators to identify whether a summary was AI-generated. For now, information about text-to-text modality is available. The registration closes in May.
-
Java News Roundup: Java Turns 29, Kotlin 2.0, Semantic Kernel for Java 1.0, More OpenJDK Updates
This week's Java roundup for May 20th, 2024, features news highlighting: Java’s 29th birthday; the release of Kotlin 2.0 and Semantic Kernel for Java 1.0; JEP 477, Implicitly Declared Classes and Instance Main Methods (Third Preview), targeted for JDK 23; and four JEPs proposed to target for JDK 23.
-
Spring Ecosystem Releases Focus on Spring Boot, Spring Session and Spring Security
There was a flurry of activity in the Spring ecosystem during the week of May 20th, 2024, highlighting GA releases of: Spring Boot 3.3.0, Spring Security 6.3.0, Spring Session 3.3.0 and Spring Integration 1.3.0.
-
AWS Introduces Amazon Bedrock Studio for Building Generative AI Applications
AWS has recently announced Amazon Bedrock Studio, a web interface for developers to collaborate and build generative AI applications. Currently in public preview, the rapid prototyping environment provides access to multiple foundation models, knowledge bases, agents, and guardrails.
-
Recap of Google I/O 2024: Gemini 1.5, Project Astra, AI-powered Search Engine
Google recently hosted its annual developer conference, Google I/O 2024, where numerous announcements were made regarding Google’s apps and services. As anticipated, AI was a focal point of the event, being incorporated into almost all Google products. Here is a summary of the major announcements from the event.
-
Google Brings Gemini Nano to Chrome to Enable On-Device Generative AI
At its Google I/O 2024 developer conference, Google announced it is working to make support for on-device large language models a reality by bringing the smallest of its Gemini models, Gemini Nano, to Chrome.
-
InfoQ Dev Summit Boston: Save up to 60% with Our Special Memorial Day Sale
InfoQ Dev Summit Boston (June 24-25) is running a special Memorial Day Sale with significant discounts of up to 60% off. This is your chance to learn actionable insights from 20+ senior software practitioners on today's critical development priorities. Get clarity on Generative AI, security, modern web apps, and more from senior software practitioners with two days of technical talks.
-
OpenAI Announces New Flagship Model GPT-4o
OpenAI recently announced the latest version of their GPT AI foundation model, GPT-4o. GPT-4o is faster than the previous version of GPT-4 and has improved capabilities in handling speech, vision, and multilingual tasks, outperforming all models except Google's Gemini on several benchmarks.
-
AI Lab Extension Allows Podman Desktop Users to Experiment with LLMs Locally
One year after its 1.0 release, Podman Desktop announced the Podman AI Lab plugin promising to help developers start working with Large Language Models on their machines. Podman AI Lab streamlines LLM workflows featuring generative AI exploration, built-in recipe catalogue, curated models, local model serving, OpenAI-compatible API, code snippets, and playground environments.