InfoQ Homepage AI, ML & Data Engineering Content on InfoQ
-
Airbnb Open-Sources its ML Feature Platform Chronon
Chronon, Airbnb's platform which creates the infrastructure required to transform raw data into ML-ready features, is now open source. As Airbnb ML infrastructure engineer Varant Zanoyan explains, Chronon supports a variety of data sources and aims to provide low-latency streaming.
-
Google Announces Agent Builder, Expanded Gemini 1.5, Open-Source Additions
At the Google Cloud Next 2024 event, Google announced the launch of Vertex AI Agent Builder, the public preview of Google's most advanced generative AI model, Gemini 1.5 Pro, and the addition of open-source language models to the Vertex AI platform.
-
Thoughtworks’ VP of Data and AI Shares Insights for Building a Robust Data Product at QCon London
During his QCon London presentation, Danilo Sato, vice president of data & AI at Thoughtworks, reemphasized the importance of using domain-driven design and Team Topologies principles when implementing data products. This ensures effective data encapsulation in a more complex landscape where data responsibilities are “shifting left” towards the developer.
-
Open Source Elastic's OpenTelemetry SDK for .NET
Elastic announced the alpha release of the Elastic distribution of the OpenTelemetry SDK for .NET. This distribution preconfigures the collection of tracing, metrics and logs. Additionally, the distribution ensures that the OTLP exporter is enabled by default. The project is open-sourced.
-
Fix Your SDLC before Adopting Gen AI in Your Organisation: Bannon’s Call to Action at QCon London
During her keynote at QCon London, Tracy Bannon, architect and researcher at MITRE, argued that AI will be able to enhance the software development lifecycle, though currently it’s at the “code completion” rather than “code generation” phase. Throughout her presentation, she continuously stresses the importance of keeping humans in the loop and fixing your company’s SDLC before embracing AI.
-
Google Trains User Interface and Infographics Understanding AI Model ScreenAI
Google Research recently developed ScreenAI, a multimodal AI model for understanding infographics and user interfaces. ScreenAI is based on the PaLI architecture and achieves state-of-the-art performance on several tasks.
-
Java News Roundup: JobRunr 7.0, Introducing the Commonhaus Foundation, Payara Platform, Devnexus
This week's Java roundup for April 8th, 2024, features news highlighting: JobRunr 7.0; introducing the Commonhaus Foundation; the April 2024 edition of Payara Platform; JEP 473, Stream Gatherers (Second Preview), and JEP 469, Vector API (Eighth Incubator), Proposed to Target for JDK 23; and Devnexus 2024.
-
QCon London: Lessons Learned from Building LinkedIn’s AI/ML Data Platform
At the QCon London 2024 conference, Félix GV from LinkedIn discussed the AI/ML platform powering the company’s products. He specifically delved into Venice DB, the NoSQL data store used for feature persistence. The presenter shared the lessons learned from evolving and operating the platform, including cluster management and library versioning.
-
Large Language Models for Code by Loubna Ben Allal at QCon London
At QCon London, Loubna Ben Allal discussed Large Language Models (LLMs) for code. She discussed the lifecycle of code completion models, which consists of pre-training on vast codebases and finetuning and continuous adaptation. She specifically discussed open-source models, which are powered by platforms like Hugging Face.
-
NVIDIA Announces Next-Generation AI Superchip Blackwell
NVIDIA recently announced their next generation GPU architecture, Blackwell. Blackwell is the largest GPU ever built, with over 200 billion transistors, and can train large language models (LLMs) up to 4x faster than previous generation hardware.
-
Navigating LLM Deployment: Tips, Tricks and Techniques by Meryem Arik at QCon London
At QCon London, Meryem Arik discussed deploying Large Language Models (LLMs). While initial proofs of concept benefit from hosted solutions, scaling demands self-hosting to cut costs, enhance performance with tailored models, and meet privacy and security requirements. She emphasized understanding deployment limits, quantization for efficiency, and optimizing inference to fully use GPU resources.
-
Java News Roundup: New JEP Candidates, Project Bisbane, Ktor Plugin Repository, JDKUpdater
This week's Java roundup for April 1st, 2024, features news highlighting: new JEP Candidates: JEP 469, Vector API (Eighth Incubator); JEP 473, Stream Gatherers (Second Preview); and JEP 474, ZGC: Generational Mode by Default, Project Bisbane, and introducing the Ktor Plugin Repository and JDKUpdater.
-
Microsoft Announces Garnet: a New Open-Source Cache-Store and Redis Alternative
Microsoft Research has recently announced Garnet, an open-source cache-store designed to accelerate applications and services. Using the RESP wire protocol, Garnet is a faster alternative to cache-stores and is compatible with existing Redis clients.
-
Nvidia Announces Robotics-Oriented AI Foundational Model
At its recent GTC 2024 event, Nvidia announced a new foundational model to build intelligent humanoid robots. Dubbed GR00T, short for Generalist Robot 00 Technology, the model will understand natural language and be able to observe human actions and emulate human movements.
-
KubeCon EU Keynotes: a Call to Action to Innovate Responsibly with Generative AI
The KubeCon EU morning keynotes were a veritable call to action encouraging the cloud-native community's involvement in building the scalable infrastructure needed by generative AI. This call was balanced with encouragement to make a cloud-native platform’s “golden path” green and sustainable, ensuring that any innovation is also responsible.