InfoQ Homepage Machine Learning Content on InfoQ
-
Scale out Batch Inference with Ray
Cody Yu discusses how to build a scalable and efficient batch inference stack using Ray.
-
Why Most Machine Learning Projects Fail to Reach Production and How to Beat the Odds
Wenjie Zi discusses common pitfalls that cause these failures, such as the inherent uncertainty of machine learning, misaligned optimization objectives, and skill gaps among practitioners.
-
Navigating LLM Deployment: Tips, Tricks, and Techniques
Meryem Arik discusses some of the best practices in model optimization, serving and monitoring - with practical tips and real case-studies.
-
Rethinking Connectivity at the Edge: Scaling Fleets of Low-Powered Devices Using NATS.io
Jeremy Saenz discusses NATS, an open-source project for services communication, and how to leverage NATS to streamline communication and fleet management for devices at the edge.
-
Generative Search: Practical Advice for Retrieval Augmented Generation (RAG)
Sam Partee discusses Vector embeddings in LLMs, a tool capable of capturing the essence of unstructured data used by LLMs to gain access to a wealth of contextually relevant knowledge.
-
Being a Responsible Developer in the Age of AI Hype
Justin Sheehy discusses the dramatic developments in some areas of artificial intelligence and the need for the responsible use of AI systems.
-
Unpacking How Ads Ranking Works @Pinterest
Aayush Mudgal discusses social media advertising, unpacking how Pinterest harnesses the power of Deep Learning Models and big data to tailor relevant advertisements to the pinners.
-
Large Language Models for Code: Exploring the Landscape, Opportunities, and Challenges
Loubna Ben Allal discusses Large Language Models (LLMs), exploring the current developments of these models, how they are trained, and how they can be leveraged with custom codebases.
-
Modern Compute Stack for Scaling Large AI/ML/LLM Workloads
Jules Damji discusses which infrastructure should be used for distributed fine-tuning and training, how to scale ML workloads, how to accommodate large models, and how CPUs and GPUs can be utilized.
-
Platform and Features MLEs, a Scalable and Product-Centric Approach for High Performing Data Products
Massimo Belloni discusses the lessons learnt in the last couple of years around organizing a Data Science Team and the Machine Learning Engineering efforts at Bumble Inc.
-
Going beyond the Case of Black Box AutoML
Kiran Kate covers the basics of AutoML and then presents Lale (https://github.com/IBM/lale), an open-source scikit-learn compatible AutoML library which implements Gradual AutoML.
-
Simplifying Real-Time ML Pipelines with Quix Streams
Tomáš Neubauer discusses Quix Streams, an open-source Python library that helps data scientists and ML engineers to build real-time ML pipelines.