InfoQ Homepage Large language models Content on InfoQ
-
Navigating LLM Deployment: Tips, Tricks, and Techniques
Meryem Arik shares best practices for self-hosting LLMs in corporate environments, highlighting the importance of cost efficiency and performance optimization.
-
How GitHub Copilot Serves 400 Million Completion Requests a Day
David Cheney explains the architecture powering GitHub Copilot, detailing how they achieve sub-200ms response times for millions of daily requests.
-
Leveraging Open-source LLMs for Production
Andrey Cheptsov discusses the practical use of open-source LLMs for real-world applications, weighing their pros and cons, highlighting advantages like privacy and cost-efficiency.
-
Taking LLMs out of the Black Box: A Practical Guide to Human-in-the-Loop Distillation
Ines Montani discusses practical solutions for using the latest LLMs in real-world applications and explores how to distill knowledge into smaller and faster components.
-
Navigating LLM Deployment: Tips, Tricks, and Techniques
Meryem Arik discusses some of the best practices in model optimization, serving and monitoring - with practical tips and real case-studies.
-
Manipulating the Machine: Prompt Injections and Countermeasures
Georg Dresler discusses various methods to perform prompt injection to extract system prompts and documents used by GPTs, and ways to integrate countermeasures to protect against stealing information.
-
Poetry4Shellz – Avoiding Limerick Based Exploitation and Safely Using AI in Your Apps
Rich Smith provides a case study of a real world LLM based app that is vulnerable to a variety of attack vectors that illustrate the challenges to account for when integrating today's LLM technologies
-
Mind Your Language Models: an Approach to Architecting Intelligent Systems
Nischal HP discusses the intricacies of designing and implementing intelligent systems powered by LLMs, drawing upon practical insights gained from real-world deployments.
-
Generative Search: Practical Advice for Retrieval Augmented Generation (RAG)
Sam Partee discusses Vector embeddings in LLMs, a tool capable of capturing the essence of unstructured data used by LLMs to gain access to a wealth of contextually relevant knowledge.
-
Defensible Moats: Unlocking Enterprise Value with Large Language Models
Nischal HP discusses risk mitigation, environmental, social, and governance (ESG) framework implementation to achieve sustainability goals, strategic procurement, spend analytics, data compliance.
-
When AIOps Meets MLOps: What it Takes to Deploy ML Models at Scale
Ghida Ibrahim introduces the concept of AIOps referring to using AI and data-driven tooling to provision, manage and scale distributed IT infra.
-
Reach Next-Level Autonomy with LLM-Based AI Agents
Tingyi Li discusses the AI Agent, exploring how it extends the frontiers of Generative AI applications and leads to next-level autonomy in combination with enterprise data.