InfoQ Homepage Large language models Content on InfoQ
-
How Green is Green: LLMs to Understand Climate Disclosure at Scale
Leo Browning explains the journey of developing a Retrieval Augmented Generation (RAG) system at a climate-focused startup.
-
LLM and Generative AI for Sensitive Data - Navigating Security, Responsibility, and Pitfalls in Highly Regulated Industries
Stefania Chaplin and Azhir Mahmood discuss responsible, secure, and explainable AI in regulated industries. Learn MLOps, legislation, and future trends.
-
Unleashing Llama's Potential: CPU-Based Fine-Tuning
Anil Rajput and Rema Hariharan detail CPU-based LLM (Llama) optimization strategies for performance and TCO reduction.
-
Navigating LLM Deployment: Tips, Tricks, and Techniques
Meryem Arik shares best practices for self-hosting LLMs in corporate environments, highlighting the importance of cost efficiency and performance optimization.
-
How GitHub Copilot Serves 400 Million Completion Requests a Day
David Cheney explains the architecture powering GitHub Copilot, detailing how they achieve sub-200ms response times for millions of daily requests.
-
Leveraging Open-source LLMs for Production
Andrey Cheptsov discusses the practical use of open-source LLMs for real-world applications, weighing their pros and cons, highlighting advantages like privacy and cost-efficiency.
-
Taking LLMs out of the Black Box: A Practical Guide to Human-in-the-Loop Distillation
Ines Montani discusses practical solutions for using the latest LLMs in real-world applications and explores how to distill knowledge into smaller and faster components.
-
Navigating LLM Deployment: Tips, Tricks, and Techniques
Meryem Arik discusses some of the best practices in model optimization, serving and monitoring - with practical tips and real case-studies.
-
Manipulating the Machine: Prompt Injections and Countermeasures
Georg Dresler discusses various methods to perform prompt injection to extract system prompts and documents used by GPTs, and ways to integrate countermeasures to protect against stealing information.
-
Poetry4Shellz – Avoiding Limerick Based Exploitation and Safely Using AI in Your Apps
Rich Smith provides a case study of a real world LLM based app that is vulnerable to a variety of attack vectors that illustrate the challenges to account for when integrating today's LLM technologies
-
Mind Your Language Models: an Approach to Architecting Intelligent Systems
Nischal HP discusses the intricacies of designing and implementing intelligent systems powered by LLMs, drawing upon practical insights gained from real-world deployments.
-
Generative Search: Practical Advice for Retrieval Augmented Generation (RAG)
Sam Partee discusses Vector embeddings in LLMs, a tool capable of capturing the essence of unstructured data used by LLMs to gain access to a wealth of contextually relevant knowledge.