AI, ML & Data Engineering Content on InfoQ

Articles

AI, ML & Data Engineering

Prompt Injection for Large Language Models

This article covers two common attack vectors against large language models and the tools built on them: prompt injection and prompt stealing. It also introduces three approaches to making your LLM-based systems and tools less vulnerable to these kinds of attacks (fine-tuning, adversarial detectors, and system prompt hardening) and reviews their benefits and limitations.

Georg Dresler
on Feb 03, 2025
Architecture & Design

The End of the Bronze Age: Rethinking the Medallion Architecture

A shift left approach to data processing relies on data products that form the basis of data communication across the business. This addresses many flaws in traditional data processing and makes data more relevant, complete, and trustworthy.

Adam Bellemare
on Jan 29, 2025
AI, ML & Data Engineering

Elevate Developer Experience with Generative AI Capabilities on AWS

This article summarizes a talk I gave at InfoQ Dev Summit Munich 2024 on the transformative potential of generative AI in enhancing developer experiences, particularly through AWS. It introduces key tools like Amazon Bedrock, Code Review Assistant, Agentic Code Generation, and Code Summarization.

Olalekan Elesin
on Jan 27, 2025
AI, ML & Data Engineering

A Framework for Building Micro Metrics for LLM System Evaluation

LLM accuracy is challenging to assess and has far more dimensions than a single accuracy score can capture. Denys Linkov introduces a framework for creating micro metrics to evaluate LLM systems, focusing on goal-aligned metrics that improve performance and reliability. By adopting an iterative "crawl, walk, run" methodology, teams can incrementally develop observability.

Denys Linkov
on Jan 21, 2025
AI, ML & Data Engineering

Navigating Responsible AI in the FinTech Landscape

Explore the dynamic intersection of responsible AI, regulation, and ethics in the FinTech sector. This article highlights key challenges and innovative practices as organizations navigate compliance with evolving guidelines like the EU AI Act. Discover how to balance transparency, efficiency, and risk management for sustainable AI growth in your business.

Lexy Kassan
on Nov 27, 2024
Architecture & Design

Architectural Intelligence – the Next AI

Architectural Intelligence is the ability to look beyond AI hype and identify real AI components. Determining how, where, and when to use AI elements comes down to traditional trade-off analysis. Like any technology, AI can be used creatively, but it can also be used inappropriately. Determine whether AI makes sense for your use case, then work to use it effectively to meet your needs.

Thomas Betts
on Nov 26, 2024
AI, ML & Data Engineering

Efficient Resource Management with Small Language Models (SLMs) in Edge Computing

Small Language Models (SLMs) bring AI inference to the edge without overwhelming resource-constrained devices. In this article, author Suruchi Shah dives into how SLMs can be used in edge computing applications to learn and adapt to patterns in real time, reducing the computational burden and making edge devices smarter.

Suruchi Shah
on Nov 11, 2024
AI, ML & Data Engineering

Being a Responsible Developer in the Age of AI Hype

Justin Sheehy emphasizes that AI is code, not magic, and warns against inflated claims about AI capabilities. He urges developers to approach AI with healthy skepticism, seeking verifiable evidence and focusing on ethical practices, including addressing bias, privacy, and data integrity. Clear communication about AI’s limitations and accountable use are essential to prevent hype and misuse.

Justin Sheehy
on Nov 06, 2024
AI, ML & Data Engineering

Virtual Panel: What to Consider When Adopting Large Language Models

Four experts discuss issues to consider when adopting LLMs and how to make the best choice for a specific use case. Topics include how to choose between an API-based and a self-hosted LLM, when to fine-tune an LLM, how to mitigate LLM risks, and what non-technical changes organizations need to make when adopting LLMs.

Anthony Alford Meryem Arik Numa Dhamani Maggie Engler Tingyi Li
on Oct 01, 2024
AI, ML & Data Engineering

Navigating LLM Deployment: Tips, Tricks, and Techniques

This article focuses on self-hosted LLMs and how to get the best performance from them. The author provides best practices on how to overcome challenges due to model size, GPU scarcity, and a rapidly evolving field.

Meryem Arik
on Sep 24, 2024
AI, ML & Data Engineering

Article Series: Practical Applications of Generative AI

Generative AI (GenAI) has become a major component of the artificial intelligence (AI) and machine learning (ML) industry. However, using GenAI comes with challenges and risks. In the InfoQ "Practical Applications of Generative AI" article series, we present real-world solutions and hands-on practices from leading GenAI practitioners.

Anthony Alford
on Sep 17, 2024
AI, ML & Data Engineering

Llama 3 in Action: Deployment Strategies and Advanced Functionality for Real-World Applications

This article details the enhanced capabilities of the open-source Llama 3 LLM and how businesses can adopt the model in their applications. The author gives step-by-step instructions for deploying Llama 3 in the cloud or on-premises and for leveraging fine-tuned versions for specific tasks.

Tingyi Li
on Sep 17, 2024
