Hugging Face Content on InfoQ
Podcasts
Meryem Arik on LLM Deployment, State-of-the-Art RAG Apps, and Inference Architecture Stack
In this podcast, Meryem Arik, co-founder and CEO of TitanML, discusses innovations in generative AI and large language model (LLM) technologies, including the current state of large language models, LLM deployment, state-of-the-art retrieval-augmented generation (RAG) apps, and the inference architecture stack for LLM applications.