Elena Samuylova on Large Language Model (LLM)-Based Application Evaluation and LLM as a Judge
In this podcast, InfoQ spoke with Elena Samuylova of Evidently AI about best practices for evaluating Large Language Model (LLM)-based applications. She also discussed tools for evaluating, testing, and monitoring AI-powered applications.