InfoQ Homepage Caching Content on InfoQ
-
Reducing False Positives in Retrieval-Augmented Generation (RAG) Semantic Caching: a Banking Case Study
In this article, author Elakkiya Daivam discusses why Retrieval Augmented Generation (RAG) and semantic caching techniques are powerful levers for reducing false positives in AI powered applications. She shares the insights from a production-grade evaluation with 1,000 query variations tested across seven bi-encoder models.
-
Building a Global Caching System at Netflix: a Deep Dive to Global Replication
Netflix's EVCache system handles 400M ops/second across 22,000 servers, managing 14.3 PB of data. This infrastructure ensures global availability and resilience through intelligent data routing and flexible replication strategies. By implementing batch compression and switching to DNS-based discovery, Netflix optimizes efficiency, reduces bandwidth usage and significantly lowers operational costs.
-
Custom Response Caching Using NCache in ASP.NET Core
NCache is a cross-platform, open-source distributed caching framework from Alachisoft. It is an extremely fast distributed caching framework that is linearly scalable. This article presents a discussion on how we can work with NCache and response caching middleware in ASP.NET Core.
-
Article Series - .NET Core - 2nd Series
In this series, we explore some of the benefits .NET Core and how it can help traditional .NET developers and all technologists who need to bring robust, performant and economical solutions to market
-
Chris Fregly on the PANCAKE STACK Workshop and Data Pipelines
InfoQ Interviews Chris Fregly, organizer for the 4000+ member Advanced Spark and TensorFlow Meetup about the PANCAKE STACK workshop, Spark and building data pipelines for a machine learning pipeline
-
Mobile Apps Offline Support
Offline support for mobile applications can be thought of as the ability for the app to react gracefully to the lack of connectivity. The rather new context of mobile devices introduced problems such as presence or absence of a network connection or even high latency and low bandwidth. This article covers approaches to these problems in the field of mobile app development.
-
Review and Interview with Dean Hume - Author of Fast ASP.NET Websites
In the review, Anand examines the coverage of Fast ASP.NET Websites authored by Dean Hume. The book helps developers to learn the various techniques associated with the optimization of ASP.NET websites.
-
Infinispan's GridFileSystem - An In-Memory Grid File System
Infinispan is an open source data grid platform that makes use of distributing state across nodes in a cluster. GridFileSystem is a new, experimental API that exposes an Infinispan-backed data grid as a file system. In this article, authors discuss distributed mode of Infinispan and how GridFS framework manages data caching by chunking up data using a new streaming API and storing them in a grid.
-
Extreme Transaction Processing Patterns: Write-behind Caching
Lan Vuong shows how to optimize the performance of an application by leveraging the write-behind caching pattern which sends batch updates to the back-end database asynchronously within a user configurable interval of time, instead of doing sychronous write-through updates typical in web apps.
-
Implementing Master-Worker with Terracotta
A real world case study of a consultancy that distributed the load & increased scalability of its applications using Terracotta using the Master/Worker pattern.
-
Interview: Frank Cohen on FastSOA
InfoQ today publishes a one-chapter excerpt from Frank Cohen's book "FastSOA". On this occasion, InfoQ had a chance to talk to Frank Cohen, creator of the FastSOA methodology, about the issues when trying to process XML messages, scalability, using XQuery in the middle tier, and document-object-relational-mapping.
-
Web Applications with Spring Web Flow and Terracotta for Spring
In this article we will first give you an overview of Spring Web Flow and Terracotta for Spring, and after that show you how you can use these technologies together to enter a new dimension in writing stateful, conversational, scalable and highly available web applications.