Episodic memory is what lets an agent remember what happened in past sessions. This part builds a complete production episodic memory system using PostgreSQL with pgvector, implementing hybrid time-based and semantic retrieval in Node.js so your agent never starts from zero again.
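As a rough illustration of what "hybrid time-based and semantic retrieval" can mean, here is a minimal in-memory sketch in Node.js. It is not the post's implementation and does not touch PostgreSQL or pgvector. The function names, the 0.7 semantic weight, the seven-day half-life, and the toy two-dimensional embeddings are all assumptions chosen for the example. In a real system the similarity search would typically run inside the database, and the embeddings would come from an embedding model.

```javascript
// Hypothetical sketch: rank past episodes by a blend of semantic
// similarity and recency. All names and weights are illustrative.

function cosineSimilarity(a, b) {
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  if (normA === 0 || normB === 0) return 0; // avoid dividing by zero
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Exponential decay: 1.0 for a brand-new memory, 0.5 after one half-life.
function recencyScore(createdAtMs, nowMs, halfLifeMs) {
  const ageMs = Math.max(0, nowMs - createdAtMs);
  return Math.pow(0.5, ageMs / halfLifeMs);
}

const DAY_MS = 24 * 60 * 60 * 1000;

// Weighted sum of the two signals, highest score first.
function rankEpisodes(episodes, queryEmbedding, nowMs, options = {}) {
  const { semanticWeight = 0.7, halfLifeMs = 7 * DAY_MS, limit = 5 } = options;
  return episodes
    .map((episode) => ({
      ...episode,
      score:
        semanticWeight * cosineSimilarity(episode.embedding, queryEmbedding) +
        (1 - semanticWeight) * recencyScore(episode.createdAtMs, nowMs, halfLifeMs),
    }))
    .sort((x, y) => y.score - x.score)
    .slice(0, limit);
}

// Usage with toy data: a recent, relevant episode should outrank an old
// relevant one, and both should outrank a recent but unrelated one.
const now = Date.UTC(2025, 0, 15);
const sampleEpisodes = [
  { id: "old-relevant", embedding: [1, 0], createdAtMs: now - 30 * DAY_MS },
  { id: "new-unrelated", embedding: [0, 1], createdAtMs: now - 1 * DAY_MS },
  { id: "new-relevant", embedding: [0.9, 0.1], createdAtMs: now - 1 * DAY_MS },
];
const topEpisodes = rankEpisodes(sampleEpisodes, [1, 0], now, { limit: 2 });
console.log(topEpisodes.map((episode) => episode.id));
```

The weighted sum is only one way to combine the two signals. Filtering by a time window first and then ranking by similarity is an equally plausible design.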
Tag: vector embeddings
Semantic Caching with Redis 8.6: Vector Similarity Matching for LLM Cost Optimization in Production
Semantic caching operates above the model layer, using vector embeddings to match incoming queries against similar queries whose responses have already been computed. With Redis 8.6, cache hit rates of 80 percent or higher are achievable, and each hit is answered without calling the LLM at all. This part covers the full architecture, similarity thresholds, cache invalidation, and production implementations in both Node.js and Python.
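The core idea of a similarity threshold can be sketched without Redis at all. The in-memory class below is a hypothetical illustration, not the post's implementation and not the Redis API. The class name, the 0.9 threshold, the one-hour TTL, and the toy two-dimensional embeddings are assumptions. A production setup would store the vectors in Redis and let the server run the nearest-neighbour search.

```javascript
// Hypothetical sketch: a semantic cache that returns a stored response
// when a new query's embedding is close enough to a cached one.

function cosine(a, b) {
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  if (normA === 0 || normB === 0) return 0; // avoid dividing by zero
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

class SemanticCache {
  constructor({ threshold = 0.9, ttlMs = 60 * 60 * 1000 } = {}) {
    this.threshold = threshold; // minimum similarity that counts as a hit
    this.ttlMs = ttlMs; // simple time-based invalidation
    this.entries = [];
  }

  set(embedding, response, nowMs = Date.now()) {
    this.entries.push({ embedding, response, expiresAt: nowMs + this.ttlMs });
  }

  get(embedding, nowMs = Date.now()) {
    // Drop expired entries before searching.
    this.entries = this.entries.filter((entry) => entry.expiresAt > nowMs);

    // Linear scan for the most similar cached query.
    let best = null;
    let bestScore = -Infinity;
    for (const entry of this.entries) {
      const score = cosine(entry.embedding, embedding);
      if (score > bestScore) {
        best = entry;
        bestScore = score;
      }
    }

    if (best !== null && bestScore >= this.threshold) {
      return { hit: true, response: best.response, score: bestScore };
    }
    return { hit: false }; // caller falls through to the LLM
  }
}

// Usage with toy embeddings: a near-duplicate query hits, an unrelated one misses.
const demoCache = new SemanticCache({ threshold: 0.9, ttlMs: 1000 });
demoCache.set([1, 0], "cached answer", 0);
console.log(demoCache.get([0.99, 0.05], 500).hit);
console.log(demoCache.get([0, 1], 500).hit);
```

The threshold is the main tuning knob. Set it too low and the cache returns answers to questions that only look similar; set it too high and near-duplicates miss and still reach the LLM.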