
Three Types of Memory
- Session Storage - Conversation history within a thread (
session_id). Multiple users can share the same thread. - User Memories - Personal facts about each user (
user_id) that persist across all conversations. - Agent Memories - Global knowledge that applies to all users.
Session Storage
Controls how many conversation exchanges the agent sees.- 3 runs = Last 3 back-and-forth exchanges (6 messages)
- 10 runs = Last 10 exchanges (20 messages)
Terminology: “No. of history runs” (UI) =
num_history_responses (code) = max_runs (override)User Memories
Stores personal facts about each user across all conversations. Agentic managed:- ✅ Enabled - Agent automatically creates/updates memories
- ❌ Disabled - You manually create memories via API
Agent Memories
Stores global knowledge that affects all users. Agentic managed:- ✅ Enabled - Agent learns patterns automatically
- ❌ Disabled (Recommended for Production) - Only admins create memories
Max Tool Calls from History
Limits tool calls in context. Set to 0 (disabled) for most cases. Enable only when hitting token limits.Session Summaries
Generates metadata for monitoring/observability dashboards. Does NOT affect context window size.Tool Calls Compression
Compresses verbose tool outputs after N calls. Set threshold to 3-5 for tool-heavy agents.Usage
Memory settings from Workbench are applied automatically.Multi-user threads: Multiple users can share the same
session_id (e.g., support tickets). Each user keeps their own memories, but sees the same thread history.Debug Logs
Enable debug mode to see memory behavior:- Local
- Kubernetes
Troubleshooting
| Issue | Fix |
|---|---|
| Agent doesn’t remember previous messages | Pass same session_id for all messages. Increase history runs to 3+. |
| User memories not working | Pass user_id consistently. Enable User Memories toggle. |
| Hitting token limits | Reduce history runs, enable tool compression, or manage memory programmatically. |

