Agno is the framework xpander.ai has the deepest integration with. In this guide, we’ll build a production-ready agent with credentials, instructions, tools, knowledge-base access, session DB, memory, guardrails, and a context-optimization pipeline.Documentation Index
Fetch the complete documentation index at: https://docs.xpander.ai/llms.txt
Use this file to discover all available pages before exploring further.
Prerequisites
- Complete the Quickstart so the CLI, SDK, and
xpander loginare already set up. - Python 3.12+ for the local handler.
- An LLM provider key in your shell like
OPENAI_API_KEY,ANTHROPIC_API_KEY. (These keys will only be used locally.)
1. Install
2. Set up scaffolding
xpander_config.json reference
xpander_config.json
3. Create task handler
The full pattern, wrapped in@on_task so the platform routes tasks to it:
xpander_handler.py
Backend(configuration=task.configuration)picks up the API key, organization ID, and base URL from the active task. No need to read.envdirectly.await backend.aget_args(task=task)calls the control plane and returns a dict with the full agent configuration (instructions, tools, model, knowledge bases, session storage, memory, guardrails). Always passtask=taskinside an@on_taskhandler so task-level overrides (instructions_override,expected_output,output_schema) are merged in.Agent(**agno_args, debug_mode=True)splats that dict into Agno’s ownAgentclass.debug_modeprints tool calls and token usage; remove it for production.task.to_message()flattens the prompt text, file URLs, and any inline-readable file content into a single string ready for Agno.task.get_files()andtask.get_images()return Agno-typedagno.media.Fileandagno.media.Imageobjects.- Reporting
task.tokensandtask.used_toolsis optional. Skipping them just means the metrics view in Agent Studio shows “no usage data” for that run.
backend.aget_args reference
Input parameters
backend.aget_args accepts these arguments:
| Key | Type | Notes |
|---|---|---|
task | Task | The active task inside an @on_task handler. Pass it so task-level overrides (instructions_override, expected_output, output_schema) are merged into the resolved args. Required inside @on_task; optional in scripts and notebooks where you can pass agent_id= instead. |
override | dict | dict.update()-merged onto the resolved args after everything else is built. Accepts any key Agno’s Agent.__init__ takes (see below). Optional. Defaults to None. |
tools | list[Callable] | Appended to the resolved tools list (connectors plus custom plus MCP). Use it to inject ephemeral tools without touching the agent’s graph. Optional. Defaults to []. |
override change?
override accepts any key Agno’s Agent.__init__ takes. Two ways to use it:
- Replace any value the SDK already resolves: any key from the output-params table below (
model,instructions,tools,db,knowledge_retriever,pre_hooks,output_schema, and so on). - Add Agno-native kwargs the SDK doesn’t set itself. Common ones:
| Key | What it controls |
|---|---|
temperature | Sampling temperature for the model. |
max_tokens | Cap on tokens generated per response. |
show_tool_calls | Print tool calls during the run. |
debug_mode | Print full LLM payloads and timings. |
use_json_mode | Force JSON-mode responses. |
markdown | Render responses as Markdown. |
override["model"] skips the SDK’s own model resolution entirely, so use it whenever you want a different model client without re-implementing credential handling.
Example using override to A/B-test two models against the same agent definition:
override to tune Agno-native sampling parameters:
tools to inject an ephemeral test tool:
Agent(...). The dict is yours; the SDK won’t reach back in.
Output parameters
Callingbackend.aget_args returns these fields:
| Key | Type | Notes |
|---|---|---|
id | str | Agent ID from xpander_config.json. Always set. |
name | str | Agent name from the control plane. Always set. |
description | str | Sourced from agent_instructions.json general. Always set. |
model | agno.models.base.Model | Instantiated with credentials resolved. The agent’s custom LLM key wins on cloud, your shell env var wins locally. Always set. |
instructions | str | The Agent Studio instructions (or task.instructions_override), with context-optimization, workspace-output, and compact-tool guidance appended. Always set. |
tools | list[Callable] | Connectors, custom @register_tool functions, MCP servers (with OAuth handled), xpcompact_context, plus Agno’s think / analyze when reasoning_tools_enabled=True. Always set. |
tool_hooks | list[Callable] | One internal hook for retries on transient errors, stuck-loop detection, activity reporting, and Layer 1 microcompaction. Append to it, don’t replace. Always set. |
compression_manager | XPanderContextOptimizer | Layered context-optimization pipeline (microcompaction, auto-compaction, manual via xpcompact_context, emergency, pre-retry). Always set. |
add_datetime_to_context | bool | Inject the current datetime into context on every run. Always set. Defaults to True. |
store_events | bool | Persist Agno run events. Always set. Defaults to True. |
db | AsyncPostgresDb or PostgresDb | Scoped to a per-agent schema. Set when session_storage, user_memories, or agent_memories is on. |
add_history_to_context, session_id, user_id, num_history_runs, max_tool_calls_from_history, enable_session_summaries | various | Driven by agno_settings.session_storage (see settings table below). Set when session_storage=True. |
enable_user_memories, memory_manager, enable_agentic_memory | various | Per-user facts that persist across sessions. Set when user_memories=True. |
add_culture_to_context, update_cultural_knowledge, enable_agentic_culture | bool | Org-wide facts. Single-agent only (skipped on Teams). Set when agent_memories=True. |
learning | bool | Set when agno_settings.learning=True. |
tool_call_limit | int | Cap on tool calls per run. Set when configured on the agent. |
knowledge_retriever, search_knowledge | callable, bool | Agno calls the retriever automatically during the loop. Set when KBs are attached. |
pre_hooks | list | PIIDetectionGuardrail, PromptInjectionGuardrail, OpenAIModerationGuardrail per agno_settings. Set when guardrails are on. |
output_schema, use_json_mode, markdown | various | Structured output and Markdown formatting. Task-level output format and schema overrides applied. Set per output settings. |
expected_output, additional_context | str | Forwarded from the agent definition and task. Set when present. |
members, add_member_tools_to_context, share_member_interactions, show_members_responses | various | The SDK recursively builds each sub-agent as AgnoAgent or AgnoTeam. Set when agent.is_a_team. |
4. Edit the agent’s system prompt
agent_instructions.json contains the agent’s system prompt and has exactly three fields:
agent_instructions.json
xpander agent dev or xpander agent deploy syncs it to the control plane.
5. Set up streaming (optional)
For token-by-token output, decorate anasync def that yields TaskUpdateEvent objects instead of returning a Task. The decorator detects the difference automatically.
streaming_handler.py
stream=True, stream_events=True, yield_run_output=Truetell Agno to emit events instead of buffering. The handler receives chunks, tool-call events, and a finalRunOutput.- The
Chunkevent forwards each token to the platform’s SSE stream so clients render output as it arrives. - The
TaskFinishedevent signals the end of the stream and carries the final task back to the platform.
POST /invoke, returning Server-Sent Events. The platform’s SSE listener for cloud-deployed agents expects a regular handler that returns a Task. So if you need both an interactive streaming experience and platform-routed tasks, run two handlers, or have your streaming endpoint proxy through a regular handler.
6. Test local development
Run the handler with the dev server. Tasks created from any channel (REST, Slack, Agent Studio) route to your laptop:--output_format and --output_schema are useful for testing structured output without changing the agent’s settings in the control plane.
7. Deploy to xpander cloud
When the local handler works, push it as a managed container:- The CLI bundles
xpander_handler.py,requirements.txt, theDockerfile, and the rest of the project. - xpander builds a Docker image, pushes it, and rolls out a new immutable version. The previous version stays available for instant rollback.
- Once the rollout finishes, the platform routes inbound tasks to the new container. The first deploy takes a couple of minutes; subsequent deploys are faster thanks to layer caching.
Secrets and environment variables
.env ships with the deploy by default. For values you don’t want bundled into the image (production keys, rotating secrets), upload them to xpander’s secret store instead:
xpander secrets-sync whenever you rotate a secret. Don’t commit .env to source control either way.
Lifecycle hooks
Containers support@on_boot and @on_shutdown for one-time resource setup and teardown. Use them for caches you want to warm before the first task lands, or open connections you want to close cleanly when the container is replaced:
When to redeploy
Anything that changes Python code, dependencies, or the Dockerfile needs a redeploy. The control-plane bits stay live without one:- Live (no redeploy): instructions, model selection, memory settings, attached agents, attached knowledge bases, tool selection from the catalog.
- Needs
xpander agent deploy: any change toxpander_handler.py,requirements.txt,Dockerfile, or other files in the container.
Inspect the deployment settings
After everyxpander agent dev or xpander agent deploy, the live Agno settings for the cloud version are saved on the agent. Read them back from the Agents() class to confirm what’s actually running:
| Setting | Default | What it does |
|---|---|---|
session_storage | True | Postgres-backed conversation history within a thread. Sets add_history_to_context, session_id, and user_id on the args. |
num_history_runs | 10 | How many prior runs to load into context. |
max_tool_calls_from_history | 0 | Cap on tool calls replayed from history. 0 means no cap. |
session_summaries | False | Generate summaries of completed sessions (enable_session_summaries). |
user_memories | False | Per-user facts that persist across sessions. Manual mode. Adds enable_user_memories and a MemoryManager. |
agentic_memory | False | Per-user facts, agentic-managed (the agent decides when to remember). Sets enable_agentic_memory. Requires user_memories. |
agent_memories | False | Org-wide facts the agent carries into every conversation. Adds add_culture_to_context and update_cultural_knowledge. Single-agent only (skipped on Teams). |
agentic_culture | False | Org-wide facts, agentic-managed. Sets enable_agentic_culture instead of update_cultural_knowledge. |
learning | False | The agent learns and improves with every interaction. |
tool_call_limit | None | Max tool calls per run. None means unlimited. |
coordinate_mode | False | Force Team mode (also auto-detected when the agent has sub-agents attached). |
pii_detection_enabled | False | Adds a PIIDetectionGuardrail pre-hook. |
pii_detection_mask | True | Mask detected PII rather than blocking the request. |
prompt_injection_detection_enabled | False | Adds a PromptInjectionGuardrail pre-hook. |
openai_moderation_enabled | False | Adds an OpenAIModerationGuardrail pre-hook. |
openai_moderation_categories | None | Restrict moderation to specific categories. None means all. |
reasoning_tools_enabled | False | Add Agno’s think and analyze reasoning tools. Skipped automatically for Teams. |
agno_settings in Agent Studio, not in code. There’s no SDK call to flip them by design: changing memory settings affects billing and persistence semantics, so they live in the control plane.
Troubleshooting
`AttributeError: 'NoneType' object has no attribute 'instructions_override'`
`AttributeError: 'NoneType' object has no attribute 'instructions_override'`
Backend.aget_args() reads task.instructions_override while building the args. Inside an @on_task handler, always pass the active task: await backend.aget_args(task=task). The agent_id-only form is supported outside a handler (in scripts and notebooks), but inside one the active task is the source of truth for instruction overrides.Postgres connection errors when running the handler
Postgres connection errors when running the handler
Session storage is on by default, so the args dict includes a
db wired to xpander’s Postgres. For cloud-hosted agents this is automatic. For self-hosted or air-gapped deployments, the database needs to be reachable from your container. Check the connection string with await agent.aget_connection_string() and confirm the host is reachable. To turn session storage off, flip agno_settings.session_storage to False in Agent Studio.Wrong model or wrong key used at runtime
Wrong model or wrong key used at runtime
Custom LLM keys configured on the agent take precedence on cloud deployments. Locally, your shell’s
OPENAI_API_KEY (or the equivalent for your provider) wins. If you want the cloud-side custom key locally too, mirror it into your .env.`pip install xpander-sdk[agno]` says 'no matches found' on macOS
`pip install xpander-sdk[agno]` says 'no matches found' on macOS
zsh expands the brackets. Quote the package name:
pip install "xpander-sdk[agno]".Next steps
Quickstart
The 10-minute scaffold-to-deploy walkthrough that produced the handler shown above.
Custom Tools
Wrap private APIs as tools with
@register_tool and pass them through the args dict.Memory & State
The deep dive on
session_storage, user memories, and agent memories.Containers
Ship the handler as a container managed by xpander.
Core Concepts
The SDK class names mapped onto agents, tasks, threads, and memory.
Frameworks overview
What’s auto-wired vs. manual for Agno, OpenAI Agents SDK, LangChain, and AWS Strands.

