Agent 存算分离架构

What it is

Agent 存算分离是将 Agent 运行时的"计算"（LLM 调用、工具执行、上下文拼接）与"存储"（提示词、技能、对话记录、长期记忆、工作产物）解耦的架构设计。本地 Agent（OpenClaw、Codex）通常将两者绑定在同一 workspace，而云端规模化 Agent 需要 serverless 化的分层存储。

Why it matters

云端 Agent 托管服务（Manus、genspark claw、Kimi claw）面临的核心矛盾是：多租户、多任务、长时间运行要求每个用户有独立的运行空间，但常驻 pod 成本极高。存算分离通过"计算层动态扩缩 + 存储层分层持久化"解决这一矛盾，使 Agent 服务从"常驻虚拟机"模式演进为"serverless 函数"模式。

Key points

Agent 生命周期中的存储需求：提示词文件（soul.md、identity.md）、工具与技能、对话记录、交付产物都需要持久化；只有上下文拼接、LLM 调用、工具调用属于纯计算
三种运行模式：本地裸机（OpenClaw，存算一体，简单但不安全）、本地沙盒（Codex，工具调用环节隔离）、云端多副本（Manus，需 k8s + PVC）
四层存储模型：热状态用 Redis KV（checkpoint、游标）、对话记录用 Postgres、长期记忆用向量数据库（pgvector、Milvus）、工作产物用对象存储（S3/OSS）
核心挑战：分布式多副本场景下的数据一致性，需要锁机制和负载均衡策略配合惰性 sandbox 回收

Evidence across sources

Source	Key Claim	Relevance
AI 简报 2026-06-03	FastClaw 演示：gateway → 加载用户资料 → sandbox 执行 → checkpoint 到 KV → 惰性回收	端到端实现参考

Open questions

存储层 IO 延迟是否会瓶颈化 Agent Loop 的响应速度？
e2b sandbox 池化在秒级启动场景下的实际稳定性如何？
与 Anthropic Managed Agents 的"brain/harness/hands"三层解耦相比，存算分离更偏向 infra 层还是 session 层？

harness-engineering/managed-agents-architecture — Anthropic 的三层解耦视角
harness-engineering/9-layer-production-ai-architecture — 生产级 Agent 完整架构
harness-engineering/cloudflare-sandboxes — 沙盒即基础设施

Agent 存算分离架构

Agent 存算分离架构

What it is

Why it matters

Key points

Evidence across sources

Open questions

Sources

Evolution

Derived from source material

Agent 存算分离架构

What it is

Why it matters

Key points

Evidence across sources

Open questions

Related

Sources

Evolution

Derived from source material