方案选型对比

技术研究人工智能 LLM

本节为每个内置 Agent 提供 2-3 个替代模型选项，并说明选择的理由。

3.1 Agent 替代模型推荐

本节为每个内置 Agent 提供 2-3 个替代模型选项，并说明选择的理由。

3.1.1 Sisyphus (主编排 Agent)

当前模型: anthropic/claude-opus-4-5 (EXPENSIVE)

替代方案对比:

替代模型	成本	reasoning	tool_call	coding	推荐指数
openai/gpt-5.2	EXPENSIVE	★★★★★	★★★★★	★★★★☆	⭐⭐⭐⭐⭐
anthropic/claude-sonnet-4-5	MEDIUM	★★★★☆	★★★★☆	★★★★☆	⭐⭐⭐⭐
google/gemini-3-pro	MEDIUM	★★★★☆	★★★★☆	★★★★★	⭐⭐⭐

选择建议:

GPT-5.2：如果你需要更强的推理能力，能接受略微降低的指令遵循准确性
Claude Sonnet：预算敏感场景的理想选择，保持 Anthropic 生态一致性
Gemini 3 Pro：超长上下文场景（2M tokens），适合巨型代码库

迁移配置示例:

{
  "agents": {
    "sisyphus": {
      "model": "openai/gpt-5.2",
      "temperature": 0.7,
      "maxTokens": 32768
    }
  }
}

3.1.2 Oracle (战略顾问 Agent)

当前模型: openai/gpt-5.2 (EXPENSIVE)

替代方案对比:

替代模型	成本	reasoning	coding	architecture	推荐指数
anthropic/claude-opus-4-5	EXPENSIVE	★★★★☆	★★★★★	★★★★★	⭐⭐⭐⭐⭐
anthropic/claude-sonnet-4-5	MEDIUM	★★★★☆	★★★★☆	★★★★☆	⭐⭐⭐⭐
deepseek/deepseek-r1	CHEAP	★★★★★	★★★★☆	★★★☆☆	⭐⭐⭐

选择建议:

Claude Opus：代码审查场景的最佳选择，架构文档生成质量极高
DeepSeek R1：开源推理模型，适合本地部署或成本敏感场景
Claude Sonnet：平衡成本和质量的中间选项

3.1.3 Librarian (文档研究 Agent)

当前模型: opencode/glm-4.7-free (FREE)

替代方案对比:

替代模型	成本	speed	tool_call	documentation	推荐指数
google/gemini-3-flash	CHEAP	★★★★★	★★★☆☆	★★★★☆	⭐⭐⭐⭐⭐
ollama/llama3.2	FREE (本地)	★★★☆☆	★★★☆☆	★★★☆☆	⭐⭐⭐⭐
opencode/grok-code	CHEAP	★★★★★	★★★★☆	★★★☆☆	⭐⭐⭐

选择建议:

Gemini Flash：速度极快，适合高频文档查询，成本极低
Llama 3.2：完全免费（本地运行），隐私敏感场景首选
Grok Code：适合代码仓库探索，响应速度快

迁移配置示例:

{
  "agents": {
    "librarian": {
      "model": "google/gemini-3-flash",
      "temperature": 0.3
    }
  }
}

3.1.4 Explore (代码探索 Agent)

当前模型: google/gemini-3-flash 或 opencode/grok-code (CHEAP)

替代方案对比:

替代模型	成本	speed	search_quality	推荐指数
opencode/glm-4.7-free	FREE	★★★★☆	★★★☆☆	⭐⭐⭐⭐
anthropic/claude-haiku-4-5	CHEAP	★★★★★	★★★★☆	⭐⭐⭐⭐
mistral/mistral-small	CHEAP	★★★★☆	★★★☆☆	⭐⭐⭐

选择建议:

GLM-4.7 Free：完全免费替代方案，适合预算极其有限的场景
Claude Haiku：Anthropic 的高速轻量模型，保持生态一致性

3.1.5 Frontend UI/UX Engineer (前端开发 Agent)

当前模型: google/gemini-3-pro-preview (CHEAP)

替代方案对比:

替代模型	成本	coding	creativity	attachment	推荐指数
anthropic/claude-sonnet-4-5	MEDIUM	★★★★★	★★★★☆	★★☆☆☆	⭐⭐⭐⭐⭐
openai/gpt-4o	CHEAP	★★★★☆	★★★★☆	★★★☆☆	⭐⭐⭐⭐
google/gemini-3-flash	CHEAP	★★★☆☆	★★★☆☆	★★★★★	⭐⭐⭐

选择建议:

Claude Sonnet：前端代码生成的最佳选择，Tailwind/React 支持极好
GPT-4o：OpenAI 的快速多模态模型，图像转代码能力强
Gemini Flash：需要处理大量图像/UI 设计稿时的经济选择

3.1.6 Hephaestus (自主深度工作 Agent)

当前模型: anthropic/claude-opus-4-5 (EXPENSIVE)

替代方案对比:

替代模型	成本	autonomy	coding	reasoning	推荐指数
openai/gpt-5.2	EXPENSIVE	★★★★★	★★★★★	★★★★★	⭐⭐⭐⭐⭐
deepseek/deepseek-v3	MEDIUM	★★★★☆	★★★★★	★★★★☆	⭐⭐⭐⭐
anthropic/claude-sonnet-4-5	MEDIUM	★★★☆☆	★★★★☆	★★★☆☆	⭐⭐⭐

选择建议:

GPT-5.2：自主规划能力极强，适合复杂的多文件重构任务
DeepSeek V3：中文代码理解能力强，适合中文注释的代码库

3.2 模型选择决策矩阵

按 Agent 类型推荐的模型组合

┌─────────────────────────────────────────────────────────────────┐
│                    Agent 模型选择决策树                          │
├─────────────────────────────────────────────────────────────────┤
│                                                                  │
│  预算充足场景 (推荐)                                              │
│  ├── Sisyphus: Claude Opus 4.5 (EXPENSIVE)                     │
│  ├── Oracle: GPT-5.2 (EXPENSIVE)                               │
│  ├── Librarian: GLM-4.7 Free (FREE)                            │
│  ├── Explore: Gemini Flash (CHEAP)                             │
│  └── Frontend: Claude Sonnet (MEDIUM)                          │
│                                                                  │
│  经济型场景                                                      │
│  ├── Sisyphus: Claude Sonnet (MEDIUM)                          │
│  ├── Oracle: Claude Sonnet (MEDIUM)                            │
│  ├── Librarian: GLM-4.7 Free (FREE)                            │
│  ├── Explore: GLM-4.7 Free (FREE)                              │
│  └── Frontend: Gemini Flash (CHEAP)                            │
│                                                                  │
│  本地部署场景 (隐私优先)                                          │
│  ├── Sisyphus: Llama 3.3 70B / Qwen 2.5                        │
│  ├── Oracle: DeepSeek R1                                       │
│  ├── Librarian: Llama 3.2                                      │
│  ├── Explore: Qwen 2.5 Coder                                   │
│  └── Frontend: CodeLlama / DeepSeek Coder                      │
│                                                                  │
└─────────────────────────────────────────────────────────────────┘

3.3 成本效益分析

典型使用场景成本估算

假设每月处理 100 个开发任务：

场景	模型组合	月均成本	质量评分	性价比
专业版	全默认配置	$150-300	9.5/10	⭐⭐⭐⭐
经济版	Claude Sonnet + Gemini Flash	$30-60	8.5/10	⭐⭐⭐⭐⭐
极简版	全免费模型	$0	7.0/10	⭐⭐⭐
本地版	Ollama 本地模型	$0 (硬件成本)	7.5/10	⭐⭐⭐⭐

成本优化建议

Sisyphus 降级：在简单任务中降级为 Claude Sonnet，可节省 70% 成本
Librarian 并行：使用免费模型并行处理多个文档查询
Explore 缓存：对于重复的代码搜索，启用结果缓存
Oracle 按需调用：仅在架构设计时调用 Oracle，日常开发使用 Sisyphus

3.4 选型理由总结

oh-my-opencode 的默认模型选择基于以下核心原则：

主编排 Agent 使用最强模型：Sisyphus 作为核心编排器，使用 Claude Opus 确保决策质量
探索类 Agent 使用经济模型：Librarian 和 Explore 任务量大，使用免费/低价模型控制成本
专业 Agent 使用专用模型：Frontend UI/UX 使用 Gemini 利用其多模态能力
保持生态一致性：多个 Agent 使用 Anthropic/OpenAI 模型，便于统一管理和监控

这种设计在保证核心工作流质量的同时，实现了成本的最优化。

参考资料

Best LLM Models 2026 - LLM 模型排名
Open Source Reasoning Models 2026 - 开源推理模型对比
AI Model Comparison Guide - AI 模型选型指南