- Restore all 30 agents to v3.html heatmap optimal models:
  * frontend-developer: qwen3-coder -> minimax-m2.5 (92★)
  * devops-engineer: nemotron-3-super -> kimi-k2.6:cloud (88★)
  * browser-automation: qwen3-coder -> kimi-k2.6:cloud (86★)
  * agent-architect: glm-5.1 -> kimi-k2.6:cloud (86★)
- Add Model Evolution Guard system:
  * agent-evolution/scripts/lib/fitness-gate.cjs
  * Rejects downgrades of more than 3 points or to a score below 75
  * Produces a detailed diff report before any files are modified
  * Normalizes model IDs for lookup (v3.html uses ':', JSON uses '-')
- Update sync-benchmarks-from-yaml.cjs with fitness gate
- Update model-benchmarks.json with v3 optimal assignments
- Rebuild research-dashboard.html (104KB, 30 agents, 11 models)
- Add model-evolution-guard.md architecture documentation
- Add v3-optimal-models.json as source-of-truth reference
Fixes the regression introduced by commit 3badb25, in which models were
silently downgraded from their heatmap-optimal assignments to inferior ones.
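
The guard rules listed in this commit message can be illustrated with a minimal sketch. It is not the contents of `fitness-gate.cjs`: the function names (`normalizeModelId`, `lookupScore`, `evaluateChange`, `buildReport`) and the data shapes are hypothetical. It assumes that model IDs differ only by `:` versus `-`, that a change counts as a downgrade when the proposed score is lower than the current one, and that a change with a missing score is rejected.

```javascript
// Hypothetical sketch of the guard rules; names and shapes are assumptions.
const MAX_DOWNGRADE = 3; // commit message: reject downgrades of more than 3 points
const MIN_SCORE = 75;    // commit message: reject downgrades to a score below 75

// Assumption: the two sources differ only in ':' vs '-' inside the ID.
function normalizeModelId(id) {
  return id.trim().toLowerCase().replace(/:/g, "-");
}

// scoresForAgent maps model ID -> score for a single agent.
function lookupScore(scoresForAgent, modelId) {
  const wanted = normalizeModelId(modelId);
  for (const [id, score] of Object.entries(scoresForAgent)) {
    if (normalizeModelId(id) === wanted) return score;
  }
  return undefined;
}

// Evaluate one proposed reassignment and return a report entry.
function evaluateChange(agent, from, to, scoresForAgent) {
  const before = lookupScore(scoresForAgent, from);
  const after = lookupScore(scoresForAgent, to);
  const entry = { agent, from, to, before, after, accepted: true, reason: "ok" };

  if (before === undefined || after === undefined) {
    // Assumption: without both scores the change cannot be judged, so reject.
    entry.accepted = false;
    entry.reason = "missing score";
  } else if (after < before && before - after > MAX_DOWNGRADE) {
    entry.accepted = false;
    entry.reason = `downgrade of ${before - after} points exceeds ${MAX_DOWNGRADE}`;
  } else if (after < before && after < MIN_SCORE) {
    entry.accepted = false;
    entry.reason = `downgrade lands below minimum score ${MIN_SCORE}`;
  }
  return entry;
}

// Build the full report first; the caller writes files only when ok is true.
// changes: [{ agent, from, to }], scores: { [agent]: { [modelId]: score } }
function buildReport(changes, scores) {
  const entries = changes.map((c) =>
    evaluateChange(c.agent, c.from, c.to, scores[c.agent] || {})
  );
  return { entries, ok: entries.every((e) => e.accepted) };
}
```

Building the whole report before touching any file keeps a rejected change from leaving a partially updated set of assignments.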
1.4 KiB
Executable File
| description | mode | model | color | permission |
|---|---|---|---|---|
| Designs technical specifications, data schemas, and API contracts before implementation | subagent | ollama-cloud/glm-5.1 | #0891B2 | |
System Analyst
Role
Architect: design technical specs, data schemas, API contracts. Specify WHAT, not HOW.
Behavior
- Design, don't implement — specify interfaces, not implementations
- Define interfaces first: types, contracts, boundaries
- Consider edge cases: null values, empty states, errors
- Document dependencies: external services, libraries
Delegates
| Agent | When |
|---|---|
| sdet-engineer | Spec complete, ready for test creation |
Output
Handoff
- Ensure all types defined + dependencies documented
- List all edge cases
- Delegate: sdet-engineer
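
The handoff checklist above could be checked mechanically before delegating. The sketch below is only an illustration: the spec shape (`types`, `dependencies`, `edgeCases`), the built-in type names, and the function `checkHandoff` are hypothetical and not part of this agent definition.

```javascript
// Hypothetical spec shape:
// {
//   types: { TypeName: { fieldName: "typeReference" } },
//   dependencies: ["name of external service or library"],
//   edgeCases: ["description of an edge case"]
// }
const BUILTIN_TYPES = new Set(["string", "number", "boolean", "null"]);

function checkHandoff(spec) {
  const problems = [];
  const types = spec.types || {};
  const defined = new Set(Object.keys(types));

  // "All types defined": every field must reference a built-in or declared type.
  for (const [typeName, fields] of Object.entries(types)) {
    for (const [field, ref] of Object.entries(fields)) {
      if (!BUILTIN_TYPES.has(ref) && !defined.has(ref)) {
        problems.push(`undefined type: ${typeName}.${field} -> ${ref}`);
      }
    }
  }

  // "Dependencies documented": the list must be present (it may be empty).
  if (!Array.isArray(spec.dependencies)) {
    problems.push("dependencies not documented");
  }

  // "List all edge cases": assume at least one entry is expected.
  if (!Array.isArray(spec.edgeCases) || spec.edgeCases.length === 0) {
    problems.push("no edge cases listed");
  }

  const ready = problems.length === 0;
  // Delegate to sdet-engineer only when the spec is complete.
  return { ready, problems, delegate: ready ? "sdet-engineer" : null };
}
```

A spec that fails any check stays with the analyst, and the `problems` list says what is still missing.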