refactor: full agent system revision — migrate to GLM-5.1, fix delegation chains, audit consistency

- Migrate 8 agents from openrouter/qwen3.6-plus:free to ollama-cloud/glm-5.1 - Assign thinking/variant/instant depth by role complexity - Fix broken delegation chains: system-analyst, all developer agents, devops-engineer now can reach orchestrator - Add task permissions to browser-automation, visual-tester, capability-analyst, markdown-validator - Add visual-tester permission to flutter-developer and frontend-developer - Fix capability-index.yaml routing map indentation (go_* keys misplaced) - Add delegates_to and variant fields to capability-index.yaml - Update KILO_SPEC.md agent table with Variant column - Update AGENTS.md with Model/Variant/CanCall columns - Update kilo.jsonc ask agent model - Fix YAML indentation in capability-analyst.md and markdown-validator.md - Update agent-architect.md template models (remove gpt-oss, qwen3.6-plus) - Add Skills Reference tables to 7 previously unlinked agents - Full audit: 10/10 consistency checks passed
2026-04-12 22:38:41 +01:00
parent 1f4536ab93
commit e19fa3effd
33 changed files with 365 additions and 141 deletions
--- a/AGENTS.md
+++ b/AGENTS.md
@@ -32,45 +32,56 @@ Agent: Runs full pipeline for issue #42 with Gitea logging
 These agents are invoked automatically by `/pipeline` or manually via `@mention`:

 ### Core Development
-| Agent | Role | When Invoked |
-|-------|------|--------------|
-| `@requirement-refiner` | Converts ideas to User Stories | Issue status: new |
-| `@history-miner` | Finds duplicates in git | Status: planned |
-| `@system-analyst` | Designs specifications | Status: researching |
-| `@sdet-engineer` | Writes tests (TDD) | Status: designed |
-| `@lead-developer` | Implements code | Status: testing (tests fail) |
-| `@frontend-developer` | UI implementation | When UI work needed |
-| `@backend-developer` | Node.js/Express/APIs | When backend needed |
-| `@flutter-developer` | Flutter mobile apps | When mobile development |
-| `@go-developer` | Go backend services | When Go backend needed |
+| Agent | Role | Model | Variant | Can Call |
+|-------|------|-------|---------|----------|
+| `@requirement-refiner` | Converts ideas to User Stories | glm-5.1 | thinking | history-miner, system-analyst |
+| `@history-miner` | Finds duplicates in git | nemotron-3-super | — | *(read-only)* |
+| `@system-analyst` | Designs specifications | glm-5.1 | thinking | sdet-engineer, orchestrator |
+| `@sdet-engineer` | Writes tests (TDD) | qwen3-coder:480b | thinking | lead-developer, orchestrator |
+| `@lead-developer` | Implements code | qwen3-coder:480b | thinking | code-skeptic, orchestrator |
+| `@frontend-developer` | UI implementation | qwen3-coder:480b | — | code-skeptic, orchestrator |
+| `@backend-developer` | Node.js/Express/APIs | qwen3-coder:480b | — | code-skeptic, orchestrator |
+| `@go-developer` | Go backend services | qwen3-coder:480b | — | code-skeptic, orchestrator |
+| `@flutter-developer` | Flutter mobile apps | qwen3-coder:480b | — | code-skeptic, orchestrator |

 ### Quality Assurance
-| Agent | Role | When Invoked |
-|-------|------|--------------|
-| `@code-skeptic` | Adversarial review | Status: implementing |
-| `@the-fixer` | Fixes issues | When review fails |
-| `@performance-engineer` | Performance review | After code-skeptic |
-| `@security-auditor` | Security audit | After performance |
-| `@visual-tester` | Visual regression | When UI changes |
+| Agent | Role | Model | Variant | Can Call |
+|-------|------|-------|---------|----------|
+| `@code-skeptic` | Adversarial review | minimax-m2.5 | — | the-fixer, performance-engineer, orchestrator |
+| `@the-fixer` | Fixes issues | minimax-m2.5 | — | code-skeptic, orchestrator |
+| `@performance-engineer` | Performance review | nemotron-3-super | — | the-fixer, security-auditor, orchestrator |
+| `@security-auditor` | Security audit | nemotron-3-super | — | the-fixer, release-manager, orchestrator |
+| `@visual-tester` | Visual regression | qwen3-coder:480b | — | the-fixer, orchestrator |
+| `@browser-automation` | E2E testing | qwen3-coder:480b | — | orchestrator |

-### Cognitive Enhancement (New)
-| Agent | Role | When Invoked |
-|-------|------|--------------|
-| `@planner` | Task decomposition (CoT/ToT) | Complex tasks |
-| `@reflector` | Self-reflection (Reflexion) | After each agent |
-| `@memory-manager` | Memory systems | Context management |
+### DevOps & Infrastructure
+| Agent | Role | Model | Variant | Can Call |
+|-------|------|-------|---------|----------|
+| `@devops-engineer` | Docker/K8s/CI-CD | nemotron-3-super | — | code-skeptic, security-auditor, orchestrator |
+| `@release-manager` | Git operations, releases | glm-5.1 | — | evaluator |

 ### Meta & Process
-| Agent | Role | When Invoked |
-|-------|------|--------------|
-| `@release-manager` | Git operations | Status: releasing |
-| `@evaluator` | Scores effectiveness | Status: evaluated |
-| `@pipeline-judge` | Objective fitness scoring | After workflow completes |
-| `@prompt-optimizer` | Improves prompts | When fitness < 0.70 |
-| `@capability-analyst` | Analyzes task coverage | When starting new task |
-| `@agent-architect` | Creates new agents | When gaps identified |
-| `@workflow-architect` | Creates workflows | New workflow needed |
-| `@markdown-validator` | Validates Markdown | Before issue creation |
+| Agent | Role | Model | Variant | Can Call |
+|-------|------|-------|---------|----------|
+| `@evaluator` | Scores effectiveness | glm-5.1 | thinking | prompt-optimizer, product-owner, orchestrator |
+| `@pipeline-judge` | Objective fitness scoring | glm-5.1 | — | prompt-optimizer |
+| `@prompt-optimizer` | Improves prompts | glm-5.1 | instant | *(edits files)* |
+| `@product-owner` | Manages issues/tracking | glm-5.1 | — | *(read-only)* |
+
+### Analysis & Design
+| Agent | Role | Model | Variant | Can Call |
+|-------|------|-------|---------|----------|
+| `@capability-analyst` | Analyzes task coverage | glm-5.1 | — | agent-architect, orchestrator |
+| `@agent-architect` | Creates new agents | glm-5.1 | thinking | capability-analyst, requirement-refiner, system-analyst |
+| `@workflow-architect` | Creates workflows | glm-5.1 | thinking | *(edits files)* |
+| `@markdown-validator` | Validates Markdown | nemotron-3-nano:30b | — | orchestrator |
+
+### Cognitive Enhancement
+| Agent | Role | Model | Variant | Can Call |
+|-------|------|-------|---------|----------|
+| `@planner` | Task decomposition | nemotron-3-super | — | *(read-only)* |
+| `@reflector` | Self-reflection | nemotron-3-super | — | *(read-only)* |
+| `@memory-manager` | Memory systems | nemotron-3-super | — | *(read-only)* |

 ## Workflow State Machine