Commit Graph

10 Commits

Author SHA1 Message Date
Deploy Bot
36455ccf24 feat:apply model recommendations - 18 agents migrated to kimi-k2.6\n\nSources from agent-evolution/data/evolution.json\nAgents: architect-indexer, backend-developer, browser-automation,\n code-skeptic, evaluator, flutter-developer, frontend-developer,\n history-miner, lead-developer, markdown-validator, php-developer,\n product-owner, prompt-optimizer, python-developer,\n requirement-refiner, sdet-engineer, visual-tester,\n workflow-architect\nAlso synced 4 agents via sync-agents.cjs 2026-05-27 13:38:49 +01:00
Deploy Bot
047a87afb4 feat(agent-models): apply MEDIUM+LOW priority model migrations
- markdown-validator: deepseek-v4-pro-max → nemotron-3-nano (90% cost cut)
- release-manager: glm-5.1 → kimi-k2.6 (+2 matrix, 1M context for diffs)
- capability-analyst: glm-5.1 → deepseek-v4-pro-max (+4 matrix, 1M ctx)
- browser-automation: qwen3-coder → deepseek-v4-flash (3× faster inference)
- history-miner: nemotron-3-super → qwen3.5-122b (+14 IF, 12.4M pulls)
2026-05-25 15:07:17 +01:00
NW
bd154f24d0 feat(gns2): mass-update all 30 agents with GNS-2 protocol
- 29 agents updated with GNS-2 checkpoint/event protocol
- 12 Tier 0 (leaf) agents: read checkpoint, write event footer, no cascade
- 17 Tier 1 (task) agents: read checkpoint, recommend next agent, no direct task calls
- 2 Tier 2 (meta) agents already updated: capability-analyst, agent-architect, evaluator
- All agents now include GNS_EVENT footer template in comments
- Frontmatter updated with '(GNS-2 Tier N)' classification

Scripts added:
- scripts/mass-update-gns-agents.py — idempotent mass updater
- scripts/validate-gns-agents.py — protocol checker

Refs: Milestone #67, Issues #99-#107
2026-05-08 22:03:08 +01:00
NW
994ca58821 fix(agents): add missing permissions + complete kilo-meta.json
- Fix 12 agents missing edit/write/bash permissions
- Add 5 missing agents to kilo-meta.json (architect-indexer, flutter-developer, php-developer, pipeline-judge, python-developer)
- Remove BOM from kilo.jsonc
- All 32 agents now consistent between files and meta
2026-05-07 07:22:32 +01:00
¨NW¨
fb552e0020 feat: v3 optimal model assignments + fitness gate
- Update 30 agents to v3 heatmap maximum-score models:
  * go-dev: qwen3-coder -> deepseek-v4-pro-max (85->88 +3)
  * planner: nemotron -> deepseek-v4-pro-max (80->88 +8)
  * perf-engineer: nemotron -> deepseek-v4-pro-max (78->84 +6)
  * reflector: nemotron -> deepseek-v4-pro-max (78->84 +6)
  * security: nemotron -> deepseek-v4-pro-max (76->80 +4)
  * memory-manager: nemotron -> qwen3.6-plus (86->87 +1)
  * frontend: kimi-k2.5 -> minimax-m2.5 (92)
  * the-fixer: minimax-m2.5 -> kimi-k2.6 (88->90 +2)
  * browser-auto: kimi-k2.6 -> qwen3-coder (86->87 +1)
  * prompt-opt: glm-5.1 -> qwen3.6-plus (82->83 +1)
  * backend: deepseek-v3.2 -> qwen3-coder (91)
  * capability-analyst: nemotron -> glm-5.1 (85)
  * release-man: devstral-2 -> glm-5.1 (82)
  * evaluator: nemotron -> glm-5.1 (86)
  * workflow-arch: gpt-oss -> glm-5.1 (84)

- Add Model Evolution Guard:
  * fitness-gate.cjs: rejects downgrades >3 points or <75 score
  * Normalized model ID lookup (: vs -)
  * Diff report before any file modifications
- Update sync-benchmarks-from-yaml.cjs with fitness gate
- Sync kilo-meta.json, kilo.jsonc, .md agent files
- Rebuild research-dashboard.html (104KB, 30 agents, 11 models)

Total improvement: +105 points across 11 agents
Source: v3.html heatmap IF-adjusted composite scores
2026-04-30 08:42:10 +01:00
¨NW¨
28a3b648cc refactor(prompts): compress 29 agents (-77%) and 7 rules (-55%), delete 2 duplicates
Agents: 6,235 → 1,454 lines (-77%). Each agent compressed to Role/Behavior/Delegates/Output/Handoff format.
Gitea commenting extracted to shared block (.kilo/shared/gitea-commenting.md).
Self-evolution protocol extracted to shared block (.kilo/shared/self-evolution.md).
Gitea API client centralized (.kilo/shared/gitea-api.md).

Rules: 2,358 → 1,189 lines (-50%). Deleted sdet-engineer.md (duplicate of agent)
and orchestrator-self-evolution.md (moved to shared/). Compressed docker (549→26),
flutter (521→28), go (283→21), nodejs (271→27), code-skeptic (59→14) to checklists
with skill references.

Fitness: 54/54 tests pass, 29/29 agents validated, fitness=0.92
2026-04-18 13:49:24 +01:00
NW
e19fa3effd refactor: full agent system revision — migrate to GLM-5.1, fix delegation chains, audit consistency
- Migrate 8 agents from openrouter/qwen3.6-plus:free to ollama-cloud/glm-5.1
- Assign thinking/variant/instant depth by role complexity
- Fix broken delegation chains: system-analyst, all developer agents, devops-engineer now can reach orchestrator
- Add task permissions to browser-automation, visual-tester, capability-analyst, markdown-validator
- Add visual-tester permission to flutter-developer and frontend-developer
- Fix capability-index.yaml routing map indentation (go_* keys misplaced)
- Add delegates_to and variant fields to capability-index.yaml
- Update KILO_SPEC.md agent table with Variant column
- Update AGENTS.md with Model/Variant/CanCall columns
- Update kilo.jsonc ask agent model
- Fix YAML indentation in capability-analyst.md and markdown-validator.md
- Update agent-architect.md template models (remove gpt-oss, qwen3.6-plus)
- Add Skills Reference tables to 7 previously unlinked agents
- Full audit: 10/10 consistency checks passed
2026-04-12 22:38:41 +01:00
¨NW¨
caf77f53c8 feat: update agent models based on recommendations
- Update security-auditor to ollama-cloud/nemotron-3-super-120b-a12b
- Update performance-engineer to ollama-cloud/nemotron-3-super-120b-a12b
- Update memory-manager to ollama-cloud/nemotron-3-super-120b-a12b
- Update evaluator to ollama-cloud/nemotron-3-super-120b-a12b
- Update planner to ollama-cloud/nemotron-3-super-120b-a12b
- Update reflector to ollama-cloud/nemotron-3-super-120b-a12b
- Update system-analyst to ollama-cloud/glm-5
- Update go-developer to ollama-cloud/qwen3-coder:480b
- Update markdown-validator to ollama-cloud/nemotron-3-nano
- Update prompt-optimizer to ollama-cloud/nemotron-3-super-120b-a12b
- Update product-owner to ollama-cloud/glm-5

Based on archive/agent-model-recommendations.json analysis
2026-04-05 05:21:02 +01:00
swp
a1a15473a0 feat: add mandatory Gitea commenting to all agents
- Add gitea-commenting skill with comment templates
- Update orchestrator with mandatory commenting requirements
- Add Gitea Commenting section to all 17 agent files
- Create email validation module (validateEmail function)
- Add efficiency_score.json for pipeline logging
- Create test-error-recovery.js for error recovery testing

Refs: Milestone #43 - System Consistency Testing
All 6 test issues closed successfully
2026-04-04 03:18:28 +01:00
swp
e58a5b6380 feat: add markdown-validator agent and scoped-labels skill
- Add markdown-validator agent for validating Gitea issue descriptions
- Add scoped-labels skill for managing exclusive labels (status::, priority::, type::)
- Add init-scoped-labels.sh script to create standard label sets
- Add GiteaClient methods: createLabel, updateLabel, setScopedLabel, setScopedStatus, setScopedPriority
- Support exclusive labels (scoped labels) in Gitea API 1.21+

Created scoped labels:
- status::new, status::planned, status::in-progress, status::review, status::testing, status::done, status::blocked, status::cancelled
- priority::critical, priority::high, priority::medium, priority::low
- type::bug, type::feature, type::enhancement, type::documentation, type::refactor, type::test, type::chore
- size::xs, size::s, size::m, size::l, size::xl

Scoped labels are mutually exclusive within their scope - applying status::in-progress automatically removes status::new
2026-04-04 01:42:12 +01:00