Deploy Bot
36455ccf24
feat: apply model recommendations - 18 agents migrated to kimi-k2.6
Sources from agent-evolution/data/evolution.json
Agents: architect-indexer, backend-developer, browser-automation,
code-skeptic, evaluator, flutter-developer, frontend-developer,
history-miner, lead-developer, markdown-validator, php-developer,
product-owner, prompt-optimizer, python-developer,
requirement-refiner, sdet-engineer, visual-tester,
workflow-architect
Also synced 4 agents via sync-agents.cjs
2026-05-27 13:38:49 +01:00
Deploy Bot
047a87afb4
feat(agent-models): apply MEDIUM+LOW priority model migrations
...
- markdown-validator: deepseek-v4-pro-max → nemotron-3-nano (90% cost cut)
- release-manager: glm-5.1 → kimi-k2.6 (+2 matrix, 1M context for diffs)
- capability-analyst: glm-5.1 → deepseek-v4-pro-max (+4 matrix, 1M ctx)
- browser-automation: qwen3-coder → deepseek-v4-flash (3× faster inference)
- history-miner: nemotron-3-super → qwen3.5-122b (+14 IF, 12.4M pulls)
2026-05-25 15:07:17 +01:00
Deploy Bot
4a0c78e5c9
feat(agent-models): apply CRITICAL+HIGH model migrations from research
...
Migrations based on model-research-2026-05-24:
- prompt-optimizer: qwen3.6-plus → qwen3.5-122b (CRITICAL, IF=92)
- memory-manager: qwen3.6-plus → deepseek-v4-pro-max (CRITICAL, 1M ctx)
- system-analyst: glm-5.1 → deepseek-v4-pro-max (HIGH, matrix +6)
- evaluator: glm-5.1 → qwen3.5-122b (HIGH, IF=92)
- pipeline-judge: glm-5.1 → kimi-k2.6 (HIGH, matrix +8, 1M ctx)
- workflow-architect: glm-5.1 → qwen3.5-122b (HIGH, IF=92)
7 files changed, 12 insertions(+), 12 deletions(-)
Closes: model-research data gaps for idle models
2026-05-25 14:36:31 +01:00
Deploy Bot
e6e8e9cb2a
feat(workflow-cross-checker): add pre-flight inter-agent validation agent with gate protocol
...
- Create .kilo/agents/workflow-cross-checker.md as a process inspector
- Requires bash: ask, task: deny (subagent security compliant)
- Defines Role Boundaries clarifying it does NOT replace code-skeptic, planner, or capability-analyst
- Adds 7-question Uncomfortable Questions Protocol for architecture and conflict validation
- Adds Error Handling table (Gitea API failure, corrupted checkpoint, unreadable logs)
- Inserts Cross-Check Verification (Gate #1/#2/#3) into orchestrator state machine
- Registers agent in kilo-meta.json, kilo.jsonc, capability-index.yaml, AGENTS.md, KILO_SPEC.md
- Model: ollama-cloud/kimi-k2.6 (higher IF 91, better instruction following for structured verdicts)
2026-05-24 00:11:25 +01:00
Kilo Orchestrator
ded8e3022d
feat(parallel-coordination): evolution — Gitea comment-based task claiming for parallel agent execution
...
New rule:
- parallel-coordination.md — claim protocol, overlap check, claim release, deadlock prevention
Updated:
- orchestrator.md — Overlap Verification MANDATORY before parallel spawn
- capability-index.yaml — implementation_phase parallel group with claim_protocol
- gns-agent-protocol.md — task_claim and task_claim_release event types
- EVOLUTION_LOG.md — evolution entry #6
Fixes: parallel agents writing to same files, migration collisions, worktree merge conflicts.
No new agent, no new Docker service (per TCA rule).
2026-05-18 16:13:33 +01:00
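The overlap check that this commit makes mandatory before a parallel spawn can be illustrated with a minimal Python sketch. It assumes each claim lists the file paths an agent intends to write; the `find_overlaps` helper and the shape of the claims mapping are hypothetical, and the actual protocol in parallel-coordination.md (Gitea comment-based claims) may differ.

```python
from __future__ import annotations

from itertools import combinations


def find_overlaps(claims: dict[str, set[str]]) -> dict[tuple[str, str], set[str]]:
    """Return the files claimed by more than one agent, keyed by agent pair.

    `claims` maps an agent name to the set of paths it intends to write
    (hypothetical structure, not taken from the repository).
    """
    overlaps: dict[tuple[str, str], set[str]] = {}
    # Compare every unordered pair of agents exactly once.
    for a, b in combinations(sorted(claims), 2):
        shared = claims[a] & claims[b]
        if shared:
            overlaps[(a, b)] = shared
    return overlaps


def can_spawn_in_parallel(claims: dict[str, set[str]]) -> bool:
    """Parallel spawn is allowed only when no two claims share a file."""
    return not find_overlaps(claims)
```

An empty result means the agents can run side by side; any shared path would have to be resolved (for example by serializing those two agents) before spawning.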
Kilo Orchestrator
46d6752890
feat(context-window): evolution — Gitea-centric checkpoint pruning + agent context hygiene
...
New rules:
- context-window-budget.md — budget per task size, what to load/offload, recovery protocol
- gns-checkpoint-pruning.md — minimal checkpoint v2 schema, agent entry/exit protocols
Updated:
- orchestrator.md — Context Budget Governance section (prune if consumed > 80%)
- gns-agent-protocol.md — checkpoint schema trimmed (history → history_tail), added current_task + agent_chain
- EVOLUTION_LOG.md — logged evolution entry #5
Fixes: context window overflow, agents loading 15,000+ tokens of irrelevant comments,
state held in RAM instead of offloaded to Gitea.
2026-05-18 15:54:15 +01:00
Kilo Orchestrator
4e9ea678bd
feat(orchestrator): evolution — capability-first routing, parallelization, zero-work policy
...
- orchestrator.md: add Capability-First Routing Protocol (5-step anti-regression)
- orchestrator.md: add Testing Task Routing Matrix (browser-automation, visual-tester)
- orchestrator.md: add Parallelization Protocol (review_phase + testing_phase parallel groups)
- orchestrator.md: add Orchestrator Self-Delegation Prohibition (ZERO WORK POLICY)
- capability-index.yaml: enrich parallel_groups with trigger/criteria/aggregator
- capability-index.yaml: enrich iteration_loops with trigger_on fields
- global.md: add Orchestrator Capability-First Check under Tooling Infrastructure
- docker.md: add Host Installation Prohibition (STOP/READ/DELEGATE/REPORT)
- EVOLUTION_LOG.md: log both evolution entries (2026-05-16T13:00 and 13:06)
Addresses: orchestrator host tool install regression, serial execution waste,
orchestrator self-work bypass of specialized agents.
2026-05-16 13:10:06 +01:00
NW
f65bbf9420
feat: add visual quality rules to frontend-developer agent + new screenshot page
2026-05-13 16:54:29 +01:00
NW
2287122f91
fix(agents): add Tool-First Enforcement to agent definitions and global rules
2026-05-13 09:37:40 +01:00
NW
4c9a95661f
evolution: remove obsolete :cloud suffix from kimi-k2.6 model id across all configs
2026-05-13 09:27:48 +01:00
NW
c031c4b9e5
feat(evolution): add incident-responder agent for server incident response and forensics
2026-05-09 13:31:20 +01:00
NW
bd154f24d0
feat(gns2): mass-update all 30 agents with GNS-2 protocol
...
- 29 agents updated with GNS-2 checkpoint/event protocol
- 12 Tier 0 (leaf) agents: read checkpoint, write event footer, no cascade
- 17 Tier 1 (task) agents: read checkpoint, recommend next agent, no direct task calls
- 3 Tier 2 (meta) agents already updated: capability-analyst, agent-architect, evaluator
- All agents now include GNS_EVENT footer template in comments
- Frontmatter updated with '(GNS-2 Tier N)' classification
Scripts added:
- scripts/mass-update-gns-agents.py — idempotent mass updater
- scripts/validate-gns-agents.py — protocol checker
Refs: Milestone #67, Issues #99-#107
2026-05-08 22:03:08 +01:00
NW
47b027a02f
feat(gns2): Gitea-Nervous-System v2.0 - distributed agent state machine
...
- Add GNS-2 label taxonomy (66 labels) with semantic routing
- Tier 2 agents (capability-analyst, agent-architect, evaluator) enabled for self-cascade
- GNS agent protocol: checkpoint v2 in issue body, machine-readable event footers
- GiteaClient extended: checkpoint CRUD, event parsing, assignee/lock control, triggered issue polling
- PipelineRunner rewritten as PollingSupervisor: reactive instead of active dispatch
- Security: circuit breakers (is_locked), budget governance, depth limits
- Scripts: init-gns-labels.py, validate-gns-agents.py
- Milestone #67 + 7 phase issues (#99-#105) tracking evolution
Refs: Milestone #67, Issues #99-#105
2026-05-08 21:25:38 +01:00
NW
f01e2064fb
feat(evolution): Kilo Code release sync & APAW system hardening (v2026-05-07)
...
Security & Permissions:
- All 30 agents: task[*]=deny, task[subagent]=deny (cascade prevention)
- orchestrator & release-manager: bash=ask (hardening)
- New .kilo/rules/subagent-security.md with audit rules
- Updated .kilo/rules/global.md with Security & Permissions section
- Updated .kilo/agents/orchestrator.md with Security Enforcement block
Session Management:
- New .kilo/rules/session-persistence.md (checkpoint format, worktree isolation)
- Updated .kilo/rules/branch-strategy.md (worktree per agent)
- pipeline-runner.ts: Checkpoint interface + save/load/resume methods
Plan Persistence:
- Updated .kilo/rules/lead-developer.md (plan handover section)
Per-Agent Reasoning:
- capability-index.yaml: reasoning_effort for all 30 agents (xhigh/high/medium/low)
MCP Cleanup:
- New .kilo/skills/docker-security/SKILL.md (--rm, orphaned process cleanup)
Config Validation:
- Updated .kilo/rules/docker.md (startup checks, commit scoping, location awareness)
Docs:
- README.md: v2026-05-07 evolution badges
- .kilo/EVOLUTION_LOG.md: Entry #6 with full metrics
- .gitignore: ignore dist/ + bun.lock
Gitea: Milestone #66, Issues #91-#98
Architect: 9/9 sections fresh (express project type)
2026-05-08 18:54:08 +01:00
NW
994ca58821
fix(agents): add missing permissions + complete kilo-meta.json
...
- Fix 12 agents missing edit/write/bash permissions
- Add 5 missing agents to kilo-meta.json (architect-indexer, flutter-developer, php-developer, pipeline-judge, python-developer)
- Remove BOM from kilo.jsonc
- All 32 agents now consistent between files and meta
2026-05-07 07:22:32 +01:00
¨NW¨
80dca09ae0
fix: unquoted color, duplicate key, GLM downgrade + cross-platform validator
...
- Fix security-auditor.md color bare hex to quoted
- Fix orchestrator.md duplicate devops-engineer key
- Fix .kilo/kilo.jsonc: orchestrator + root model to kimi-k2.6:cloud
- Update agent-frontmatter-validation.md with diagnostic guide
- Update global.md with YAML frontmatter rules for all agents
- Update agent-architect.md + workflow-architect.md with color checklist
- Add scripts/validate-agents.cjs: zero-dependency, cross-platform, --fix flag, scans worktrees
2026-05-04 22:01:45 +01:00
¨NW¨
fb552e0020
feat: v3 optimal model assignments + fitness gate
...
- Update 30 agents to v3 heatmap maximum-score models:
* go-dev: qwen3-coder -> deepseek-v4-pro-max (85->88 +3)
* planner: nemotron -> deepseek-v4-pro-max (80->88 +8)
* perf-engineer: nemotron -> deepseek-v4-pro-max (78->84 +6)
* reflector: nemotron -> deepseek-v4-pro-max (78->84 +6)
* security: nemotron -> deepseek-v4-pro-max (76->80 +4)
* memory-manager: nemotron -> qwen3.6-plus (86->87 +1)
* frontend: kimi-k2.5 -> minimax-m2.5 (92)
* the-fixer: minimax-m2.5 -> kimi-k2.6 (88->90 +2)
* browser-auto: kimi-k2.6 -> qwen3-coder (86->87 +1)
* prompt-opt: glm-5.1 -> qwen3.6-plus (82->83 +1)
* backend: deepseek-v3.2 -> qwen3-coder (91)
* capability-analyst: nemotron -> glm-5.1 (85)
* release-man: devstral-2 -> glm-5.1 (82)
* evaluator: nemotron -> glm-5.1 (86)
* workflow-arch: gpt-oss -> glm-5.1 (84)
- Add Model Evolution Guard:
* fitness-gate.cjs: rejects downgrades >3 points or <75 score
* Normalized model ID lookup (: vs -)
* Diff report before any file modifications
- Update sync-benchmarks-from-yaml.cjs with fitness gate
- Sync kilo-meta.json, kilo.jsonc, .md agent files
- Rebuild research-dashboard.html (104KB, 30 agents, 11 models)
Total improvement: +105 points across 11 agents
Source: v3.html heatmap IF-adjusted composite scores
2026-04-30 08:42:10 +01:00
¨NW¨
9e48a4960e
fix: restore optimal v3 models + add fitness gate protection
...
- Restore all 30 agents to v3.html heatmap optimal models:
* frontend-developer: qwen3-coder -> minimax-m2.5 (92★)
* devops-engineer: nemotron-3-super -> kimi-k2.6:cloud (88★)
* browser-automation: qwen3-coder -> kimi-k2.6:cloud (86★)
* agent-architect: glm-5.1 -> kimi-k2.6:cloud (86★)
- Add Model Evolution Guard system:
* agent-evolution/scripts/lib/fitness-gate.cjs
* Rejects downgrades >3 points or below score 75
* Produces detailed diff report before any file modifications
* Normalized model ID lookup (v3.html ':' vs JSON '-')
- Update sync-benchmarks-from-yaml.cjs with fitness gate
- Update model-benchmarks.json with v3 optimal assignments
- Rebuild research-dashboard.html (104KB, 30 agents, 11 models)
- Add model-evolution-guard.md architecture documentation
- Add v3-optimal-models.json as source-of-truth reference
Fixes regression introduced by commit 3badb25 where models were
silently downgraded from heatmap optimal to inferior assignments.
2026-04-29 23:19:16 +01:00
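The Model Evolution Guard rules stated in this commit (reject downgrades of more than 3 points or any score below 75, with model IDs normalized across ':' and '-') can be sketched in Python as follows. This is an illustration only, not the contents of fitness-gate.cjs: the function names, the score table shape, and the rule that an unscored proposed model is rejected are all assumptions.

```python
from __future__ import annotations

MIN_SCORE = 75  # from the commit message: reject anything below score 75
MAX_DROP = 3    # from the commit message: reject downgrades of more than 3 points


def normalize_model_id(model_id: str) -> str:
    """Treat ':' and '-' as the same separator so both ID spellings match."""
    return model_id.strip().lower().replace(":", "-")


def fitness_gate(current: str, proposed: str, scores: dict[str, float]) -> tuple[bool, str]:
    """Decide whether switching from `current` to `proposed` is allowed.

    `scores` maps model IDs to scores for one agent (hypothetical shape).
    """
    table = {normalize_model_id(k): v for k, v in scores.items()}
    new = table.get(normalize_model_id(proposed))
    old = table.get(normalize_model_id(current))
    if new is None:
        # Assumption: an unscored model is rejected rather than waved through.
        return False, f"no score for {proposed}"
    if new < MIN_SCORE:
        return False, f"{proposed} scores {new}, below {MIN_SCORE}"
    if old is not None and old - new > MAX_DROP:
        return False, f"downgrade of {old - new} points exceeds {MAX_DROP}"
    return True, "ok"
```

Returning a reason string alongside the verdict makes it straightforward to assemble a diff report before any file is modified.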
¨NW¨
3badb259cc
feat: bidirectional research dashboard + agent config fixes
...
- Integrate apaw_agent_model_research_v3.html as standalone dashboard
- Add model-benchmarks.json with 32 agents, 11 scored models, 11 recommendations
- Add build-research-dashboard.ts: inject live data into template → standalone HTML
- Add rebuild-template.cjs: regenerate template from v3.html source
- Add sync-benchmarks-from-yaml.cjs: sync YAML → JSON round-trip
- Add sync-model-research.ts: apply recommendation matrix to config files
- Add model-benchmarks.schema.json and model-research.schema.json for validation
- Add bidirectional-data-flow.md architecture documentation
- Add log-execution.cjs pipeline hook
- Update capability-index.yaml: add fallback_models, failover_strategy
- Update kilo-meta.json, kilo.jsonc, KILO_SPEC.md with synced models
- Update evolution.md / research.md / self-evolution.md / evolutionary-sync.md docs
- Fix security-auditor.md: quote YAML color (#DC2626)
- Fix orchestrator.md: remove duplicate devops-engineer key
- Build research-dashboard.html (106KB standalone) + dated archive
2026-04-29 21:04:22 +01:00
¨NW¨
dbea8c90db
feat: evolutionary agent model upgrades based on recommendation matrix
...
- devops-engineer: deepseek-v3.2 → kimi-k2.6:cloud (★88)
- browser-automation: glm-5 → kimi-k2.6:cloud (★86)
- visual-tester: glm-5 → qwen3-coder:480b (★82)
- agent-architect: nemotron-3-super → kimi-k2.6:cloud (★86)
- orchestrator: glm-5 → kimi-k2.6:cloud (dispatch critical)
- product-owner: glm-5 → glm-5.1 (★84)
- prompt-optimizer: qwen3.6-plus:free → glm-5.1 (stable fallback)
- system-analyst: qwen3.6-plus:free → glm-5.1 (★90)
- Add autonomous-mode.md rule for zero-confirmation workflow
2026-04-27 12:09:36 +01:00
¨NW¨
af43eaef80
Merge remote-tracking branch 'origin/agent-sync-features'
2026-04-24 07:21:39 +01:00
¨NW¨
6b71ea2b57
feat: add .architect/ project mapping system with architect-indexer agent and Docker containerization
...
- Add .architect/ directory structure (10 template files) as project brain for agent orientation
- Add architect-indexer agent that scans codebase and generates structured architecture docs
- Add Docker containerization: Dockerfile.architect-indexer, docker-compose.architect.yml
- Add TypeScript project-mapper module with staleness detection and context injection
- Add /index-project command, architect-first-contact rule, project-mapping skill
- Integrate orchestrator first-contact check: triggers indexing before any task delegation
- Add npm arch:* scripts for Docker-based indexing workflow
- Register agent in capability-index.yaml and AGENTS.md
2026-04-22 20:01:38 +01:00
¨NW¨
7445e66676
feat: add Next.js, Vue/Nuxt, React, Python (Django/FastAPI) skills and agents
...
- python-developer agent: Django/FastAPI backend specialist
- nextjs-patterns skill: App Router, Server Components, Server Actions, Auth.js
- vue-nuxt-patterns skill: Composition API, Pinia, Nitro server, SSR
- react-patterns skill: hooks, Context, TanStack Query, React Hook Form
- python-django-patterns skill: DRF, services, repositories
- python-fastapi-patterns skill: async, Pydantic, SQLAlchemy, dependencies
- /nextjs pipeline command for full-stack Next.js apps
- /vue pipeline command for full-stack Vue/Nuxt apps
- Updated frontend-developer with framework-specific skills
- Updated orchestrator, capability-index for Python + frontend routing
- Updated README, STRUCTURE, EVOLUTION_LOG with all new stacks
Total agents: 30. Stacks: PHP, Next.js, Vue/Nuxt, React, Python, Go, Flutter, Node.js
2026-04-19 10:04:51 +01:00
¨NW¨
b46a1a20a8
feat: add PHP development stack, atomic tasks, modular code rules, agent monitoring, fix target project detection
...
7 evolutionary tasks implemented:
1. PHP web development: php-developer agent + 6 skills (Laravel, Symfony, WordPress, security, testing, modular architecture) + 2 pipeline commands (/laravel, /wordpress)
2. Atomic task decomposition: 1 action = 1 task rule, task sizing guide, decomposition protocol for orchestrator, token budgets per complexity
3. Modular code rules: max 100 lines/file, max 30 lines/function, service/repository patterns, cross-module communication via events only
4. Gitea-centric workflow: mandatory issue creation before work, research with links, progress checkboxes, screenshots on test, git history as knowledge base
5. Fix: target project auto-detection — removed all hardcoded UniqueSoft/APAW from API calls, added get_target_repo() via git remote, GITEA_TARGET_REPO env override
6. Agent execution monitoring: agent-executions.jsonl logging, agent-stats.ts statistics script, required fields per invocation, Gitea comment includes duration/tokens
7. Token optimization: 1 action = 1 task principle, token budgets by task type, routing matrix, no scope creep, skip unnecessary pipeline steps
2026-04-18 23:43:04 +01:00
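The target project auto-detection from item 5 can be sketched in Python as below. The `GITEA_TARGET_REPO` variable and the `get_target_repo` name come from the commit message; the `parse_remote_url` helper, the `origin` remote name, the `owner/repo` return format, and the behavior when no remote exists are assumptions made for illustration.

```python
from __future__ import annotations

import os
import re
import subprocess


def parse_remote_url(url: str) -> str | None:
    """Extract 'owner/repo' from an HTTPS or SSH-style git remote URL."""
    match = re.search(r"[:/]([^/:]+)/([^/]+?)(?:\.git)?/?$", url.strip())
    return f"{match.group(1)}/{match.group(2)}" if match else None


def get_target_repo(env: dict[str, str] | None = None, remote: str = "origin") -> str | None:
    """Resolve the target repository: env override first, then the git remote."""
    env = dict(os.environ) if env is None else env
    override = env.get("GITEA_TARGET_REPO")
    if override:
        return override
    try:
        # `git remote get-url <name>` prints the configured URL for that remote.
        result = subprocess.run(
            ["git", "remote", "get-url", remote],
            capture_output=True, text=True, check=True,
        )
    except (subprocess.CalledProcessError, FileNotFoundError):
        return None  # not a git checkout, no such remote, or git not installed
    return parse_remote_url(result.stdout)
```

Checking the environment variable first keeps the override usable in CI or containers where the checkout has no remote configured.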
¨NW¨
28a3b648cc
refactor(prompts): compress 29 agents (-77%) and 7 rules (-50%), delete 2 duplicates
...
Agents: 6,235 → 1,454 lines (-77%). Each agent compressed to Role/Behavior/Delegates/Output/Handoff format.
Gitea commenting extracted to shared block (.kilo/shared/gitea-commenting.md).
Self-evolution protocol extracted to shared block (.kilo/shared/self-evolution.md).
Gitea API client centralized (.kilo/shared/gitea-api.md).
Rules: 2,358 → 1,189 lines (-50%). Deleted sdet-engineer.md (duplicate of agent)
and orchestrator-self-evolution.md (moved to shared/). Compressed docker (549→26),
flutter (521→28), go (283→21), nodejs (271→27), code-skeptic (59→14) to checklists
with skill references.
Fitness: 54/54 tests pass, 29/29 agents validated, fitness=0.92
2026-04-18 13:49:24 +01:00
¨NW¨
2573d81cff
refactor: remove CBS-specific e2e-booking flow — belongs to CBS project, not APAW starter
2026-04-17 20:21:29 +01:00
NW
c258d16ef5
feat: add Gitea integration, E2E booking flow, Docker DNS fix, browser-launcher module
...
- Add tests/scripts/lib/gitea-client.js: Gitea API client with auth, comments,
attachments, and Markdown report formatters for visual and console reports
- Add tests/scripts/lib/browser-launcher.js: shared Playwright launch config with
--dns-resolution-order=hostname-first, realistic UA, and navigateTo() helper
using waitUntil:'commit' + waitForLoadState('domcontentloaded')
- Add tests/scripts/e2e-booking-flow-v2.js: full E2E scenario for irina-vik.ru
(register → book service → login → personal cabinet) with Gitea reporting
- Update visual-test-pipeline.js: GITEA_ISSUE env var, Gitea comment+attachment
posting, browser-launcher integration, waitUntil:'commit' navigation
- Update console-error-monitor-standalone.js: same Gitea + DNS fixes
- Update capture-screenshots.js: browser-launcher integration, DNS fix
- Update docker-compose.web-testing.yml: NETWORK_MODE env var (bridge),
DNS_RESOLUTION_ORDER, GITEA_USER/PASSWORD env passthrough, e2e-booking service
- Update tests/package.json: pin playwright to exact 1.52.0 (matches Docker image)
- Update .gitignore: add tests/visual/e2e/ for E2E screenshots
- Update .kilo/agents/visual-tester.md: Docker networking note, Gitea scripts,
e2e-booking service, updated script table
- Update .kilo/commands/web-test.md: Docker Networking section, --issue flag,
Gitea Integration section, e2e-booking service
- Update .kilo/commands/e2e-test.md: complete rewrite — Docker-based Playwright,
no more MCP dependency, proper service table, Gitea integration docs
- Update .kilo/capability-index.yaml: add gitea_integration, e2e_booking_flow,
docker_networking capabilities to visual-tester; add routing entries
2026-04-17 09:27:27 +01:00
NW
3a8aa6b416
docs: update visual testing agent docs, remove test artifacts from git, add pipeline documentation
...
- Remove baseline screenshots from git tracking (test artifacts, not code)
- Add tests/visual/baseline/ to .gitignore
- Rewrite .kilo/agents/visual-tester.md: Docker-first pipeline, bbox extraction, console/network error detection
- Rewrite .kilo/commands/web-test.md: accurate commands, output format, agent flow
- Update .kilo/capability-index.yaml: add bbox_extraction, console_error_detection, button_overflow_detection to visual-tester
- Update AGENTS.md: add /web-test and /e2e-test commands, update visual-tester description
2026-04-16 22:48:46 +01:00
NW
e19fa3effd
refactor: full agent system revision — migrate to GLM-5.1, fix delegation chains, audit consistency
...
- Migrate 8 agents from openrouter/qwen3.6-plus:free to ollama-cloud/glm-5.1
- Assign thinking/variant/instant depth by role complexity
- Fix broken delegation chains: system-analyst, all developer agents, and devops-engineer can now reach orchestrator
- Add task permissions to browser-automation, visual-tester, capability-analyst, markdown-validator
- Add visual-tester permission to flutter-developer and frontend-developer
- Fix capability-index.yaml routing map indentation (go_* keys misplaced)
- Add delegates_to and variant fields to capability-index.yaml
- Update KILO_SPEC.md agent table with Variant column
- Update AGENTS.md with Model/Variant/CanCall columns
- Update kilo.jsonc ask agent model
- Fix YAML indentation in capability-analyst.md and markdown-validator.md
- Update agent-architect.md template models (remove gpt-oss, qwen3.6-plus)
- Add Skills Reference tables to 7 previously unlinked agents
- Full audit: 10/10 consistency checks passed
2026-04-12 22:38:41 +01:00
¨NW¨
e074612046
feat: add web testing infrastructure
...
- Docker configurations for Playwright MCP (no host pollution)
- Visual regression testing with pixelmatch
- Link checking for 404/500 errors
- Console error detection with Gitea issue creation
- Form testing capabilities
- /web-test and /web-test-fix commands
- web-testing skill documentation
- Reorganize project structure (docker/, scripts/, tests/)
- Update orchestrator model to ollama-cloud/glm-5
Structure:
- docker/ - Docker configurations (moved from archive)
- scripts/ - Utility scripts
- tests/ - Test suite with visual, console, links testing
- .kilo/commands/ - /web-test and /web-test-fix commands
- .kilo/skills/ - web-testing skill
Issues: #58 #60 #62
2026-04-07 08:55:24 +01:00
¨NW¨
b9abd91d07
feat: orchestrator evolution — full access + model upgrades + self-evolution protocol
...
- Add 9 missing agents to orchestrator task whitelist (20→28 agents)
- Fix 2 broken agents: debug (gpt-oss:20b→qwen3.6-plus), release-manager (devstral-2→qwen3.6-plus)
- Upgrade orchestrator (glm-5→qwen3.6-plus, IF:80→90, 128K→1M context)
- Upgrade pipeline-judge (nemotron→qwen3.6-plus, IF:85→90)
- Add orchestrator escalation path to 7 agents (lead-dev, sdet, skeptic, perf, security, evaluator, devops)
- Create self-evolution protocol (.kilo/rules/orchestrator-self-evolution.md)
- Create evolution log (.kilo/EVOLUTION_LOG.md)
- Full audit of all 29 agents with verification tests
2026-04-06 22:55:12 +01:00
¨NW¨
ae471dcd6b
docs: remove Docker references from pipeline-judge
...
Use local bun runtime only for evolution testing.
2026-04-06 01:35:29 +01:00
¨NW¨
1703247651
feat: add Docker-based evolution testing with precise measurements
...
- Add docker/evolution-test/Dockerfile with bun, TypeScript
- Add docker/evolution-test/docker-compose.yml for parallel workflow testing
- Add run-evolution-test.sh and .bat scripts for cross-platform use
- Update pipeline-judge.md with Docker-first approach:
- Millisecond precision timing (date +%s%3N)
- 2 decimal places for test pass rate and coverage
- Docker container for consistent test environment
- Multiple workflow types (feature/bugfix/refactor/security)
Enables:
- Parallel testing with docker-compose
- Consistent environment across machines
- Precise fitness measurements (ms, 2 decimals)
- Multi-workflow testing in containers
2026-04-06 00:48:21 +01:00
¨NW¨
fa68141d47
feat: add pipeline-judge agent and evolution workflow system
...
- Add pipeline-judge agent for objective fitness scoring
- Update capability-index.yaml with pipeline-judge, evolution config
- Add fitness-evaluation.md workflow for auto-optimization
- Update evolution.md command with /evolve CLI
- Create .kilo/logs/fitness-history.jsonl for metrics logging
- Update AGENTS.md with new workflow state machine
- Add 6 new issues to MILESTONE_ISSUES.md for evolution integration
- Preserve ideas in agent-evolution/ideas/
Pipeline Judge computes fitness = (test_rate*0.5) + (gates*0.25) + (efficiency*0.25)
Auto-triggers prompt-optimizer when fitness < 0.70
2026-04-06 00:23:50 +01:00
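The fitness formula and the 0.70 trigger described in this commit can be written out as a short Python sketch. The weights and threshold are taken from the commit message; the function names and the assumption that all three inputs are already normalized to the range 0 to 1 are illustrative.

```python
FITNESS_THRESHOLD = 0.70  # below this, the prompt-optimizer is auto-triggered


def fitness(test_rate: float, gates: float, efficiency: float) -> float:
    """Weighted fitness: 50% test pass rate, 25% quality gates, 25% efficiency.

    All inputs are assumed to be normalized to [0, 1].
    """
    return test_rate * 0.5 + gates * 0.25 + efficiency * 0.25


def needs_prompt_optimizer(score: float) -> bool:
    """True when the score falls below the auto-optimization threshold."""
    return score < FITNESS_THRESHOLD
```

For example, a run with an 80% test pass rate and half marks on gates and efficiency scores about 0.65, which would trigger the prompt-optimizer.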
¨NW¨
1ab9939c92
fix: correct OpenRouter model paths across all files
...
Fixed format from 'qwen/...' to 'openrouter/qwen/...' for:
- product-owner.md
- prompt-optimizer.md
- workflow-architect.md
- status.md, blog.md, booking.md, commerce.md
- kilo.jsonc (default model + ask agent)
- agent-frontmatter-validation.md
- agent-versions.json (recommendations and history)
2026-04-05 23:47:14 +01:00
¨NW¨
6ba325cec5
fix: correct model path format for OpenRouter
...
Changed qwen/qwen3.6-plus:free to openrouter/qwen/qwen3.6-plus:free
for capability-analyst, agent-architect, and evaluator agents.
2026-04-05 23:42:32 +01:00
¨NW¨
a4e09ad5d5
feat: upgrade agent models based on research findings
...
- capability-analyst: nemotron-3-super → qwen3.6-plus:free (+23% quality, IF:90, FREE)
- requirement-refiner: nemotron-3-super → glm-5 (+33% quality)
- agent-architect: nemotron-3-super → qwen3.6-plus:free (+22% quality)
- evaluator: nemotron-3-super → qwen3.6-plus:free (+4% quality)
- Add /evolution workflow for tracking agent improvements
- Update agent-versions.json with evolution history
2026-04-05 23:37:23 +01:00
¨NW¨
4af7355429
feat: update agent models based on research recommendations
...
- requirement-refiner: kimi-k2-thinking -> nemotron-3-super (1M context for specs)
- history-miner: glm-5 -> nemotron-3-super (better git search, 1M context)
- capability-analyst: gpt-oss:120b -> nemotron-3-super (gap analysis improvement)
- agent-architect: gpt-oss:120b -> nemotron-3-super (agent design, 1M context)
- prompt-optimizer: gpt-oss:120b -> qwen3.6-plus:free (FREE on OpenRouter)
- product-owner: glm-5 -> qwen3.6-plus:free (FREE on OpenRouter, 1M context)
- evaluator: gpt-oss:120b -> nemotron-3-super (quality scoring)
- markdown-validator: nemotron-3-nano:30b -> gemma4:26b (better validation)
- debug (kilo.jsonc): gpt-oss:20b -> gemma4:31b (Intelligence Index 39)
- devops-engineer: NEW -> nemotron-3-super (Docker, K8s, CI/CD)
- flutter-developer: NEW -> qwen3-coder:480b (Dart/Flutter support)
Synced all agent models between capability-index.yaml and agent/*.md files.
Validated YAML and JSON5 configs.
2026-04-05 20:28:47 +01:00
¨NW¨
b899119d21
feat: add html-to-flutter skill and research report
...
- Add .kilo/skills/html-to-flutter/SKILL.md
- HTML parsing patterns with html package
- CSS to Flutter style mapping
- Widget tree generation from HTML templates
- flutter_html integration (608k downloads, 2.1k likes)
- Design-time code generation patterns
- Responsive layout conversion (flexbox/grid → Row/Column)
- Form, Card, Navigation conversion examples
- Update flutter-developer agent
- Reference html-to-flutter skill
- Add HTML template conversion workflow
- Integration with flutter_html package
- Add research report .kilo/reports/flutter-cycle-analysis.md
- Gap analysis: HTML→Flutter conversion (critical)
- Testing gap analysis
- Network/API gap analysis
- Storage gap analysis
- Implementation priority and recommendations
- Complete workflow for HTML Template + technical specification → Flutter App
Research sources:
- flutter_html 3.0.0 (2.1k likes, 608k downloads)
- go_router 17.2.0 (5.6k likes, 2.31M downloads)
- flutter_riverpod 3.3.1 (2.8k likes, 1.61M downloads)
- freezed 3.2.5 (4.4k likes, 1.83M downloads)
Closes: HTML template input workflow for Flutter development
2026-04-05 17:26:02 +01:00
¨NW¨
af5f401a53
feat: add Flutter development support with agent, rules and skills
...
- Add flutter-developer agent (.kilo/agents/flutter-developer.md)
- Role definition for cross-platform mobile development
- Clean architecture templates (Domain/Presentation/Data)
- State management patterns (Riverpod, Bloc, Provider)
- Widget patterns, navigation, platform channels
- Build & release commands
- Performance and security checklists
- Add Flutter development rules (.kilo/rules/flutter.md)
- Code style guidelines (const, final, trailing commas)
- Widget architecture best practices
- State management requirements
- Error handling, API & network patterns
- Navigation, testing, performance
- Security and localization
- Prohibitions list
- Add Flutter skills:
- flutter-state: Riverpod, Bloc, Provider patterns
- flutter-widgets: Widget composition, responsive design
- flutter-navigation: go_router, deep links, guards
- Update AGENTS.md: add @flutter-developer to Core Development
- Update kilo.jsonc: configure flutter-developer and go-developer agents
2026-04-05 17:04:13 +01:00
¨NW¨
6c4756f8b4
fix: correct agent modes from 'all' to 'subagent'
...
These agents are invoked by other agents (orchestrator/evaluator), not directly by user:
- agent-architect: invoked by capability-analyst
- browser-automation: invoked by orchestrator for E2E testing
- history-miner: invoked by orchestrator during [planned] phase
- product-owner: invoked by evaluator for process improvements
- prompt-optimizer: invoked by evaluator when score < 7
- system-analyst: invoked by orchestrator during [researching] phase
- visual-tester: invoked by orchestrator for visual regression
Mode 'all' should be used only for agents that can be both
primary (user-facing) and subagent (invoked by other agents).
2026-04-05 16:19:18 +01:00
¨NW¨
8661c9719f
feat: add devops-engineer agent and validation rules
...
- Add devops-engineer agent (Docker, Kubernetes, CI/CD)
- Add Docker Security Checklist to security-auditor
- Add skill references to backend-developer, go-developer
- Add task permissions to frontend-developer
- Add devops-engineer permission to orchestrator
- Add agent-frontmatter-validation.md rule (prevents YAML errors)
Total: 429 insertions in agents + validation rules
2026-04-05 16:11:31 +01:00
¨NW¨
b517ad5dad
feat: add synchronization system for agent definitions
...
- Add kilo.jsonc (official Kilo Code config)
- Add kilo-meta.json (source of truth for sync)
- Add evolutionary-sync.md rule for documentation
- Add scripts/sync-agents.cjs for validation
- Fix agent mode mismatches (8 agents had wrong mode)
- Update KILO_SPEC.md and AGENTS.md
The sync system ensures:
- kilo-meta.json is the single source of truth
- Agent .md files frontmatter matches meta
- KILO_SPEC.md tables stay synchronized
- AGENTS.md category tables stay synchronized
Run: node scripts/sync-agents.cjs --check
Fix: node scripts/sync-agents.cjs --fix
2026-04-05 13:19:54 +01:00
¨NW¨
0a854a3bc3
fix: add missing agent permissions and update orchestrator mappings
...
- Add devops-engineer permission to orchestrator
- Add BrowserAutomation to orchestrator mappings
- Add code-skeptic task permission to devops-engineer, backend-developer, frontend-developer, go-developer
- Add security-auditor task permission to devops-engineer
- Add Task Tool Invocation section to agent files
- Add go-developer to AGENTS.md Core Development table
- Update KILO_SPEC.md with go-developer agent
2026-04-05 13:02:32 +01:00
¨NW¨
43747d9875
feat: add Docker/DevOps skills and devops-engineer agent
2026-04-05 12:47:01 +01:00
¨NW¨
ebbb4d01b2
feat: optimize agent models for better performance
...
- Update agent-architect to nemotron-3-super for detailed specs
- Update capability-analyst to nemotron-3-super for deeper analysis
- Update history-miner to nemotron-3-super for better relevance
- Update prompt-optimizer to qwen3.6-plus via OpenRouter
- Fix system-analyst model path (remove openrouter prefix)
Testing showed ~28% quality improvement across cognitive agents
2026-04-05 12:15:30 +01:00
¨NW¨
caf77f53c8
feat: update agent models based on recommendations
...
- Update security-auditor to ollama-cloud/nemotron-3-super-120b-a12b
- Update performance-engineer to ollama-cloud/nemotron-3-super-120b-a12b
- Update memory-manager to ollama-cloud/nemotron-3-super-120b-a12b
- Update evaluator to ollama-cloud/nemotron-3-super-120b-a12b
- Update planner to ollama-cloud/nemotron-3-super-120b-a12b
- Update reflector to ollama-cloud/nemotron-3-super-120b-a12b
- Update system-analyst to ollama-cloud/glm-5
- Update go-developer to ollama-cloud/qwen3-coder:480b
- Update markdown-validator to ollama-cloud/nemotron-3-nano
- Update prompt-optimizer to ollama-cloud/nemotron-3-super-120b-a12b
- Update product-owner to ollama-cloud/glm-5
Based on archive/agent-model-recommendations.json analysis
2026-04-05 05:21:02 +01:00
¨NW¨
41eb4c7d4d
feat: add database skills for ClickHouse, PostgreSQL, and SQLite
...
- Add ClickHouse patterns skill
- Add PostgreSQL patterns skill
- Add SQLite patterns skill
- Update backend-developer agent to reference PostgreSQL and SQLite skills
- Update go-developer agent to reference ClickHouse, PostgreSQL, and SQLite skills
- Update capability-index.yaml with database integration capabilities
2026-04-05 05:00:55 +01:00
¨NW¨
be4c586c8f
feat: add Go Lang development coverage (Milestone #49)
...
- Add go-developer agent for Go backend development
- Add 8 Go skills: web-patterns, middleware, db-patterns,
error-handling, security, testing, concurrency, modules
- Add go.md rules file
- Update capability-index.yaml with Go capabilities
- Complete backend coverage for both NodeJS and Go
2026-04-05 03:40:32 +01:00
¨NW¨
8fcd8f8a9b
feat: add comprehensive NodeJS development skills and rules
...
Based on Planner and Memory Manager analysis:
New Skills (8):
- nodejs-express-patterns: App structure, routing, middleware
- nodejs-security-owasp: OWASP Top 10 security practices
- nodejs-testing-jest: Unit/integration tests, mocking
- nodejs-auth-jwt: JWT authentication, OAuth, sessions
- nodejs-error-handling: Error classes, middleware, async handlers
- nodejs-middleware-patterns: Auth, validation, rate limiting
- nodejs-db-patterns: SQLite, PostgreSQL, MongoDB patterns
- nodejs-npm-management: package.json, scripts, dependencies
New Rules:
- nodejs.md: Code style, security, best practices
Updated:
- backend-developer.md: Added skills reference table
Milestone: #48 NodeJS Development Coverage
Related: Planner & Memory Manager analysis results
2026-04-05 02:39:06 +01:00