NW
f5966db155
feat(gns2): integrate HybridGiteaClient into PollingSupervisor
...
- PollingSupervisor now uses HybridGiteaClient (MCP primary, REST fallback)
- Added mcpUrl to PipelineConfig
- Supervisor calls initialize() to detect MCP vs REST mode automatically
Refs: Milestone #67 , Issue #107
2026-05-08 22:35:21 +01:00
NW
06fb0421ef
fix(process-continuity): operator-free design for MCP Docker integration
...
- Resolve service_healthy deadlock by using service_started instead
- Fix 172.28.0.0/16 network collision by removing ipam config
- Add HybridGiteaClient (mcp → rest → bash fallback)
- Create .kilo/rules/process-continuity.md with 5 operator-free principles:
1. No service_healthy conditions
2. No hardcoded networks
3. Automatic fallback chains
4. Pre-flight validation
5. Self-documenting failures
- Update docker-compose.yml with resilient config:
- start_period: 60s, retries: 5, restart: on-failure:3
- /tools healthcheck (guaranteed endpoint)
- tmpfs for Node.js /tmp
- Resource limits: 256M RAM, 0.5 CPU
- MCP/REST integration test passed (issue #109 )
Refs: Milestone #67 , Issues #107 , #109
2026-05-08 22:31:59 +01:00
NW
3cc6ee2ffe
feat(gns2): Phase 8 MCP Docker containers for Gitea direct integration
...
- docker/mcp-gitea/docker-compose.yml — MCP server container (Sqcoows/forgejo-mcp)
- .kilo/skills/mcp-gitea-connection/SKILL.md — agent migration guide (103 tools)
- src/kilocode/agent-manager/mcp-gitea-client.ts — MCP native client with fallback
- Hybrid mode: MCP primary, REST API fallback if container unavailable
- All 29 Tier 0/1 agents mass-updated with GNS-2 protocol (checkpoint read, event footer)
- Security: no bash for Gitea ops, MCP handles credentials internally
Refs: Milestone #67 , Issue #107
2026-05-08 22:16:52 +01:00
NW
bd154f24d0
feat(gns2): mass-update all 30 agents with GNS-2 protocol
...
- 29 agents updated with GNS-2 checkpoint/event protocol
- 12 Tier 0 (leaf) agents: read checkpoint, write event footer, no cascade
- 17 Tier 1 (task) agents: read checkpoint, recommend next agent, no direct task calls
- 2 Tier 2 (meta) agents already updated: capability-analyst, agent-architect, evaluator
- All agents now include GNS_EVENT footer template in comments
- Frontmatter updated with '(GNS-2 Tier N)' classification
Scripts added:
- scripts/mass-update-gns-agents.py — idempotent mass updater
- scripts/validate-gns-agents.py — protocol checker
Refs: Milestone #67 , Issues #99-#107
2026-05-08 22:03:08 +01:00
NW
47b027a02f
feat(gns2): Gitea-Nervous-System v2.0 - distributed agent state machine
...
- Add GNS-2 label taxonomy (66 labels) with semantic routing
- Tier 2 agents (capability-analyst, agent-architect, evaluator) enabled for self-cascade
- GNS agent protocol: checkpoint v2 in issue body, machine-readable event footers
- GiteaClient extended: checkpoint CRUD, event parsing, assignee/lock control, triggered issue polling
- PipelineRunner rewritten as PollingSupervisor: reactive instead of active dispatch
- Security: circuit breakers (is_locked), budget governance, depth limits
- Scripts: init-gns-labels.py, validate-gns-agents.py
- Milestone #67 + 7 phase issues (#99-#105) tracking evolution
Refs: Milestone #67 , Issues #99-#105
2026-05-08 21:25:38 +01:00
NW
f01e2064fb
feat(evolution): Kilo Code release sync & APAW system hardening (v2026-05-07)
...
Security & Permissions:
- All 30 agents: task[*]=deny, task[subagent]=deny (cascade prevention)
- orchestrator & release-manager: bash=ask (hardening)
- New .kilo/rules/subagent-security.md with audit rules
- Updated .kilo/rules/global.md with Security & Permissions section
- Updated .kilo/agents/orchestrator.md with Security Enforcement block
Session Management:
- New .kilo/rules/session-persistence.md (checkpoint format, worktree isolation)
- Updated .kilo/rules/branch-strategy.md (worktree per agent)
- pipeline-runner.ts: Checkpoint interface + save/load/resume methods
Plan Persistence:
- Updated .kilo/rules/lead-developer.md (plan handover section)
Per-Agent Reasoning:
- capability-index.yaml: reasoning_effort for all 30 agents (xhigh/high/medium/low)
MCP Cleanup:
- New .kilo/skills/docker-security/SKILL.md (--rm, orphaned process cleanup)
Config Validation:
- Updated .kilo/rules/docker.md (startup checks, commit scoping, location awareness)
Docs:
- README.md: v2026-05-07 evolution badges
- .kilo/EVOLUTION_LOG.md: Entry #6 with full metrics
- .gitignore: ignore dist/ + bun.lock
Gitea: Milestone #66 , Issues #91-#98
Architect: 9/9 sections fresh (express project type)
2026-05-08 18:54:08 +01:00
NW
74ad7c4b6e
docs(branch-strategy): default branch is dev, not main
...
- Update branch strategy: dev is primary development branch
- main is stable release only
- Add release process: dev → PR → review → main → tag
- Sync .kilo/ to target projects after release
2026-05-07 07:39:00 +01:00
NW
994ca58821
fix(agents): add missing permissions + complete kilo-meta.json
...
- Fix 12 agents missing edit/write/bash permissions
- Add 5 missing agents to kilo-meta.json (architect-indexer, flutter-developer, php-developer, pipeline-judge, python-developer)
- Remove BOM from kilo.jsonc
- All 32 agents now consistent between files and meta
2026-05-07 07:22:32 +01:00
NW
defe57d53a
feat: merge infrastructure skills and workflows from TenerifeProp
...
Add MCP-based infrastructure skills:
- mcp-integration: Playwright + GitMCP
- e2e-testing: Cypress + AntV + Slack
- search-integration: Brave + Tavily + Markitdown
- security-scanner: CVE Search + MCP Validator
- knowledge-base: Docfork + Wikipedia + ArXiv
- prompt-manager: version control + DevTrends
- api-catalog: MCP server registry
- agent-architect-mcp: patterns + OpenAPI converter
Add workflow commands:
- feature.md: full feature pipeline
- hotfix.md: urgent bug fix workflow
Add rules:
- orchestrator-self-evolution.md
- sdet-engineer.md
Add audit:
- WORKFLOW_AUDIT.md
Source: UniqueSoft/TenerifeProp
2026-05-06 23:04:14 +01:00
¨NW¨
80dca09ae0
fix: unquoted color, duplicate key, GLM downgrade + cross-platform validator
...
- Fix security-auditor.md color bare hex to quoted
- Fix orchestrator.md duplicate devops-engineer key
- Fix .kilo/kilo.jsonc: orchestrator + root model to kimi-k2.6:cloud
- Update agent-frontmatter-validation.md with diagnostic guide
- Update global.md with YAML frontmatter rules for all agents
- Update agent-architect.md + workflow-architect.md with color checklist
- Add scripts/validate-agents.cjs: zero-dependency, cross-platform, --fix flag, scans worktrees
2026-05-04 22:01:45 +01:00
¨NW¨
fb552e0020
feat: v3 optimal model assignments + fitness gate
...
- Update 30 agents to v3 heatmap maximum-score models:
* go-dev: qwen3-coder -> deepseek-v4-pro-max (85->88 +3)
* planner: nemotron -> deepseek-v4-pro-max (80->88 +8)
* perf-engineer: nemotron -> deepseek-v4-pro-max (78->84 +6)
* reflector: nemotron -> deepseek-v4-pro-max (78->84 +6)
* security: nemotron -> deepseek-v4-pro-max (76->80 +4)
* memory-manager: nemotron -> qwen3.6-plus (86->87 +1)
* frontend: kimi-k2.5 -> minimax-m2.5 (92)
* the-fixer: minimax-m2.5 -> kimi-k2.6 (88->90 +2)
* browser-auto: kimi-k2.6 -> qwen3-coder (86->87 +1)
* prompt-opt: glm-5.1 -> qwen3.6-plus (82->83 +1)
* backend: deepseek-v3.2 -> qwen3-coder (91)
* capability-analyst: nemotron -> glm-5.1 (85)
* release-man: devstral-2 -> glm-5.1 (82)
* evaluator: nemotron -> glm-5.1 (86)
* workflow-arch: gpt-oss -> glm-5.1 (84)
- Add Model Evolution Guard:
* fitness-gate.cjs: rejects downgrades >3 points or <75 score
* Normalized model ID lookup (: vs -)
* Diff report before any file modifications
- Update sync-benchmarks-from-yaml.cjs with fitness gate
- Sync kilo-meta.json, kilo.jsonc, .md agent files
- Rebuild research-dashboard.html (104KB, 30 agents, 11 models)
Total improvement: +105 points across 11 agents
Source: v3.html heatmap IF-adjusted composite scores
2026-04-30 08:42:10 +01:00
¨NW¨
9e48a4960e
fix: restore optimal v3 models + add fitness gate protection
...
- Restore all 30 agents to v3.html heatmap optimal models:
* frontend-developer: qwen3-coder -> minimax-m2.5 (92★)
* devops-engineer: nemotron-3-super -> kimi-k2.6:cloud (88★)
* browser-automation: qwen3-coder -> kimi-k2.6:cloud (86★)
* agent-architect: glm-5.1 -> kimi-k2.6:cloud (86★)
- Add Model Evolution Guard system:
* agent-evolution/scripts/lib/fitness-gate.cjs
* Rejects downgrades >3 points or below score 75
* Produces detailed diff report before any file modifications
* Normalized model ID lookup (v3.html ':' vs JSON '-')
- Update sync-benchmarks-from-yaml.cjs with fitness gate
- Update model-benchmarks.json with v3 optimal assignments
- Rebuild research-dashboard.html (104KB, 30 agents, 11 models)
- Add model-evolution-guard.md architecture documentation
- Add v3-optimal-models.json as source-of-truth reference
Fixes regression introduced by commit 3badb25 where models were
silently downgraded from heatmap optimal to inferior assignments.
2026-04-29 23:19:16 +01:00
¨NW¨
d1516f4856
chore: organize temporary research artifacts into archive
...
- Create agent-evolution/archive/ with scripts/, reports/, data/
- Move 11 Python migration/diagnostic scripts
- Move 7 intermediate report files (json, md, txt)
- Move test data and old dashboard builds
- Add archive/README.md with full index of contents
- Update .gitignore to exclude archive/scripts, reports, data
- Keep archive/README.md tracked for documentation
2026-04-29 21:14:23 +01:00
¨NW¨
3badb259cc
feat: bidirectional research dashboard + agent config fixes
...
- Integrate apaw_agent_model_research_v3.html as standalone dashboard
- Add model-benchmarks.json with 32 agents, 11 scored models, 11 recommendations
- Add build-research-dashboard.ts: inject live data into template → standalone HTML
- Add rebuild-template.cjs: regenerate template from v3.html source
- Add sync-benchmarks-from-yaml.cjs: sync YAML → JSON round-trip
- Add sync-model-research.ts: apply recommendation matrix to config files
- Add model-benchmarks.schema.json and model-research.schema.json for validation
- Add bidirectional-data-flow.md architecture documentation
- Add log-execution.cjs pipeline hook
- Update capability-index.yaml: add fallback_models, failover_strategy
- Update kilo-meta.json, kilo.jsonc, KILO_SPEC.md with synced models
- Update evolution.md / research.md / self-evolution.md / evolutionary-sync.md docs
- Fix security-auditor.md: quote YAML color (#DC2626)
- Fix orchestrator.md: remove duplicate devops-engineer key
- Build research-dashboard.html (106KB standalone) + dated archive
2026-04-29 21:04:22 +01:00
¨NW¨
2ae7789802
fix: sync kilo.jsonc + capability-index.yaml after evolution upgrade
...
- kilo.jsonc: manual fix 7 agent models (sync script does not write back)
- capability-index.yaml: orchestrator model glm-5.1 → kimi-k2.6:cloud
- evolutionary-sync.md: add kilo.jsonc + capability-index.yaml manual rules
- Add cloud suffix verification and per-file verification checklist
- Document finding: sync script reads kilo.jsonc but never writes back
2026-04-27 16:49:25 +01:00
¨NW¨
dbea8c90db
feat: evolutionary agent model upgrades based on recommendation matrix
...
- devops-engineer: deepseek-v3.2 → kimi-k2.6:cloud (★88)
- browser-automation: glm-5 → kimi-k2.6:cloud (★86)
- visual-tester: glm-5 → qwen3-coder:480b (★82)
- agent-architect: nemotron-3-super → kimi-k2.6:cloud (★86)
- orchestrator: glm-5 → kimi-k2.6:cloud (dispatch critical)
- product-owner: glm-5 → glm-5.1 (★84)
- prompt-optimizer: qwen3.6-plus:free → glm-5.1 (stable fallback)
- system-analyst: qwen3.6-plus:free → glm-5.1 (★90)
- Add autonomous-mode.md rule for zero-confirmation workflow
2026-04-27 12:09:36 +01:00
¨NW¨
af43eaef80
Merge remote-tracking branch 'origin/agent-sync-features'
2026-04-24 07:21:39 +01:00
¨NW¨
3127d82102
feat: sync agent evolution data and add self-diagnostic report
2026-04-23 07:58:44 +01:00
¨NW¨
6b71ea2b57
feat: add .architect/ project mapping system with architect-indexer agent and Docker containerization
...
- Add .architect/ directory structure (10 template files) as project brain for agent orientation
- Add architect-indexer agent that scans codebase and generates structured architecture docs
- Add Docker containerization: Dockerfile.architect-indexer, docker-compose.architect.yml
- Add TypeScript project-mapper module with staleness detection and context injection
- Add /index-project command, architect-first-contact rule, project-mapping skill
- Integrate orchestrator first-contact check: triggers indexing before any task delegation
- Add npm arch:* scripts for Docker-based indexing workflow
- Register agent in capability-index.yaml and AGENTS.md
2026-04-22 20:01:38 +01:00
¨NW¨
9d85dd9f83
merge: dev into main — centralized auth + trailing-slash fix + all recent features
...
- Security: extricate hardcoded Gitea credentials, add centralized auth module
- Fix: get_target_repo() regex now handles trailing slashes (.rstrip('/') in Python, sed 's:/*' in Bash)
- Fix: task-analysis broken functions (orphaned req references, stray parentheses)
- Documentation: README.md, STRUCTURE.md, AGENTS.md updated with auth section
- Evolution: Entry #5 documenting credentials extrication
2026-04-19 12:20:38 +01:00
¨NW¨
573d9a641e
fix(security): add rstrip('/') to get_target_repo for trailing-slash URLs
...
The regex r'[:/]([^/]+/[^/]+?)(?:\.git)?$' fails on URLs with trailing slashes
like 'https://git.softuniq.eu/UniqueSoft/APAW/ ' because the final '/' breaks
the pattern. Added .rstrip('/') in Python and sed 's:/*' in Bash to all
get_target_repo() implementations across 11 files.
2026-04-19 12:17:53 +01:00
¨NW¨
7523911812
fix(security): extricate hardcoded Gitea credentials, add centralized auth module
...
- Remove all hardcoded NW:eshkink0t credentials from 9 files across skills, commands, rules, and specs
- Add .kilo/shared/gitea-auth.md with get_gitea_token() and .kilo/gitea.jsonc config structure
- All Gitea API callers now use env vars (GITEA_TOKEN → GITEA_USER+GITEA_PASS → ValueError)
- Fix task-analysis/SKILL.md broken functions (orphaned req references, stray parentheses)
- Replace hardcoded UniqueSoft/APAW API URLs with get_target_repo() auto-detection in 3 files
- Update README.md, STRUCTURE.md, AGENTS.md with centralized auth documentation
- Add EVOLUTION_LOG Entry #5 documenting credentials extrication
2026-04-19 11:43:59 +01:00
¨NW¨
7445e66676
feat: add Next.js, Vue/Nuxt, React, Python (Django/FastAPI) skills and agents
...
- python-developer agent: Django/FastAPI backend specialist
- nextjs-patterns skill: App Router, Server Components, Server Actions, Auth.js
- vue-nuxt-patterns skill: Composition API, Pinia, Nitro server, SSR
- react-patterns skill: hooks, Context, TanStack Query, React Hook Form
- python-django-patterns skill: DRF, services, repositories
- python-fastapi-patterns skill: async, Pydantic, SQLAlchemy, dependencies
- /nextjs pipeline command for full-stack Next.js apps
- /vue pipeline command for full-stack Vue/Nuxt apps
- Updated frontend-developer with framework-specific skills
- Updated orchestrator, capability-index for Python + frontend routing
- Updated README, STRUCTURE, EVOLUTION_LOG with all new stacks
Total agents: 30. Stacks: PHP, Next.js, Vue/Nuxt, React, Python, Go, Flutter, Node.js
2026-04-19 10:04:51 +01:00
¨NW¨
b46a1a20a8
feat: add PHP development stack, atomic tasks, modular code rules, agent monitoring, fix target project detection
...
7 evolutionary tasks implemented:
1. PHP web development: php-developer agent + 6 skills (Laravel, Symfony, WordPress, security, testing, modular architecture) + 2 pipeline commands (/laravel, /wordpress)
2. Atomic task decomposition: 1 action = 1 task rule, task sizing guide, decomposition protocol for orchestrator, token budgets per complexity
3. Modular code rules: max 100 lines/file, max 30 lines/function, service/repository patterns, cross-module communication via events only
4. Gitea-centric workflow: mandatory issue creation before work, research with links, progress checkboxes, screenshots on test, git history as knowledge base
5. Fix: target project auto-detection — removed all hardcoded UniqueSoft/APAW from API calls, added get_target_repo() via git remote, GITEA_TARGET_REPO env override
6. Agent execution monitoring: agent-executions.jsonl logging, agent-stats.ts statistics script, required fields per invocation, Gitea comment includes duration/tokens
7. Token optimization: 1 action = 1 task principle, token budgets by task type, routing matrix, no scope creep, skip unnecessary pipeline steps
2026-04-18 23:43:04 +01:00
¨NW¨
28a3b648cc
refactor(prompts): compress 29 agents (-77%) and 7 rules (-55%), delete 2 duplicates
...
Agents: 6,235 → 1,454 lines (-77%). Each agent compressed to Role/Behavior/Delegates/Output/Handoff format.
Gitea commenting extracted to shared block (.kilo/shared/gitea-commenting.md).
Self-evolution protocol extracted to shared block (.kilo/shared/self-evolution.md).
Gitea API client centralized (.kilo/shared/gitea-api.md).
Rules: 2,358 → 1,189 lines (-50%). Deleted sdet-engineer.md (duplicate of agent)
and orchestrator-self-evolution.md (moved to shared/). Compressed docker (549→26),
flutter (521→28), go (283→21), nodejs (271→27), code-skeptic (59→14) to checklists
with skill references.
Fitness: 54/54 tests pass, 29/29 agents validated, fitness=0.92
2026-04-18 13:49:24 +01:00
¨NW¨
c416f53103
refactor: clean main to starter template — remove project-specific and generated files
...
- Remove project-specific commands: booking, blog, commerce, landing-page, feature, hotfix
- Remove project-specific skills: booking, blog, ecommerce
- Remove generated files: EVOLUTION_LOG, WORKFLOW_AUDIT, logs/, reports/
- Add .gitignore entries for auto-generated dirs (.kilo/logs/, .kilo/reports/)
- Remove e2e_booking_flow from capability-index.yaml
- Remove docker/evolution-test/ (dev infra, not starter)
- Genericize AGENTS.md project description
- Genericize tests/README.md title
All removed content preserved on dev branch.
2026-04-17 21:11:12 +01:00
¨NW¨
2573d81cff
refactor: remove CBS-specific e2e-booking flow — belongs to CBS project, not APAW starter
2026-04-17 20:21:29 +01:00
NW
c258d16ef5
feat: add Gitea integration, E2E booking flow, Docker DNS fix, browser-launcher module
...
- Add tests/scripts/lib/gitea-client.js: Gitea API client with auth, comments,
attachments, and Markdown report formatters for visual and console reports
- Add tests/scripts/lib/browser-launcher.js: shared Playwright launch config with
--dns-resolution-order=hostname-first, realistic UA, and navigateTo() helper
using waitUntil:'commit' + waitForLoadState('domcontentloaded')
- Add tests/scripts/e2e-booking-flow-v2.js: full E2E scenario for irina-vik.ru
(register → book service → login → personal cabinet) with Gitea reporting
- Update visual-test-pipeline.js: GITEA_ISSUE env var, Gitea comment+attachment
posting, browser-launcher integration, waitUntil:'commit' navigation
- Update console-error-monitor-standalone.js: same Gitea + DNS fixes
- Update capture-screenshots.js: browser-launcher integration, DNS fix
- Update docker-compose.web-testing.yml: NETWORK_MODE env var (bridge),
DNS_RESOLUTION_ORDER, GITEA_USER/PASSWORD env passthrough, e2e-booking service
- Update tests/package.json: pin playwright to exact 1.52.0 (matches Docker image)
- Update .gitignore: add tests/visual/e2e/ for E2E screenshots
- Update .kilo/agents/visual-tester.md: Docker networking note, Gitea scripts,
e2e-booking service, updated script table
- Update .kilo/commands/web-test.md: Docker Networking section, --issue flag,
Gitea Integration section, e2e-booking service
- Update .kilo/commands/e2e-test.md: complete rewrite — Docker-based Playwright,
no more MCP dependency, proper service table, Gitea integration docs
- Update .kilo/capability-index.yaml: add gitea_integration, e2e_booking_flow,
docker_networking capabilities to visual-tester; add routing entries
screenshots-bugfix
2026-04-17 09:27:27 +01:00
NW
3a8aa6b416
docs: update visual testing agent docs, remove test artifacts from git, add pipeline documentation
...
- Remove baseline screenshots from git tracking (test artifacts, not code)
- Add tests/visual/baseline/ to .gitignore
- Rewrite .kilo/agents/visual-tester.md: Docker-first pipeline, bbox extraction, console/network error detection
- Rewrite .kilo/commands/web-test.md: accurate commands, output format, agent flow
- Update .kilo/capability-index.yaml: add bbox_extraction, console_error_detection, button_overflow_detection to visual-tester
- Update AGENTS.md: add /web-test and /e2e-test commands, update visual-tester description
2026-04-16 22:48:46 +01:00
NW
c6b15e0bcd
feat: implement visual regression testing v2.0 — Playwright pipeline with bbox extraction
...
- Add visual-test-pipeline.js: captures screenshots, extracts UI elements with bounding boxes, compares via pixelmatch, reports console/network errors
- Add capture-screenshots.js: baseline/current screenshot capture at mobile/tablet/desktop viewports
- Add console-error-monitor-standalone.js: standalone console/network error detection without MCP dependency
- Rewrite docker-compose.web-testing.yml: real Playwright image, working services, proper volume mounts
- Update package.json: v2.0.0, add playwright dependency, clean scripts
- Update README.md: accurate Docker-first docs with usage examples
- Add .gitignore: exclude node_modules, current/diff screenshots, reports
- Include baseline screenshots for bbox.wtf homepage
2026-04-16 22:32:41 +01:00
NW
e19fa3effd
refactor: full agent system revision — migrate to GLM-5.1, fix delegation chains, audit consistency
...
- Migrate 8 agents from openrouter/qwen3.6-plus:free to ollama-cloud/glm-5.1
- Assign thinking/variant/instant depth by role complexity
- Fix broken delegation chains: system-analyst, all developer agents, devops-engineer now can reach orchestrator
- Add task permissions to browser-automation, visual-tester, capability-analyst, markdown-validator
- Add visual-tester permission to flutter-developer and frontend-developer
- Fix capability-index.yaml routing map indentation (go_* keys misplaced)
- Add delegates_to and variant fields to capability-index.yaml
- Update KILO_SPEC.md agent table with Variant column
- Update AGENTS.md with Model/Variant/CanCall columns
- Update kilo.jsonc ask agent model
- Fix YAML indentation in capability-analyst.md and markdown-validator.md
- Update agent-architect.md template models (remove gpt-oss, qwen3.6-plus)
- Add Skills Reference tables to 7 previously unlinked agents
- Full audit: 10/10 consistency checks passed
2026-04-12 22:38:41 +01:00
¨NW¨
1f4536ab93
Merge feature/web-testing-infrastructure into main
...
Add comprehensive web testing infrastructure:
- Visual regression testing with pixelmatch
- Link checking for 404/500 errors
- Console error detection with Gitea issues
- Form testing capabilities
- Docker-based Playwright MCP (no host pollution)
- /web-test and /web-test-fix commands
No database changes - safe to merge.
2026-04-07 08:56:37 +01:00
¨NW¨
e074612046
feat: add web testing infrastructure
...
- Docker configurations for Playwright MCP (no host pollution)
- Visual regression testing with pixelmatch
- Link checking for 404/500 errors
- Console error detection with Gitea issue creation
- Form testing capabilities
- /web-test and /web-test-fix commands
- web-testing skill documentation
- Reorganize project structure (docker/, scripts/, tests/)
- Update orchestrator model to ollama-cloud/glm-5
Structure:
- docker/ - Docker configurations (moved from archive)
- scripts/ - Utility scripts
- tests/ - Test suite with visual, console, links testing
- .kilo/commands/ - /web-test and /web-test-fix commands
- .kilo/skills/ - web-testing skill
Issues: #58 #60 #62
2026-04-07 08:55:24 +01:00
¨NW¨
b9abd91d07
feat: orchestrator evolution — full access + model upgrades + self-evolution protocol
...
- Add 9 missing agents to orchestrator task whitelist (20→28 agents)
- Fix 2 broken agents: debug (gpt-oss:20b→qwen3.6-plus), release-manager (devstral-2→qwen3.6-plus)
- Upgrade orchestrator (glm-5→qwen3.6-plus, IF:80→90, 128K→1M context)
- Upgrade pipeline-judge (nemotron→qwen3.6-plus, IF:85→90)
- Add orchestrator escalation path to 7 agents (lead-dev, sdet, skeptic, perf, security, evaluator, devops)
- Create self-evolution protocol (.kilo/rules/orchestrator-self-evolution.md)
- Create evolution log (.kilo/EVOLUTION_LOG.md)
- Full audit of all 29 agents with verification tests
2026-04-06 22:55:12 +01:00
¨NW¨
01ce40ae8a
restore: Docker evolution test files for remote usage
...
Docker files restored for use on other machines with Docker/WSL2.
Available test methods:
1. Docker (isolated environment):
docker-compose -f docker/evolution-test/docker-compose.yml up evolution-feature
2. Local (bun runtime):
docker/evolution-test/run-local-test.bat feature
./docker/evolution-test/run-local-test.sh feature
Both methods provide:
- Millisecond precision timing
- Fitness score with 2 decimal places
- JSONL logging to .kilo/logs/fitness-history.jsonl
2026-04-06 01:36:26 +01:00
¨NW¨
ae471dcd6b
docs: remove Docker references from pipeline-judge
...
Use local bun runtime only for evolution testing.
2026-04-06 01:35:29 +01:00
¨NW¨
b5c5f5ba82
chore: remove Docker test files - use local testing instead
...
Docker Desktop removed from system. Evolution testing uses local bun runtime.
Local testing approach:
- Uses bun runtime (already installed)
- Millisecond precision timing
- Fitness calculation with 2 decimal places
- Works without Docker/WSL2
Usage:
powershell: docker/evolution-test/run-local-test.bat feature
bash: ./docker/evolution-test/run-local-test.sh feature
Tests verified:
- 54/54 tests pass (100%)
- Time: 214.16ms precision
- Fitness: 1.00 (PASS)
2026-04-06 01:34:24 +01:00
¨NW¨
8e492ffa90
test: run evolution test with exact measurements
...
Results:
- Tests: 54/54 passed (100%)
- Time: 214.16ms (millisecond precision)
- Fitness: 1.00 (PASS)
Breakdown:
- Test pass rate: 100% (weight 50%, contribution 0.50)
- Quality gates: 5/5 (weight 25%, contribution 0.25)
- Efficiency: 0.9993 (weight 25%, contribution 0.25)
System verified:
- Bun runtime installed and working
- Fitness calculation precise to 2 decimals
- Logging to fitness-history.jsonl working
2026-04-06 01:08:54 +01:00
¨NW¨
0dbc15b602
feat: add local fallback scripts for evolution testing
...
- run-local-test.sh - Bash script for Linux/macOS
- run-local-test.bat - Batch script for Windows
- PowerShell timing with millisecond precision
- Fitness calculation with 2 decimal places
- Works without Docker (less precise environment)
- Logs to .kilo/logs/fitness-history.jsonl
Usage:
./docker/evolution-test/run-local-test.sh feature
docker\evolution-test\run-local-test.bat feature
Both scripts calculate:
- Test pass rate (2 decimals)
- Quality gates (5 gates)
- Efficiency score (time/normalized)
- Final fitness (weighted average)
2026-04-06 01:03:54 +01:00
¨NW¨
1703247651
feat: add Docker-based evolution testing with precise measurements
...
- Add docker/evolution-test/Dockerfile with bun, TypeScript
- Add docker/evolution-test/docker-compose.yml for parallel workflow testing
- Add run-evolution-test.sh and .bat scripts for cross-platform
- Update pipeline-judge.md with Docker-first approach:
- Millisecond precision timing (date +%s%3N)
- 2 decimal places for test pass rate and coverage
- Docker container for consistent test environment
- Multiple workflow types (feature/bugfix/refactor/security)
Enables:
- Parallel testing with docker-compose
- Consistent environment across machines
- Precise fitness measurements (ms, 2 decimals)
- Multi-workflow testing in containers
2026-04-06 00:48:21 +01:00
¨NW¨
fa68141d47
feat: add pipeline-judge agent and evolution workflow system
...
- Add pipeline-judge agent for objective fitness scoring
- Update capability-index.yaml with pipeline-judge, evolution config
- Add fitness-evaluation.md workflow for auto-optimization
- Update evolution.md command with /evolve CLI
- Create .kilo/logs/fitness-history.jsonl for metrics logging
- Update AGENTS.md with new workflow state machine
- Add 6 new issues to MILESTONE_ISSUES.md for evolution integration
- Preserve ideas in agent-evolution/ideas/
Pipeline Judge computes fitness = (test_rate*0.5) + (gates*0.25) + (efficiency*0.25)
Auto-triggers prompt-optimizer when fitness < 0.70
2026-04-06 00:23:50 +01:00
¨NW¨
1ab9939c92
fix: correct OpenRouter model paths across all files
...
Fixed format from 'qwen/...' to 'openrouter/qwen/...' for:
- product-owner.md
- prompt-optimizer.md
- workflow-architect.md
- status.md, blog.md, booking.md, commerce.md
- kilo.jsonc (default model + ask agent)
- agent-frontmatter-validation.md
- agent-versions.json (recommendations and history)
2026-04-05 23:47:14 +01:00
¨NW¨
6ba325cec5
fix: correct model path format for OpenRouter
...
Changed qwen/qwen3.6-plus:free to openrouter/qwen/qwen3.6-plus:free
for capability-analyst, agent-architect, and evaluator agents.
2026-04-05 23:42:32 +01:00
¨NW¨
a4e09ad5d5
feat: upgrade agent models based on research findings
...
- capability-analyst: nemotron-3-super → qwen3.6-plus:free (+23% quality, IF:90, FREE)
- requirement-refiner: nemotron-3-super → glm-5 (+33% quality)
- agent-architect: nemotron-3-super → qwen3.6-plus:free (+22% quality)
- evaluator: nemotron-3-super → qwen3.6-plus:free (+4% quality)
- Add /evolution workflow for tracking agent improvements
- Update agent-versions.json with evolution history
2026-04-05 23:37:23 +01:00
¨NW¨
fe28aa5922
chore: reorganize project structure and update README
...
- Move docker-compose.evolution.yml to agent-evolution/docker-compose.yml
- Update README with current agent lineup (28+ agents)
- Fix model references in README tables
- Add recent commits history
- Simplify architecture overview
2026-04-05 23:02:44 +01:00
¨NW¨
ff00b8e716
fix: sync agent models across config files
...
- Fix performance-engineer model: gpt-oss:120b -> nemotron-3-super
- Fix markdown-validator model: gemma4:26b -> nemotron-3-nano:30b
- Update KILO_SPEC.md documentation for SystemAnalyst, RequirementRefiner, FrontendDeveloper
- Revert kilo.jsonc to minimal config (primary agents only)
- Keep subagent definitions in .md files and capability-index.yaml
2026-04-05 20:51:09 +01:00
¨NW¨
4af7355429
feat: update agent models based on research recommendations
...
- requirement-refiner: kimi-k2-thinking -> nemotron-3-super (1M context for specs)
- history-miner: glm-5 -> nemotron-3-super (better git search, 1M context)
- capability-analyst: gpt-oss:120b -> nemotron-3-super (gap analysis improvement)
- agent-architect: gpt-oss:120b -> nemotron-3-super (agent design, 1M context)
- prompt-optimizer: gpt-oss:120b -> qwen3.6-plus:free (FREE on OpenRouter)
- product-owner: glm-5 -> qwen3.6-plus:free (FREE on OpenRouter, 1M context)
- evaluator: gpt-oss:120b -> nemotron-3-super (quality scoring)
- markdown-validator: nemotron-3-nano:30b -> gemma4:26b (better validation)
- debug (kilo.jsonc): gpt-oss:20b -> gemma4:31b (Intelligence Index 39)
- devops-engineer: NEW -> nemotron-3-super (Docker, K8s, CI/CD)
- flutter-developer: NEW -> qwen3-coder:480b (Dart/Flutter support)
Synced all agent models between capability-index.yaml and agent/*.md files.
Validated YAML and JSON5 configs.
2026-04-05 20:28:47 +01:00
¨NW¨
15a7b4b7a4
feat: add Agent Evolution Dashboard
...
- Create agent-evolution/ directory with standalone dashboard
- Add interactive HTML dashboard with agent/model matrix
- Add heatmap view for agent-model compatibility scores
- Add recommendations tab with optimization suggestions
- Add Gitea integration preparation (history timeline)
- Add Docker configuration for deployment
- Add build scripts for standalone HTML generation
- Add sync scripts for agent data synchronization
- Add milestone and issues documentation
- Add skills and rules for evolution sync
- Update AGENTS.md with dashboard documentation
- Update package.json with evolution scripts
Features:
- 28 agents with model assignments and fit scores
- 8 models with benchmarks (SWE-bench, RULER, Terminal)
- 11 recommendations for model optimization
- History timeline with agent changes
- Interactive modal windows for model details
- Filter and search functionality
- Russian language interface
- Works offline (file://) with embedded data
Docker:
- Dockerfile for standalone deployment
- docker-compose.evolution.yml
- docker-run.sh/docker-run.bat scripts
NPM scripts:
- sync:evolution - sync and build dashboard
- evolution:open - open in browser
- evolution:dashboard - start dev server
Status: PAUSED - foundation complete, Gitea integration pending
2026-04-05 19:58:59 +01:00
¨NW¨
b899119d21
feat: add html-to-flutter skill and research report
...
- Add .kilo/skills/html-to-flutter/SKILL.md
- HTML parsing patterns with html package
- CSS to Flutter style mapping
- Widget tree generation from HTML templates
- flutter_html integration (608k downloads, 2.1k likes)
- Design-time code generation patterns
- Responsive layout conversion (flexbox/grid → Row/Column)
- Form, Card, Navigation conversion examples
- Update flutter-developer agent
- Reference html-to-flutter skill
- Add HTML template conversion workflow
- Integration with flutter_html package
- Add research report .kilo/reports/flutter-cycle-analysis.md
- Gap analysis: HTML→Flutter conversion (critical)
- Testing gap analysis
- Network/API gap analysis
- Storage gap analysis
- Implementation priority and recommendations
- Complete workflow for HTML Template + ТЗ → Flutter App
Research sources:
- flutter_html 3.0.0 (2.1k likes, 608k downloads)
- go_router 17.2.0 (5.6k likes, 2.31M downloads)
- flutter_riverpod 3.3.1 (2.8k likes, 1.61M downloads)
- freezed 3.2.5 (4.4k likes, 1.83M downloads)
Closes: HTML template input workflow for Flutter development
2026-04-05 17:26:02 +01:00
¨NW¨
af5f401a53
feat: add Flutter development support with agent, rules and skills
...
- Add flutter-developer agent (.kilo/agents/flutter-developer.md)
- Role definition for cross-platform mobile development
- Clean architecture templates (Domain/Presentation/Data)
- State management patterns (Riverpod, Bloc, Provider)
- Widget patterns, navigation, platform channels
- Build & release commands
- Performance and security checklists
- Add Flutter development rules (.kilo/rules/flutter.md)
- Code style guidelines (const, final, trailing commas)
- Widget architecture best practices
- State management requirements
- Error handling, API & network patterns
- Navigation, testing, performance
- Security and localization
- Prohibitions list
- Add Flutter skills:
- flutter-state: Riverpod, Bloc, Provider patterns
- flutter-widgets: Widget composition, responsive design
- flutter-navigation: go_router, deep links, guards
- Update AGENTS.md: add @flutter-developer to Core Development
- Update kilo.jsonc: configure flutter-developer and go-developer agents
2026-04-05 17:04:13 +01:00