UniqueSoft/APAW - APAW - Gitea: Git with a cup of tea

Author	SHA1	Message	Date
NW	f5966db155	feat(gns2): integrate HybridGiteaClient into PollingSupervisor - PollingSupervisor now uses HybridGiteaClient (MCP primary, REST fallback) - Added mcpUrl to PipelineConfig - Supervisor calls initialize() to detect MCP vs REST mode automatically Refs: Milestone #67, Issue #107	2026-05-08 22:35:21 +01:00
NW	06fb0421ef	fix(process-continuity): operator-free design for MCP Docker integration - Resolve service_healthy deadlock by using service_started instead - Fix 172.28.0.0/16 network collision by removing ipam config - Add HybridGiteaClient (mcp → rest → bash fallback) - Create .kilo/rules/process-continuity.md with 5 operator-free principles: 1. No service_healthy conditions 2. No hardcoded networks 3. Automatic fallback chains 4. Pre-flight validation 5. Self-documenting failures - Update docker-compose.yml with resilient config: - start_period: 60s, retries: 5, restart: on-failure:3 - /tools healthcheck (guaranteed endpoint) - tmpfs for Node.js /tmp - Resource limits: 256M RAM, 0.5 CPU - MCP/REST integration test passed (issue #109) Refs: Milestone #67, Issues #107, #109	2026-05-08 22:31:59 +01:00
NW	3cc6ee2ffe	feat(gns2): Phase 8 MCP Docker containers for Gitea direct integration - docker/mcp-gitea/docker-compose.yml — MCP server container (Sqcoows/forgejo-mcp) - .kilo/skills/mcp-gitea-connection/SKILL.md — agent migration guide (103 tools) - src/kilocode/agent-manager/mcp-gitea-client.ts — MCP native client with fallback - Hybrid mode: MCP primary, REST API fallback if container unavailable - All 29 Tier 0/1 agents mass-updated with GNS-2 protocol (checkpoint read, event footer) - Security: no bash for Gitea ops, MCP handles credentials internally Refs: Milestone #67, Issue #107	2026-05-08 22:16:52 +01:00
NW	bd154f24d0	feat(gns2): mass-update all 30 agents with GNS-2 protocol - 29 agents updated with GNS-2 checkpoint/event protocol - 12 Tier 0 (leaf) agents: read checkpoint, write event footer, no cascade - 17 Tier 1 (task) agents: read checkpoint, recommend next agent, no direct task calls - 2 Tier 2 (meta) agents already updated: capability-analyst, agent-architect, evaluator - All agents now include GNS_EVENT footer template in comments - Frontmatter updated with '(GNS-2 Tier N)' classification Scripts added: - scripts/mass-update-gns-agents.py — idempotent mass updater - scripts/validate-gns-agents.py — protocol checker Refs: Milestone #67, Issues #99-#107	2026-05-08 22:03:08 +01:00
NW	47b027a02f	feat(gns2): Gitea-Nervous-System v2.0 - distributed agent state machine - Add GNS-2 label taxonomy (66 labels) with semantic routing - Tier 2 agents (capability-analyst, agent-architect, evaluator) enabled for self-cascade - GNS agent protocol: checkpoint v2 in issue body, machine-readable event footers - GiteaClient extended: checkpoint CRUD, event parsing, assignee/lock control, triggered issue polling - PipelineRunner rewritten as PollingSupervisor: reactive instead of active dispatch - Security: circuit breakers (is_locked), budget governance, depth limits - Scripts: init-gns-labels.py, validate-gns-agents.py - Milestone #67 + 7 phase issues (#99-#105) tracking evolution Refs: Milestone #67, Issues #99-#105	2026-05-08 21:25:38 +01:00
NW	f01e2064fb	feat(evolution): Kilo Code release sync & APAW system hardening (v2026-05-07) Security & Permissions: - All 30 agents: task[*]=deny, task[subagent]=deny (cascade prevention) - orchestrator & release-manager: bash=ask (hardening) - New .kilo/rules/subagent-security.md with audit rules - Updated .kilo/rules/global.md with Security & Permissions section - Updated .kilo/agents/orchestrator.md with Security Enforcement block Session Management: - New .kilo/rules/session-persistence.md (checkpoint format, worktree isolation) - Updated .kilo/rules/branch-strategy.md (worktree per agent) - pipeline-runner.ts: Checkpoint interface + save/load/resume methods Plan Persistence: - Updated .kilo/rules/lead-developer.md (plan handover section) Per-Agent Reasoning: - capability-index.yaml: reasoning_effort for all 30 agents (xhigh/high/medium/low) MCP Cleanup: - New .kilo/skills/docker-security/SKILL.md (--rm, orphaned process cleanup) Config Validation: - Updated .kilo/rules/docker.md (startup checks, commit scoping, location awareness) Docs: - README.md: v2026-05-07 evolution badges - .kilo/EVOLUTION_LOG.md: Entry #6 with full metrics - .gitignore: ignore dist/ + bun.lock Gitea: Milestone #66, Issues #91-#98 Architect: 9/9 sections fresh (express project type)	2026-05-08 18:54:08 +01:00
NW	74ad7c4b6e	docs(branch-strategy): default branch is dev, not main - Update branch strategy: dev is primary development branch - main is stable release only - Add release process: dev → PR → review → main → tag - Sync .kilo/ to target projects after release	2026-05-07 07:39:00 +01:00
NW	994ca58821	fix(agents): add missing permissions + complete kilo-meta.json - Fix 12 agents missing edit/write/bash permissions - Add 5 missing agents to kilo-meta.json (architect-indexer, flutter-developer, php-developer, pipeline-judge, python-developer) - Remove BOM from kilo.jsonc - All 32 agents now consistent between files and meta	2026-05-07 07:22:32 +01:00
NW	defe57d53a	feat: merge infrastructure skills and workflows from TenerifeProp Add MCP-based infrastructure skills: - mcp-integration: Playwright + GitMCP - e2e-testing: Cypress + AntV + Slack - search-integration: Brave + Tavily + Markitdown - security-scanner: CVE Search + MCP Validator - knowledge-base: Docfork + Wikipedia + ArXiv - prompt-manager: version control + DevTrends - api-catalog: MCP server registry - agent-architect-mcp: patterns + OpenAPI converter Add workflow commands: - feature.md: full feature pipeline - hotfix.md: urgent bug fix workflow Add rules: - orchestrator-self-evolution.md - sdet-engineer.md Add audit: - WORKFLOW_AUDIT.md Source: UniqueSoft/TenerifeProp	2026-05-06 23:04:14 +01:00
¨NW¨	80dca09ae0	fix: unquoted color, duplicate key, GLM downgrade + cross-platform validator - Fix security-auditor.md color bare hex to quoted - Fix orchestrator.md duplicate devops-engineer key - Fix .kilo/kilo.jsonc: orchestrator + root model to kimi-k2.6:cloud - Update agent-frontmatter-validation.md with diagnostic guide - Update global.md with YAML frontmatter rules for all agents - Update agent-architect.md + workflow-architect.md with color checklist - Add scripts/validate-agents.cjs: zero-dependency, cross-platform, --fix flag, scans worktrees	2026-05-04 22:01:45 +01:00
¨NW¨	fb552e0020	feat: v3 optimal model assignments + fitness gate - Update 30 agents to v3 heatmap maximum-score models: * go-dev: qwen3-coder -> deepseek-v4-pro-max (85->88 +3) * planner: nemotron -> deepseek-v4-pro-max (80->88 +8) * perf-engineer: nemotron -> deepseek-v4-pro-max (78->84 +6) * reflector: nemotron -> deepseek-v4-pro-max (78->84 +6) * security: nemotron -> deepseek-v4-pro-max (76->80 +4) * memory-manager: nemotron -> qwen3.6-plus (86->87 +1) * frontend: kimi-k2.5 -> minimax-m2.5 (92) * the-fixer: minimax-m2.5 -> kimi-k2.6 (88->90 +2) * browser-auto: kimi-k2.6 -> qwen3-coder (86->87 +1) * prompt-opt: glm-5.1 -> qwen3.6-plus (82->83 +1) * backend: deepseek-v3.2 -> qwen3-coder (91) * capability-analyst: nemotron -> glm-5.1 (85) * release-man: devstral-2 -> glm-5.1 (82) * evaluator: nemotron -> glm-5.1 (86) * workflow-arch: gpt-oss -> glm-5.1 (84) - Add Model Evolution Guard: * fitness-gate.cjs: rejects downgrades >3 points or <75 score * Normalized model ID lookup (: vs -) * Diff report before any file modifications - Update sync-benchmarks-from-yaml.cjs with fitness gate - Sync kilo-meta.json, kilo.jsonc, .md agent files - Rebuild research-dashboard.html (104KB, 30 agents, 11 models) Total improvement: +105 points across 11 agents Source: v3.html heatmap IF-adjusted composite scores	2026-04-30 08:42:10 +01:00
¨NW¨	9e48a4960e	fix: restore optimal v3 models + add fitness gate protection - Restore all 30 agents to v3.html heatmap optimal models: * frontend-developer: qwen3-coder -> minimax-m2.5 (92★) * devops-engineer: nemotron-3-super -> kimi-k2.6:cloud (88★) * browser-automation: qwen3-coder -> kimi-k2.6:cloud (86★) * agent-architect: glm-5.1 -> kimi-k2.6:cloud (86★) - Add Model Evolution Guard system: * agent-evolution/scripts/lib/fitness-gate.cjs * Rejects downgrades >3 points or below score 75 * Produces detailed diff report before any file modifications * Normalized model ID lookup (v3.html ':' vs JSON '-') - Update sync-benchmarks-from-yaml.cjs with fitness gate - Update model-benchmarks.json with v3 optimal assignments - Rebuild research-dashboard.html (104KB, 30 agents, 11 models) - Add model-evolution-guard.md architecture documentation - Add v3-optimal-models.json as source-of-truth reference Fixes regression introduced by commit `3badb25` where models were silently downgraded from heatmap optimal to inferior assignments.	2026-04-29 23:19:16 +01:00
¨NW¨	d1516f4856	chore: organize temporary research artifacts into archive - Create agent-evolution/archive/ with scripts/, reports/, data/ - Move 11 Python migration/diagnostic scripts - Move 7 intermediate report files (json, md, txt) - Move test data and old dashboard builds - Add archive/README.md with full index of contents - Update .gitignore to exclude archive/scripts, reports, data - Keep archive/README.md tracked for documentation	2026-04-29 21:14:23 +01:00
¨NW¨	3badb259cc	feat: bidirectional research dashboard + agent config fixes - Integrate apaw_agent_model_research_v3.html as standalone dashboard - Add model-benchmarks.json with 32 agents, 11 scored models, 11 recommendations - Add build-research-dashboard.ts: inject live data into template → standalone HTML - Add rebuild-template.cjs: regenerate template from v3.html source - Add sync-benchmarks-from-yaml.cjs: sync YAML → JSON round-trip - Add sync-model-research.ts: apply recommendation matrix to config files - Add model-benchmarks.schema.json and model-research.schema.json for validation - Add bidirectional-data-flow.md architecture documentation - Add log-execution.cjs pipeline hook - Update capability-index.yaml: add fallback_models, failover_strategy - Update kilo-meta.json, kilo.jsonc, KILO_SPEC.md with synced models - Update evolution.md / research.md / self-evolution.md / evolutionary-sync.md docs - Fix security-auditor.md: quote YAML color (#DC2626) - Fix orchestrator.md: remove duplicate devops-engineer key - Build research-dashboard.html (106KB standalone) + dated archive	2026-04-29 21:04:22 +01:00
¨NW¨	2ae7789802	fix: sync kilo.jsonc + capability-index.yaml after evolution upgrade - kilo.jsonc: manual fix 7 agent models (sync script does not write back) - capability-index.yaml: orchestrator model glm-5.1 → kimi-k2.6:cloud - evolutionary-sync.md: add kilo.jsonc + capability-index.yaml manual rules - Add cloud suffix verification and per-file verification checklist - Document finding: sync script reads kilo.jsonc but never writes back	2026-04-27 16:49:25 +01:00
¨NW¨	dbea8c90db	feat: evolutionary agent model upgrades based on recommendation matrix - devops-engineer: deepseek-v3.2 → kimi-k2.6:cloud (★88) - browser-automation: glm-5 → kimi-k2.6:cloud (★86) - visual-tester: glm-5 → qwen3-coder:480b (★82) - agent-architect: nemotron-3-super → kimi-k2.6:cloud (★86) - orchestrator: glm-5 → kimi-k2.6:cloud (dispatch critical) - product-owner: glm-5 → glm-5.1 (★84) - prompt-optimizer: qwen3.6-plus:free → glm-5.1 (stable fallback) - system-analyst: qwen3.6-plus:free → glm-5.1 (★90) - Add autonomous-mode.md rule for zero-confirmation workflow	2026-04-27 12:09:36 +01:00
¨NW¨	af43eaef80	Merge remote-tracking branch 'origin/agent-sync-features'	2026-04-24 07:21:39 +01:00
¨NW¨	3127d82102	feat: sync agent evolution data and add self-diagnostic report	2026-04-23 07:58:44 +01:00
¨NW¨	6b71ea2b57	feat: add .architect/ project mapping system with architect-indexer agent and Docker containerization - Add .architect/ directory structure (10 template files) as project brain for agent orientation - Add architect-indexer agent that scans codebase and generates structured architecture docs - Add Docker containerization: Dockerfile.architect-indexer, docker-compose.architect.yml - Add TypeScript project-mapper module with staleness detection and context injection - Add /index-project command, architect-first-contact rule, project-mapping skill - Integrate orchestrator first-contact check: triggers indexing before any task delegation - Add npm arch:* scripts for Docker-based indexing workflow - Register agent in capability-index.yaml and AGENTS.md	2026-04-22 20:01:38 +01:00
¨NW¨	9d85dd9f83	merge: dev into main — centralized auth + trailing-slash fix + all recent features - Security: extricate hardcoded Gitea credentials, add centralized auth module - Fix: get_target_repo() regex now handles trailing slashes (.rstrip('/') in Python, sed 's:/*' in Bash) - Fix: task-analysis broken functions (orphaned req references, stray parentheses) - Documentation: README.md, STRUCTURE.md, AGENTS.md updated with auth section - Evolution: Entry #5 documenting credentials extrication	2026-04-19 12:20:38 +01:00
¨NW¨	573d9a641e	fix(security): add rstrip('/') to get_target_repo for trailing-slash URLs The regex r'[:/]([^/]+/[^/]+?)(?:\.git)?$' fails on URLs with trailing slashes like 'https://git.softuniq.eu/UniqueSoft/APAW/' because the final '/' breaks the pattern. Added .rstrip('/') in Python and sed 's:/*' in Bash to all get_target_repo() implementations across 11 files.	2026-04-19 12:17:53 +01:00
¨NW¨	7523911812	fix(security): extricate hardcoded Gitea credentials, add centralized auth module - Remove all hardcoded NW:eshkink0t credentials from 9 files across skills, commands, rules, and specs - Add .kilo/shared/gitea-auth.md with get_gitea_token() and .kilo/gitea.jsonc config structure - All Gitea API callers now use env vars (GITEA_TOKEN → GITEA_USER+GITEA_PASS → ValueError) - Fix task-analysis/SKILL.md broken functions (orphaned req references, stray parentheses) - Replace hardcoded UniqueSoft/APAW API URLs with get_target_repo() auto-detection in 3 files - Update README.md, STRUCTURE.md, AGENTS.md with centralized auth documentation - Add EVOLUTION_LOG Entry #5 documenting credentials extrication	2026-04-19 11:43:59 +01:00
¨NW¨	7445e66676	feat: add Next.js, Vue/Nuxt, React, Python (Django/FastAPI) skills and agents - python-developer agent: Django/FastAPI backend specialist - nextjs-patterns skill: App Router, Server Components, Server Actions, Auth.js - vue-nuxt-patterns skill: Composition API, Pinia, Nitro server, SSR - react-patterns skill: hooks, Context, TanStack Query, React Hook Form - python-django-patterns skill: DRF, services, repositories - python-fastapi-patterns skill: async, Pydantic, SQLAlchemy, dependencies - /nextjs pipeline command for full-stack Next.js apps - /vue pipeline command for full-stack Vue/Nuxt apps - Updated frontend-developer with framework-specific skills - Updated orchestrator, capability-index for Python + frontend routing - Updated README, STRUCTURE, EVOLUTION_LOG with all new stacks Total agents: 30. Stacks: PHP, Next.js, Vue/Nuxt, React, Python, Go, Flutter, Node.js	2026-04-19 10:04:51 +01:00
¨NW¨	b46a1a20a8	feat: add PHP development stack, atomic tasks, modular code rules, agent monitoring, fix target project detection 7 evolutionary tasks implemented: 1. PHP web development: php-developer agent + 6 skills (Laravel, Symfony, WordPress, security, testing, modular architecture) + 2 pipeline commands (/laravel, /wordpress) 2. Atomic task decomposition: 1 action = 1 task rule, task sizing guide, decomposition protocol for orchestrator, token budgets per complexity 3. Modular code rules: max 100 lines/file, max 30 lines/function, service/repository patterns, cross-module communication via events only 4. Gitea-centric workflow: mandatory issue creation before work, research with links, progress checkboxes, screenshots on test, git history as knowledge base 5. Fix: target project auto-detection — removed all hardcoded UniqueSoft/APAW from API calls, added get_target_repo() via git remote, GITEA_TARGET_REPO env override 6. Agent execution monitoring: agent-executions.jsonl logging, agent-stats.ts statistics script, required fields per invocation, Gitea comment includes duration/tokens 7. Token optimization: 1 action = 1 task principle, token budgets by task type, routing matrix, no scope creep, skip unnecessary pipeline steps	2026-04-18 23:43:04 +01:00
¨NW¨	28a3b648cc	refactor(prompts): compress 29 agents (-77%) and 7 rules (-55%), delete 2 duplicates Agents: 6,235 → 1,454 lines (-77%). Each agent compressed to Role/Behavior/Delegates/Output/Handoff format. Gitea commenting extracted to shared block (.kilo/shared/gitea-commenting.md). Self-evolution protocol extracted to shared block (.kilo/shared/self-evolution.md). Gitea API client centralized (.kilo/shared/gitea-api.md). Rules: 2,358 → 1,189 lines (-50%). Deleted sdet-engineer.md (duplicate of agent) and orchestrator-self-evolution.md (moved to shared/). Compressed docker (549→26), flutter (521→28), go (283→21), nodejs (271→27), code-skeptic (59→14) to checklists with skill references. Fitness: 54/54 tests pass, 29/29 agents validated, fitness=0.92	2026-04-18 13:49:24 +01:00
¨NW¨	c416f53103	refactor: clean main to starter template — remove project-specific and generated files - Remove project-specific commands: booking, blog, commerce, landing-page, feature, hotfix - Remove project-specific skills: booking, blog, ecommerce - Remove generated files: EVOLUTION_LOG, WORKFLOW_AUDIT, logs/, reports/ - Add .gitignore entries for auto-generated dirs (.kilo/logs/, .kilo/reports/) - Remove e2e_booking_flow from capability-index.yaml - Remove docker/evolution-test/ (dev infra, not starter) - Genericize AGENTS.md project description - Genericize tests/README.md title All removed content preserved on dev branch.	2026-04-17 21:11:12 +01:00
¨NW¨	2573d81cff	refactor: remove CBS-specific e2e-booking flow — belongs to CBS project, not APAW starter	2026-04-17 20:21:29 +01:00
NW	c258d16ef5	feat: add Gitea integration, E2E booking flow, Docker DNS fix, browser-launcher module - Add tests/scripts/lib/gitea-client.js: Gitea API client with auth, comments, attachments, and Markdown report formatters for visual and console reports - Add tests/scripts/lib/browser-launcher.js: shared Playwright launch config with --dns-resolution-order=hostname-first, realistic UA, and navigateTo() helper using waitUntil:'commit' + waitForLoadState('domcontentloaded') - Add tests/scripts/e2e-booking-flow-v2.js: full E2E scenario for irina-vik.ru (register → book service → login → personal cabinet) with Gitea reporting - Update visual-test-pipeline.js: GITEA_ISSUE env var, Gitea comment+attachment posting, browser-launcher integration, waitUntil:'commit' navigation - Update console-error-monitor-standalone.js: same Gitea + DNS fixes - Update capture-screenshots.js: browser-launcher integration, DNS fix - Update docker-compose.web-testing.yml: NETWORK_MODE env var (bridge), DNS_RESOLUTION_ORDER, GITEA_USER/PASSWORD env passthrough, e2e-booking service - Update tests/package.json: pin playwright to exact 1.52.0 (matches Docker image) - Update .gitignore: add tests/visual/e2e/ for E2E screenshots - Update .kilo/agents/visual-tester.md: Docker networking note, Gitea scripts, e2e-booking service, updated script table - Update .kilo/commands/web-test.md: Docker Networking section, --issue flag, Gitea Integration section, e2e-booking service - Update .kilo/commands/e2e-test.md: complete rewrite — Docker-based Playwright, no more MCP dependency, proper service table, Gitea integration docs - Update .kilo/capability-index.yaml: add gitea_integration, e2e_booking_flow, docker_networking capabilities to visual-tester; add routing entries screenshots-bugfix	2026-04-17 09:27:27 +01:00
NW	3a8aa6b416	docs: update visual testing agent docs, remove test artifacts from git, add pipeline documentation - Remove baseline screenshots from git tracking (test artifacts, not code) - Add tests/visual/baseline/ to .gitignore - Rewrite .kilo/agents/visual-tester.md: Docker-first pipeline, bbox extraction, console/network error detection - Rewrite .kilo/commands/web-test.md: accurate commands, output format, agent flow - Update .kilo/capability-index.yaml: add bbox_extraction, console_error_detection, button_overflow_detection to visual-tester - Update AGENTS.md: add /web-test and /e2e-test commands, update visual-tester description	2026-04-16 22:48:46 +01:00
NW	c6b15e0bcd	feat: implement visual regression testing v2.0 — Playwright pipeline with bbox extraction - Add visual-test-pipeline.js: captures screenshots, extracts UI elements with bounding boxes, compares via pixelmatch, reports console/network errors - Add capture-screenshots.js: baseline/current screenshot capture at mobile/tablet/desktop viewports - Add console-error-monitor-standalone.js: standalone console/network error detection without MCP dependency - Rewrite docker-compose.web-testing.yml: real Playwright image, working services, proper volume mounts - Update package.json: v2.0.0, add playwright dependency, clean scripts - Update README.md: accurate Docker-first docs with usage examples - Add .gitignore: exclude node_modules, current/diff screenshots, reports - Include baseline screenshots for bbox.wtf homepage	2026-04-16 22:32:41 +01:00
NW	e19fa3effd	refactor: full agent system revision — migrate to GLM-5.1, fix delegation chains, audit consistency - Migrate 8 agents from openrouter/qwen3.6-plus:free to ollama-cloud/glm-5.1 - Assign thinking/variant/instant depth by role complexity - Fix broken delegation chains: system-analyst, all developer agents, devops-engineer now can reach orchestrator - Add task permissions to browser-automation, visual-tester, capability-analyst, markdown-validator - Add visual-tester permission to flutter-developer and frontend-developer - Fix capability-index.yaml routing map indentation (go_* keys misplaced) - Add delegates_to and variant fields to capability-index.yaml - Update KILO_SPEC.md agent table with Variant column - Update AGENTS.md with Model/Variant/CanCall columns - Update kilo.jsonc ask agent model - Fix YAML indentation in capability-analyst.md and markdown-validator.md - Update agent-architect.md template models (remove gpt-oss, qwen3.6-plus) - Add Skills Reference tables to 7 previously unlinked agents - Full audit: 10/10 consistency checks passed	2026-04-12 22:38:41 +01:00
¨NW¨	1f4536ab93	Merge feature/web-testing-infrastructure into main Add comprehensive web testing infrastructure: - Visual regression testing with pixelmatch - Link checking for 404/500 errors - Console error detection with Gitea issues - Form testing capabilities - Docker-based Playwright MCP (no host pollution) - /web-test and /web-test-fix commands No database changes - safe to merge.	2026-04-07 08:56:37 +01:00
¨NW¨	e074612046	feat: add web testing infrastructure - Docker configurations for Playwright MCP (no host pollution) - Visual regression testing with pixelmatch - Link checking for 404/500 errors - Console error detection with Gitea issue creation - Form testing capabilities - /web-test and /web-test-fix commands - web-testing skill documentation - Reorganize project structure (docker/, scripts/, tests/) - Update orchestrator model to ollama-cloud/glm-5 Structure: - docker/ - Docker configurations (moved from archive) - scripts/ - Utility scripts - tests/ - Test suite with visual, console, links testing - .kilo/commands/ - /web-test and /web-test-fix commands - .kilo/skills/ - web-testing skill Issues: #58 #60 #62	2026-04-07 08:55:24 +01:00
¨NW¨	b9abd91d07	feat: orchestrator evolution — full access + model upgrades + self-evolution protocol - Add 9 missing agents to orchestrator task whitelist (20→28 agents) - Fix 2 broken agents: debug (gpt-oss:20b→qwen3.6-plus), release-manager (devstral-2→qwen3.6-plus) - Upgrade orchestrator (glm-5→qwen3.6-plus, IF:80→90, 128K→1M context) - Upgrade pipeline-judge (nemotron→qwen3.6-plus, IF:85→90) - Add orchestrator escalation path to 7 agents (lead-dev, sdet, skeptic, perf, security, evaluator, devops) - Create self-evolution protocol (.kilo/rules/orchestrator-self-evolution.md) - Create evolution log (.kilo/EVOLUTION_LOG.md) - Full audit of all 29 agents with verification tests	2026-04-06 22:55:12 +01:00
¨NW¨	01ce40ae8a	restore: Docker evolution test files for remote usage Docker files restored for use on other machines with Docker/WSL2. Available test methods: 1. Docker (isolated environment): docker-compose -f docker/evolution-test/docker-compose.yml up evolution-feature 2. Local (bun runtime): docker/evolution-test/run-local-test.bat feature ./docker/evolution-test/run-local-test.sh feature Both methods provide: - Millisecond precision timing - Fitness score with 2 decimal places - JSONL logging to .kilo/logs/fitness-history.jsonl	2026-04-06 01:36:26 +01:00
¨NW¨	ae471dcd6b	docs: remove Docker references from pipeline-judge Use local bun runtime only for evolution testing.	2026-04-06 01:35:29 +01:00
¨NW¨	b5c5f5ba82	chore: remove Docker test files - use local testing instead Docker Desktop removed from system. Evolution testing uses local bun runtime. Local testing approach: - Uses bun runtime (already installed) - Millisecond precision timing - Fitness calculation with 2 decimal places - Works without Docker/WSL2 Usage: powershell: docker/evolution-test/run-local-test.bat feature bash: ./docker/evolution-test/run-local-test.sh feature Tests verified: - 54/54 tests pass (100%) - Time: 214.16ms precision - Fitness: 1.00 (PASS)	2026-04-06 01:34:24 +01:00
¨NW¨	8e492ffa90	test: run evolution test with exact measurements Results: - Tests: 54/54 passed (100%) - Time: 214.16ms (millisecond precision) - Fitness: 1.00 (PASS) Breakdown: - Test pass rate: 100% (weight 50%, contribution 0.50) - Quality gates: 5/5 (weight 25%, contribution 0.25) - Efficiency: 0.9993 (weight 25%, contribution 0.25) System verified: - Bun runtime installed and working - Fitness calculation precise to 2 decimals - Logging to fitness-history.jsonl working	2026-04-06 01:08:54 +01:00
¨NW¨	0dbc15b602	feat: add local fallback scripts for evolution testing - run-local-test.sh - Bash script for Linux/macOS - run-local-test.bat - Batch script for Windows - PowerShell timing with millisecond precision - Fitness calculation with 2 decimal places - Works without Docker (less precise environment) - Logs to .kilo/logs/fitness-history.jsonl Usage: ./docker/evolution-test/run-local-test.sh feature docker\evolution-test\run-local-test.bat feature Both scripts calculate: - Test pass rate (2 decimals) - Quality gates (5 gates) - Efficiency score (time/normalized) - Final fitness (weighted average)	2026-04-06 01:03:54 +01:00
¨NW¨	1703247651	feat: add Docker-based evolution testing with precise measurements - Add docker/evolution-test/Dockerfile with bun, TypeScript - Add docker/evolution-test/docker-compose.yml for parallel workflow testing - Add run-evolution-test.sh and .bat scripts for cross-platform - Update pipeline-judge.md with Docker-first approach: - Millisecond precision timing (date +%s%3N) - 2 decimal places for test pass rate and coverage - Docker container for consistent test environment - Multiple workflow types (feature/bugfix/refactor/security) Enables: - Parallel testing with docker-compose - Consistent environment across machines - Precise fitness measurements (ms, 2 decimals) - Multi-workflow testing in containers	2026-04-06 00:48:21 +01:00
¨NW¨	fa68141d47	feat: add pipeline-judge agent and evolution workflow system - Add pipeline-judge agent for objective fitness scoring - Update capability-index.yaml with pipeline-judge, evolution config - Add fitness-evaluation.md workflow for auto-optimization - Update evolution.md command with /evolve CLI - Create .kilo/logs/fitness-history.jsonl for metrics logging - Update AGENTS.md with new workflow state machine - Add 6 new issues to MILESTONE_ISSUES.md for evolution integration - Preserve ideas in agent-evolution/ideas/ Pipeline Judge computes fitness = (test_rate0.5) + (gates0.25) + (efficiency*0.25) Auto-triggers prompt-optimizer when fitness < 0.70	2026-04-06 00:23:50 +01:00
¨NW¨	1ab9939c92	fix: correct OpenRouter model paths across all files Fixed format from 'qwen/...' to 'openrouter/qwen/...' for: - product-owner.md - prompt-optimizer.md - workflow-architect.md - status.md, blog.md, booking.md, commerce.md - kilo.jsonc (default model + ask agent) - agent-frontmatter-validation.md - agent-versions.json (recommendations and history)	2026-04-05 23:47:14 +01:00
¨NW¨	6ba325cec5	fix: correct model path format for OpenRouter Changed qwen/qwen3.6-plus:free to openrouter/qwen/qwen3.6-plus:free for capability-analyst, agent-architect, and evaluator agents.	2026-04-05 23:42:32 +01:00
¨NW¨	a4e09ad5d5	feat: upgrade agent models based on research findings - capability-analyst: nemotron-3-super → qwen3.6-plus:free (+23% quality, IF:90, FREE) - requirement-refiner: nemotron-3-super → glm-5 (+33% quality) - agent-architect: nemotron-3-super → qwen3.6-plus:free (+22% quality) - evaluator: nemotron-3-super → qwen3.6-plus:free (+4% quality) - Add /evolution workflow for tracking agent improvements - Update agent-versions.json with evolution history	2026-04-05 23:37:23 +01:00
¨NW¨	fe28aa5922	chore: reorganize project structure and update README - Move docker-compose.evolution.yml to agent-evolution/docker-compose.yml - Update README with current agent lineup (28+ agents) - Fix model references in README tables - Add recent commits history - Simplify architecture overview	2026-04-05 23:02:44 +01:00
¨NW¨	ff00b8e716	fix: sync agent models across config files - Fix performance-engineer model: gpt-oss:120b -> nemotron-3-super - Fix markdown-validator model: gemma4:26b -> nemotron-3-nano:30b - Update KILO_SPEC.md documentation for SystemAnalyst, RequirementRefiner, FrontendDeveloper - Revert kilo.jsonc to minimal config (primary agents only) - Keep subagent definitions in .md files and capability-index.yaml	2026-04-05 20:51:09 +01:00
¨NW¨	4af7355429	feat: update agent models based on research recommendations - requirement-refiner: kimi-k2-thinking -> nemotron-3-super (1M context for specs) - history-miner: glm-5 -> nemotron-3-super (better git search, 1M context) - capability-analyst: gpt-oss:120b -> nemotron-3-super (gap analysis improvement) - agent-architect: gpt-oss:120b -> nemotron-3-super (agent design, 1M context) - prompt-optimizer: gpt-oss:120b -> qwen3.6-plus:free (FREE on OpenRouter) - product-owner: glm-5 -> qwen3.6-plus:free (FREE on OpenRouter, 1M context) - evaluator: gpt-oss:120b -> nemotron-3-super (quality scoring) - markdown-validator: nemotron-3-nano:30b -> gemma4:26b (better validation) - debug (kilo.jsonc): gpt-oss:20b -> gemma4:31b (Intelligence Index 39) - devops-engineer: NEW -> nemotron-3-super (Docker, K8s, CI/CD) - flutter-developer: NEW -> qwen3-coder:480b (Dart/Flutter support) Synced all agent models between capability-index.yaml and agent/*.md files. Validated YAML and JSON5 configs.	2026-04-05 20:28:47 +01:00
¨NW¨	15a7b4b7a4	feat: add Agent Evolution Dashboard - Create agent-evolution/ directory with standalone dashboard - Add interactive HTML dashboard with agent/model matrix - Add heatmap view for agent-model compatibility scores - Add recommendations tab with optimization suggestions - Add Gitea integration preparation (history timeline) - Add Docker configuration for deployment - Add build scripts for standalone HTML generation - Add sync scripts for agent data synchronization - Add milestone and issues documentation - Add skills and rules for evolution sync - Update AGENTS.md with dashboard documentation - Update package.json with evolution scripts Features: - 28 agents with model assignments and fit scores - 8 models with benchmarks (SWE-bench, RULER, Terminal) - 11 recommendations for model optimization - History timeline with agent changes - Interactive modal windows for model details - Filter and search functionality - Russian language interface - Works offline (file://) with embedded data Docker: - Dockerfile for standalone deployment - docker-compose.evolution.yml - docker-run.sh/docker-run.bat scripts NPM scripts: - sync:evolution - sync and build dashboard - evolution:open - open in browser - evolution:dashboard - start dev server Status: PAUSED - foundation complete, Gitea integration pending	2026-04-05 19:58:59 +01:00
¨NW¨	b899119d21	feat: add html-to-flutter skill and research report - Add .kilo/skills/html-to-flutter/SKILL.md - HTML parsing patterns with html package - CSS to Flutter style mapping - Widget tree generation from HTML templates - flutter_html integration (608k downloads, 2.1k likes) - Design-time code generation patterns - Responsive layout conversion (flexbox/grid → Row/Column) - Form, Card, Navigation conversion examples - Update flutter-developer agent - Reference html-to-flutter skill - Add HTML template conversion workflow - Integration with flutter_html package - Add research report .kilo/reports/flutter-cycle-analysis.md - Gap analysis: HTML→Flutter conversion (critical) - Testing gap analysis - Network/API gap analysis - Storage gap analysis - Implementation priority and recommendations - Complete workflow for HTML Template + ТЗ → Flutter App Research sources: - flutter_html 3.0.0 (2.1k likes, 608k downloads) - go_router 17.2.0 (5.6k likes, 2.31M downloads) - flutter_riverpod 3.3.1 (2.8k likes, 1.61M downloads) - freezed 3.2.5 (4.4k likes, 1.83M downloads) Closes: HTML template input workflow for Flutter development	2026-04-05 17:26:02 +01:00
¨NW¨	af5f401a53	feat: add Flutter development support with agent, rules and skills - Add flutter-developer agent (.kilo/agents/flutter-developer.md) - Role definition for cross-platform mobile development - Clean architecture templates (Domain/Presentation/Data) - State management patterns (Riverpod, Bloc, Provider) - Widget patterns, navigation, platform channels - Build & release commands - Performance and security checklists - Add Flutter development rules (.kilo/rules/flutter.md) - Code style guidelines (const, final, trailing commas) - Widget architecture best practices - State management requirements - Error handling, API & network patterns - Navigation, testing, performance - Security and localization - Prohibitions list - Add Flutter skills: - flutter-state: Riverpod, Bloc, Provider patterns - flutter-widgets: Widget composition, responsive design - flutter-navigation: go_router, deep links, guards - Update AGENTS.md: add @flutter-developer to Core Development - Update kilo.jsonc: configure flutter-developer and go-developer agents	2026-04-05 17:04:13 +01:00

1 2 3

120 Commits