Kilo Code Agents Reference
This file configures AI agent behavior for the project - a self-improving code pipeline with Gitea logging.
Pipeline Workflow
The main workflow is /pipeline - use it to process issues through all agents automatically.
Commands (Slash Commands)
| Command |
Description |
Usage |
/pipeline <issue> |
Run full agent pipeline for issue |
/pipeline 42 |
/nextjs |
Next.js 14+ full-stack app pipeline |
/nextjs my-app |
/vue |
Vue/Nuxt 3 full-stack app pipeline |
/vue my-app |
/laravel |
Laravel full-stack app pipeline |
/laravel my-app |
/wordpress |
WordPress plugin/site pipeline |
/wordpress my-plugin |
/feature |
Feature development pipeline |
/feature |
/commerce |
E-commerce site pipeline |
/commerce |
/status <issue> |
Check pipeline status for issue |
/status 42 |
/evolve |
Run evolution cycle with fitness scoring |
/evolve --issue 42 |
/evaluate <issue> |
Generate performance report |
/evaluate 42 |
/plan |
Creates detailed task plans |
/plan feature X |
/ask |
Answers codebase questions |
/ask how does auth work |
/debug |
Analyzes and fixes bugs |
/debug error in login |
/code |
Quick code generation |
/code add validation |
/research [topic] |
Run research and self-improvement |
/research multi-agent |
/evolution log |
Log agent model change |
/evolution log planner "reason" |
/evolution report |
Generate evolution report |
/evolution report |
/index-project |
Index codebase into .architect/ for agent orientation |
/index-project |
/web-test <url> |
Visual regression testing in Docker |
/web-test https://bbox.wtf |
/e2e-test <url> |
E2E browser automation tests |
/e2e-test https://my-app.com |
Pipeline Agents (Subagents)
These agents are invoked automatically by /pipeline or manually via @mention:
Core Development
| Agent |
Role |
When Invoked |
@RequirementRefiner |
Converts vague ideas and bug reports into strict User Stories with acceptance criteria checklists |
Issue status: new |
@HistoryMiner |
Analyzes git history to find duplicates and past solutions, preventing regression and duplicate work |
Status: planned |
@SystemAnalyst |
Designs technical specifications, data schemas, and API contracts before implementation |
Status: researching |
@SdetEngineer |
Writes tests following TDD methodology |
Status: designed |
@LeadDeveloper |
Primary code writer for backend and core logic |
Status: testing |
@FrontendDeveloper |
Handles UI implementation with multimodal capabilities |
When UI work needed |
@BackendDeveloper |
Backend specialist for Node |
When backend needed |
@GoDeveloper |
Go backend specialist for Gin, Echo, APIs, and database integration |
When Go backend needed |
@DevopsEngineer |
DevOps specialist for Docker, Kubernetes, CI/CD pipeline automation, and infrastructure management |
When deployment/infra needed |
Quality Assurance
| Agent |
Role |
When Invoked |
@CodeSkeptic |
Adversarial code reviewer |
Status: implementing |
@TheFixer |
Iteratively fixes bugs based on specific error reports and test failures |
When review fails |
@PerformanceEngineer |
Reviews code for performance issues |
After code-skeptic |
@SecurityAuditor |
Scans for security vulnerabilities, OWASP Top 10, dependency CVEs, and hardcoded secrets |
After performance |
@VisualTester |
Visual regression testing agent that compares screenshots and detects UI differences using pixelmatch and image diff |
When UI changes |
DevOps & Infrastructure
| Agent |
Role |
When Invoked |
@devops-engineer |
Docker/Swarm/K8s deployment |
When deployment needed |
@security-auditor |
Container security scan |
After deployment config |
Cognitive Enhancement
| Agent |
Role |
When Invoked |
@Planner |
Advanced task planner using Chain of Thought, Tree of Thoughts, and Plan-Execute-Reflect |
Complex tasks |
@Reflector |
Self-reflection agent using Reflexion pattern - learns from mistakes |
After each agent |
@MemoryManager |
Manages agent memory systems - short-term (context), long-term (vector store), and episodic (experiences) |
Context management |
Meta & Process
| Agent |
Role |
When Invoked |
@Orchestrator |
Main dispatcher |
Manages all agent routing |
@ReleaseManager |
Manages git operations, semantic versioning, branching, and deployments |
Status: releasing |
@Evaluator |
Scores agent effectiveness after task completion for continuous improvement |
Status: evaluated |
@PromptOptimizer |
Improves agent system prompts based on performance failures |
When score < 7 |
@ProductOwner |
Manages issue checklists, status labels, tracks progress and coordinates with human users |
Manages issues |
@AgentArchitect |
Creates, modifies, and reviews new agents, workflows, and skills based on capability gap analysis |
When gaps identified |
@CapabilityAnalyst |
Analyzes task requirements against available agents, workflows, and skills |
When starting new task |
@WorkflowArchitect |
Creates and maintains workflow definitions with complete architecture, Gitea integration, and quality gates |
New workflow needed |
@MarkdownValidator |
Validates and corrects Markdown descriptions for Gitea issues |
Before issue creation |
Status Labels
Pipeline uses Gitea labels to track progress:
status: new → status: planned → status: researching → ...
- Agents add/remove labels automatically
Performance Logging
Each agent logs to Gitea issue comments:
Efficiency Tracking
Scores saved to .kilo/logs/efficiency_score.json:
Fitness Tracking
Fitness scores saved to .kilo/logs/fitness-history.jsonl:
Manual Agent Invocation
Or via @mention:
Environment Variables
Gitea integration uses centralized authentication (see .kilo/shared/gitea-auth.md and .kilo/gitea.jsonc):
| Variable |
Required |
Description |
GITEA_API_URL |
No |
API base URL (default: https://git.softuniq.eu/api/v1) |
GITEA_TOKEN |
Preferred |
Pre-existing API token |
GITEA_USER |
Fallback |
Username for Basic Auth token creation |
GITEA_PASS |
Fallback |
Password for Basic Auth token creation |
GITEA_TARGET_REPO |
No |
Override target project (auto-detected otherwise) |
Auth resolution: GITEA_TOKEN → GITEA_USER+GITEA_PASS → ValueError. NEVER hardcode credentials.
Self-Improvement Cycle
- Pipeline runs for each issue
- Evaluator scores each agent (1-10) - subjective
- Pipeline Judge measures fitness objectively (0.0-1.0)
- Low fitness (<0.70) triggers prompt-optimizer
- Prompt optimizer analyzes failures and improves prompts
- Re-run workflow with improved prompts
- Compare fitness before/after - commit if improved
- Log results to
.kilo/logs/fitness-history.jsonl
Evaluator vs Pipeline Judge
| Aspect |
Evaluator |
Pipeline Judge |
| Type |
Subjective |
Objective |
| Score |
1-10 (opinion) |
0.0-1.0 (metrics) |
| Metrics |
Observations |
Tests, tokens, time |
| Trigger |
After workflow |
After evaluator |
| Action |
Logs to Gitea |
Triggers optimization |
Fitness Score Components
Architecture Files
| File |
Purpose |
AGENTS.md |
This file - main config |
.kilo/agents/*.md |
Agent definitions with prompts |
.kilo/commands/*.md |
Workflow commands |
.kilo/rules/*.md |
Custom rules loaded globally |
.kilo/skills/ |
Skill modules |
.kilo/shared/gitea-auth.md |
Centralized Gitea auth (env vars, no hardcoded creds) |
.kilo/gitea.jsonc |
Gitea auth structure (env var mapping) |
.kilo/shared/gitea-api.md |
Centralized Gitea API client |
.kilo/shared/gitea-commenting.md |
Comment format for Gitea |
.kilo/shared/self-evolution.md |
Self-evolution protocol |
.kilo/rules/architect-first-contact.md |
First-contact project indexing rules |
.kilo/skills/project-mapping/SKILL.md |
Project mapping skill (.architect/ system) |
.architect/ |
Project codebase map (auto-indexed, see below) |
src/kilocode/ |
TypeScript API for programmatic use |
Skills Reference
Containerization Skills
| Skill |
Purpose |
Location |
docker-compose |
Multi-container orchestration |
.kilo/skills/docker-compose/ |
docker-swarm |
Production cluster deployment |
.kilo/skills/docker-swarm/ |
docker-security |
Container security hardening |
.kilo/skills/docker-security/ |
docker-monitoring |
Container monitoring/logging |
.kilo/skills/docker-monitoring/ |
Node.js Skills
| Skill |
Purpose |
Location |
nodejs-express-patterns |
Express routing, middleware |
.kilo/skills/nodejs-express-patterns/ |
nodejs-auth-jwt |
JWT authentication |
.kilo/skills/nodejs-auth-jwt/ |
nodejs-security-owasp |
OWASP security |
.kilo/skills/nodejs-security-owasp/ |
Database Skills
| Skill |
Purpose |
Location |
postgresql-patterns |
PostgreSQL patterns |
.kilo/skills/postgresql-patterns/ |
sqlite-patterns |
SQLite patterns |
.kilo/skills/sqlite-patterns/ |
clickhouse-patterns |
ClickHouse patterns |
.kilo/skills/clickhouse-patterns/ |
Go Skills
| Skill |
Purpose |
Location |
go-modules |
Go modules management |
.kilo/skills/go-modules/ |
go-concurrency |
Goroutines and channels |
.kilo/skills/go-concurrency/ |
go-testing |
Go testing patterns |
.kilo/skills/go-testing/ |
go-security |
Go security patterns |
.kilo/skills/go-security/ |
Process Skills
| Skill |
Purpose |
Location |
planning-patterns |
CoT/ToT planning |
.kilo/skills/planning-patterns/ |
memory-systems |
Memory management |
.kilo/skills/memory-systems/ |
tool-use |
Tool usage patterns |
.kilo/skills/tool-use/ |
research-cycle |
Self-improvement cycle |
.kilo/skills/research-cycle/ |
Using the TypeScript API
Agent Evolution Dashboard
Track agent model changes, performance, and recommendations in real-time.
Access
Dashboard Tabs
| Tab |
Description |
| Overview |
Stats, recent changes, pending recommendations |
| All Agents |
Filterable agent cards with history |
| Timeline |
Full evolution history |
| Recommendations |
Priority-based model suggestions |
| Model Matrix |
Agent × Model mapping with fit scores |
Data Sources
| Source |
What it tracks |
.kilo/agents/*.md |
Model, description, capabilities |
.kilo/kilo.jsonc |
Model assignments |
.kilo/capability-index.yaml |
Capability routing |
| Git History |
Model and prompt changes |
| Gitea Comments |
Performance scores |
Evolution Data Structure
Recommendations Priority
| Priority |
When |
Example |
| Critical |
Fit score < 70 |
Immediate model change required |
| High |
Model unavailable |
Switch to fallback |
| Medium |
Better model available |
Consider upgrade |
| Low |
Optimization possible |
Optional improvement |
Agent Execution Monitoring
Every agent invocation is logged to .kilo/logs/agent-executions.jsonl for project-level monitoring.
Log Format
Monitoring Commands
Required Logging Fields
| Field |
Description |
agent |
Agent name |
issue |
Gitea issue number |
project |
Target project repo (NOT hardcoded APAW) |
task |
Atomic task description |
duration_ms |
Execution time |
tokens_used |
Token estimate |
status |
success/fail/pass/blocked |
Critical Rules
Target Project (NOT APAW)
Issues MUST be created in the target project repository, NOT in APAW. APAW is the agent framework, not the default project.
Atomic Tasks (1 action = 1 task)
Every agent invocation solves exactly ONE atomic task:
- ❌ "Implement the entire e-commerce backend"
- ✅ "Create Product model with migration"
- ✅ "Add POST /api/products endpoint"
Modular Code
- Maximum 100 lines per file
- Maximum 30 lines per function
- Features organized as independent modules
- Cross-module communication via events/interfaces only
Token Budgets
| Task Size |
Max Tokens |
Example |
| Tiny |
2,000 |
Fix typo, add config |
| Small |
5,000 |
Create model + migration |
| Medium |
10,000 |
Create API endpoint + test |
| Large |
20,000 |
Create service with 3 methods |
Code Style
- Use TypeScript for new files
- Follow existing patterns
- Write tests before code (TDD)
- Keep functions under 50 lines
- Use early returns
- No comments unless explicitly requested