Files
APAW/.kilo/commands/evaluate.md
swp 35bbdcb08f feat: integrate agent-manager module with KiloCode workflows
- Move agent-manager from packages/opencode/ to src/kilocode/
- Add Gitea client for pipeline logging
- Add pipeline-runner for workflow orchestration
- Create slash commands: /pipeline, /status, /evaluate
- Update AGENTS.md with workflow documentation
- Update README.md with KiloCode integration details
- Add package.json and tsconfig.json for TypeScript compilation
- Remove duplicate files from packages/opencode/

This enables:
- /pipeline <issue> - run full agent pipeline with Gitea logging
- Direct agent invocation via @mention
- Performance tracking and prompt optimization
2026-04-04 01:11:06 +01:00

1.1 KiB

description, mode, model, color
description mode model color
Evaluate agent performance for completed issue subagent ollama-cloud/gpt-oss:120b #F59E0B

Evaluate Command

Generate performance evaluation report for a completed pipeline run.

Usage

/evaluate <issue-number>

Process

  1. Fetch issue comments
  2. Parse agent execution logs
  3. Calculate scores per agent
  4. Generate recommendations
  5. Post evaluation to Gitea

Scoring Criteria

Criterion Weight
Code Quality 30%
Test Coverage 20%
Review Iterations 20%
Time to Complete 15%
Security Issues 15%

Output Format

## 🟢 Pipeline Evaluation Report

**Issue**: #42
**Overall Score**: 8.2/10
**Duration**: 2.5h
**Iterations**: 2

### Agent Scores

| Agent | Score | Notes |
|-------|-------|-------|
| 🟢 requirement-refiner | 9/10 | Clear acceptance criteria |
| 🟢 lead-developer | 8/10 | Clean implementation |
| 🟡 code-skeptic | 7/10 | Found 2 minor issues |
| 🟢 the-fixer | 9/10 | Fixed issues quickly |

### Recommendations

- Consider optimizing code-skeptic prompt (score < 8)