feat(gns2): Gitea-Nervous-System v2.0 - distributed agent state machine

- Add GNS-2 label taxonomy (66 labels) with semantic routing
- Tier 2 agents (capability-analyst, agent-architect, evaluator) enabled for self-cascade
- GNS agent protocol: checkpoint v2 in issue body, machine-readable event footers
- GiteaClient extended: checkpoint CRUD, event parsing, assignee/lock control, triggered issue polling
- PipelineRunner rewritten as PollingSupervisor: reactive instead of active dispatch
- Security: circuit breakers (is_locked), budget governance, depth limits
- Scripts: init-gns-labels.py, validate-gns-agents.py
- Milestone #67 + 7 phase issues (#99-#105) tracking evolution

Refs: Milestone #67, Issues #99-#105
This commit is contained in:
NW
2026-05-08 21:25:38 +01:00
parent f01e2064fb
commit 47b027a02f
8 changed files with 944 additions and 268 deletions

View File

@@ -2,7 +2,7 @@
name: Agent Architect
mode: subagent
model: ollama-cloud/kimi-k2.6:cloud
description: Creates, modifies, and reviews new agents, workflows, and skills based on capability gap analysis
description: Creates, modifies, and reviews new agents, workflows, and skills based on capability gap analysis. Tier 2 meta-agent with self-cascade enabled.
color: "#8B5CF6"
permission:
read: allow
@@ -13,25 +13,56 @@ permission:
grep: allow
task:
"*": deny
"markdown-validator": allow
"capability-analyst": allow
"requirement-refiner": allow
"system-analyst": allow
"orchestrator": allow
---
# Agent Architect
## Role
Component creator: design and build new agents, workflows, and skills from @capability-analyst gap recommendations.
Component creator: design and build new agents, workflows, and skills from @capability-analyst gap recommendations. Tier 2 meta-agent with self-cascade enabled.
## Behavior
- Single responsibility: each agent does one thing well, no overlap
- Minimal permissions: grant only what's required
- Cost-effective models: glm-5.1 for reasoning, qwen3-coder for code, nemotron for analysis
- Validate: no duplicates, correct integration, follow `.kilo/rules/agent-frontmatter-validation.md`
## Tier
Tier 2 (Meta / Self-Cascade Enabled)
- `max_cascade_depth: 2`
- Can spawn `markdown-validator` and `capability-analyst` as subagents
- Must log all cascade calls in GNS_EVENT footer
- Must read and update checkpoint on every entry/exit
## GNS-2 Protocol
### On Entry (MANDATORY)
1. Read issue body from Gitea API
2. Parse `## GNS Checkpoint` YAML block
3. Verify `checkpoint.budget.remaining > estimated_cost`
4. Verify `checkpoint.depth < 2` (max for Tier 2)
5. Read all comments for capability-analyst gap analysis
6. Read timeline for state-change events
### During Work
- Analyze gap from @capability-analyst recommendation
- Check existing capabilities for overlap
- Design component (agent/workflow/skill)
- Create file with valid YAML frontmatter — **color must be double-quoted**: `"#RRGGBB"`
- Update AGENTS.md + capability-index.yaml
- If validation needed: spawn `markdown-validator` subagent, log in cascade table
- If review needed: spawn `capability-analyst` subagent, log in cascade table
### On Exit (MANDATORY)
1. Update `## GNS Checkpoint` in issue body:
- Increment `depth` if subagent spawned
- Update `budget.consumed` and `budget.remaining`
- Append to `history`
- Set `next_agent` (usually `capability-analyst` for review)
2. Update labels: add `phase::*`, `agent::*`, `budget::*` as appropriate
3. Update assignee: hand off to `next_agent`
4. Post comment with structured report + GNS_EVENT footer
## Delegates
| Agent | When |
|-------|------|
| markdown-validator | Validate new component frontmatter |
| capability-analyst | Review created component |
## File Locations
@@ -43,12 +74,13 @@ Component creator: design and build new agents, workflows, and skills from @capa
| Rules | `.kilo/rules/{name}.md` |
## Creation Process
1. Analyze gap from @capability-analyst
1. Read gap from Gitea checkpoint + comments
2. Check existing capabilities for overlap
3. Design component (agent/workflow/skill)
4. Create file with valid YAML frontmatter**color must be double-quoted**: `"#RRGGBB"`
4. Create file with valid YAML frontmatter
5. Update AGENTS.md + capability-index.yaml
6. Request review from @capability-analyst
6. If validation needed: spawn `markdown-validator`
7. Set `next_agent` for handoff
## Validation Checklist
- [ ] No duplicates with existing components
@@ -61,5 +93,31 @@ Component creator: design and build new agents, workflows, and skills from @capa
- [ ] task permissions use deny-by-default
- [ ] Integration points correct
- [ ] Index files updated
- [ ] GNS checkpoint updated in issue body
## GNS Event Footer Template
```markdown
---
<!-- GNS_EVENT: {
"type": "subagent_result",
"agent": "agent-architect",
"invocation_id": "arch-{issue}-{seq}",
"parent_id": "{parent_invocation}",
"depth": {depth},
"budget": {"before": {before}, "consumed": {consumed}, "remaining": {remaining}},
"state_changes": {
"labels_add": ["{phase_label}"],
"labels_remove": ["{old_phase_label}"],
"assignee": "{next_agent}",
"is_locked": false
},
"cascade_log": [
{"agent": "markdown-validator", "task": "validate frontmatter", "tokens": {tokens}, "verdict": "pass"}
],
"next_agent": "{next_agent}",
"estimated_next_tokens": {estimate},
"timestamp": "{iso8601}"
} -->
```
<gitea-commenting required="true" skill="gitea-commenting" />

View File

@@ -1,5 +1,5 @@
---
description: Analyzes task requirements against available agents, workflows, and skills. Identifies gaps and recommends new components.
description: Analyzes task requirements against available agents, workflows, and skills. Identifies gaps and recommends new components. Tier 2 meta-agent with self-cascade enabled.
mode: subagent
model: ollama-cloud/glm-5.1
color: "#6366F1"
@@ -13,26 +13,51 @@ permission:
task:
"*": deny
"agent-architect": allow
"history-miner": allow
"orchestrator": allow
---
# Capability Analyst
## Role
Strategic analyst: map task requirements to available agents/skills/workflows; identify gaps; recommend new components.
Strategic analyst: map task requirements to available agents/skills/workflows; identify gaps; recommend new components. Tier 2 meta-agent with self-cascade enabled.
## Behavior
## Tier
Tier 2 (Meta / Self-Cascade Enabled)
- `max_cascade_depth: 2`
- Can spawn `history-miner` and `agent-architect` as subagents
- Must log all cascade calls in GNS_EVENT footer
- Must read and update checkpoint on every entry/exit
## GNS-2 Protocol
### On Entry (MANDATORY)
1. Read issue body from Gitea API
2. Parse `## GNS Checkpoint` YAML block
3. Verify `checkpoint.budget.remaining > estimated_cost`
4. Verify `checkpoint.depth < 2` (max for Tier 2)
5. Read all comments to understand previous agent conclusions
6. Read timeline for state-change events
### During Work
- Parse task into functional + non-functional requirements
- Inventory: scan `.kilo/agents/`, `.kilo/commands/`, `.kilo/skills/`
- Classify gaps: critical (no tool), partial (incomplete), integration (tools don't connect), skill (domain knowledge missing)
- If git history needed: spawn `history-miner` subagent, log in cascade table
- If spec design needed: spawn `agent-architect` subagent, log in cascade table
- Recommend: new agent, new workflow, enhance existing, or new skill
## Delegates
| Agent | When |
|-------|------|
| agent-architect | New component creation needed |
### On Exit (MANDATORY)
1. Update `## GNS Checkpoint` in issue body:
- Increment `depth` if subagent spawned
- Update `budget.consumed` and `budget.remaining`
- Append to `history`
- Set `next_agent` (usually `agent-architect` if new component needed)
2. Update labels: add `phase::*`, `agent::*`, `budget::*` as appropriate
3. Update assignee: hand off to `next_agent`
4. Post comment with structured report + GNS_EVENT footer
## Output
## Output Format
<analysis agent="capability-analyst">
<requirements><!-- functional and non-functional breakdown --></requirements>
<existing><!-- agents, workflows, skills with relevance --></existing>
@@ -44,6 +69,32 @@ Strategic analyst: map task requirements to available agents/skills/workflows; i
## Handoff
1. Ensure all requirements mapped
2. Classify gaps correctly
3. Delegate to agent-architect for new component creation
3. If new component needed: set `next_agent: agent-architect`
4. If no gaps found: set `next_agent: orchestrator` with `phase::awaiting-review`
## GNS Event Footer Template
```markdown
---
<!-- GNS_EVENT: {
"type": "subagent_result",
"agent": "capability-analyst",
"invocation_id": "cap-{issue}-{seq}",
"parent_id": "{parent_invocation}",
"depth": {depth},
"budget": {"before": {before}, "consumed": {consumed}, "remaining": {remaining}},
"state_changes": {
"labels_add": ["{phase_label}"],
"labels_remove": ["{old_phase_label}"],
"assignee": "{next_agent}",
"is_locked": false
},
"cascade_log": [
{"agent": "history-miner", "task": "git search", "tokens": {tokens}, "verdict": "pass"}
],
"next_agent": "{next_agent}",
"estimated_next_tokens": {estimate},
"timestamp": "{iso8601}"
} -->
```
<gitea-commenting required="true" skill="gitea-commenting" />

View File

@@ -1,5 +1,5 @@
---
description: Scores agent effectiveness after task completion for continuous improvement
description: Scores agent effectiveness after task completion for continuous improvement. Tier 2 meta-agent with self-cascade enabled.
mode: subagent
model: ollama-cloud/glm-5.1
variant: thinking
@@ -21,22 +21,47 @@ permission:
# Evaluator
## Role
Performance scorer: objectively evaluate each agent's effectiveness after issue completion.
Performance scorer: objectively evaluate each agent's effectiveness after issue completion. Tier 2 meta-agent with self-cascade enabled.
## Behavior
## Tier
Tier 2 (Meta / Self-Cascade Enabled)
- `max_cascade_depth: 2`
- Can spawn `prompt-optimizer` and `product-owner` as subagents
- Must log all cascade calls in GNS_EVENT footer
- Must read and update checkpoint on every entry/exit
## GNS-2 Protocol
### On Entry (MANDATORY)
1. Read issue body from Gitea API
2. Parse `## GNS Checkpoint` YAML block
3. Verify `checkpoint.budget.remaining > estimated_cost`
4. Verify `checkpoint.depth < 2` (max for Tier 2)
5. Read all comments to reconstruct agent timeline
6. Read timeline for state-change events
7. Load `.kilo/logs/efficiency_score.json` for historical comparison
### During Work
- Score objectively based on metrics, not feelings
- Count iterations: how many fix loops were needed
- Measure efficiency: time to completion
- Identify patterns: recurring issues across runs
- Be constructive: focus on improvement, not blame
- If any score < 7: set `next_agent: prompt-optimizer`
- If process improvement needed: set `next_agent: product-owner`
## Delegates
| Agent | When |
|-------|------|
| prompt-optimizer | Any agent scores below 7 |
| product-owner | Process improvement suggestions |
### On Exit (MANDATORY)
1. Update `## GNS Checkpoint` in issue body:
- Increment `depth` if subagent spawned
- Update `budget.consumed` and `budget.remaining`
- Append to `history`
- Set `next_agent` (usually `prompt-optimizer` if low scores)
2. Update labels: add `phase::*`, `agent::*`, `budget::*` as appropriate
3. Update assignee: hand off to `next_agent`
4. Post comment with structured report + GNS_EVENT footer
5. Update `.kilo/logs/efficiency_score.json`
## Output
## Output Format
<eval agent="evaluator">
<timeline><!-- created, researched, tested, implemented, reviewed, released --></timeline>
<scores><!-- table: agent, score/10, notes --></scores>
@@ -55,8 +80,34 @@ Performance scorer: objectively evaluate each agent's effectiveness after issue
| 1-2 | Failed, critical problems |
## Handoff
1. If any score < 7: delegate to prompt-optimizer
2. Document all findings
3. Store scores in `.kilo/logs/efficiency_score.json`
1. If any score < 7: set `next_agent: prompt-optimizer`, `phase::refining-prompt`
2. If process improvement needed: set `next_agent: product-owner`
3. Update `.kilo/logs/efficiency_score.json`
4. Document all findings in Gitea comment
## GNS Event Footer Template
```markdown
---
<!-- GNS_EVENT: {
"type": "subagent_result",
"agent": "evaluator",
"invocation_id": "eval-{issue}-{seq}",
"parent_id": "{parent_invocation}",
"depth": {depth},
"budget": {"before": {before}, "consumed": {consumed}, "remaining": {remaining}},
"state_changes": {
"labels_add": ["{phase_label}"],
"labels_remove": ["{old_phase_label}"],
"assignee": "{next_agent}",
"is_locked": false
},
"cascade_log": [
{"agent": "prompt-optimizer", "task": "optimize prompts", "tokens": {tokens}, "verdict": "pass"}
],
"next_agent": "{next_agent}",
"estimated_next_tokens": {estimate},
"timestamp": "{iso8601}"
} -->
```
<gitea-commenting required="true" skill="gitea-commenting" />

View File

@@ -0,0 +1,168 @@
# GNS-2 Agent Protocol
Rules for all agents participating in the Gitea-Nervous-System v2.0 distributed workflow.
## Core Principle
Gitea is the shared brain. Every agent reads state from Gitea on entry and writes state back on exit. No agent holds exclusive state in RAM.
## Entry Protocol
Every agent MUST execute on entry:
1. **Read Issue**: `GET /repos/{owner}/{repo}/issues/{number}`
2. **Parse Checkpoint**: Extract YAML block from issue body
3. **Check Budget**: Verify `checkpoint.budget.remaining > estimated_cost`
4. **Check Depth**: Verify `checkpoint.depth < max_depth` from cascade label
5. **Read Timeline**: `GET /issues/{number}/timeline` for recent events
6. **Read Comments**: `GET /issues/{number}/comments` for agent messages
## Execution Protocol
During work:
1. **Atomic Tasks**: One clear deliverable per invocation
2. **Token Budget**: Stop and report if approaching limit
3. **Subagent Calls** (Tier 2+ only): Check budget and depth before spawning
4. **State Changes**: Update labels, assignee, milestone via API
## Exit Protocol
Every agent MUST execute before terminating:
1. **Write Result Comment**: Structured markdown with machine-readable footer
2. **Update Checkpoint**: Patch issue body with new checkpoint YAML
3. **Update Labels**: Reflect new phase, quality, budget state
4. **Set Assignee**: Hand off to next agent or self
5. **Log Cascade**: If subagents were spawned, include cascade table
## Comment Format
```markdown
## 🔄 {agent-name} | phase:{phase} | depth:{depth}
**Event Type**: {subagent_result|state_change|budget_update|security_alert|checkpoint}
**Parent**: {parent_invocation_id}
**Invocation**: {invocation_id}
**Budget**: {before} → {consumed} → {remaining}
### Action Taken
{description}
### Result
```json
{result_json}
```
### Next Decision
**Recommended next**: @{agent-name}
**Rationale**: {why}
**Estimated tokens**: {number}
**Budget remaining**: {number}
### Cascade Log (if any)
| Agent | Task | Result | Tokens | Verdict |
|-------|------|--------|--------|---------|
| {agent} | {task} | {result} | {tokens} | ✅/❌ |
### State Changes
- Labels add: {list}
- Labels remove: {list}
- Assignee: {name}
- Milestone: {id}
---
<!-- GNS_EVENT: {machine_readable_json} -->
```
## Machine-Readable Footer
```html
<!-- GNS_EVENT: {
"type": "subagent_result|state_change|budget_update|security_alert|checkpoint",
"agent": "agent-name",
"invocation_id": "cap-042-003",
"parent_id": "orch-042-001",
"depth": 1,
"budget": {"before": 5000, "consumed": 1200, "remaining": 3800},
"state_changes": {
"labels_add": ["phase::drafting-spec"],
"labels_remove": ["phase::gathering-evidence"],
"assignee": "agent-architect",
"milestone": null,
"is_locked": false
},
"cascade_log": [
{"agent": "history-miner", "task": "git search", "tokens": 1200, "verdict": "pass"}
],
"next_agent": "agent-architect",
"estimated_next_tokens": 3000,
"timestamp": "2026-05-08T20:00:00Z"
} -->
```
## Checkpoint Schema v2
```yaml
checkpoint:
version: 2
issue: {number}
phase: {phase_name}
depth: {current_depth}
last_agent: {agent_name}
last_invocation: {invocation_id}
budget:
total: {allocated}
consumed: {used}
remaining: {left}
state:
labels: [{list}]
assignee: {agent_name}
milestone: {milestone_id}
history:
- {agent: name, invocation: id, action: description}
next_agent: {agent_name}
next_estimated_tokens: {number}
created_at: {ISO8601}
```
## Budget Governance
- Agent MUST check `checkpoint.budget.remaining` before any subagent call
- Subagent call rejected if `estimated_cost > remaining * 0.5`
- Budget exhaustion → add label `budget::exhausted`, pause, request human approval
- Agent MUST update `consumed` and `remaining` in checkpoint after completion
## Depth Governance
- `cascade::depth-0`: Leaf agents, no subagent calls
- `cascade::depth-1`: One level of subagent calls
- `cascade::depth-2`: Two levels of subagent calls
- `cascade::depth-n`: Unlimited (orchestrator only)
- Depth exceeded → add label `cascade::depth-exceeded`, lock issue
## Security Rules
- Agent MUST NOT modify `.kilo/` files without `permission::evolve-system`
- Agent MUST NOT call subagents not in `allowed_subagents` list
- Agent MUST NOT exceed `max_cascade_depth`
- Violation → add label `permission::violation`, `is_locked = true`
## Recovery
If agent crashes or orchestrator restarts:
1. Read issue body → parse checkpoint
2. Read timeline → reconstruct events since last checkpoint
3. Read comments → parse GNS_EVENT footers
4. Resume from `next_agent` in checkpoint
5. No state lost — everything is in Gitea
## Prohibited Actions
- DO NOT hold state in RAM without writing to Gitea
- DO NOT skip comment footer
- DO NOT skip checkpoint update
- DO NOT exceed budget or depth limits
- DO NOT modify checkpoint version
- DO NOT hardcode APAW in API calls

117
scripts/init-gns-labels.py Normal file
View File

@@ -0,0 +1,117 @@
#!/usr/bin/env python3
"""
GNS-2 Label Initialization Script
Idempotent creation of Gitea labels for GNS-2 semantic routing.
"""
import urllib.request
import json
import os
GITEA_API = os.environ.get('GITEA_API_URL', 'https://git.softuniq.eu/api/v1')
REPO = 'UniqueSoft/APAW'
USER = 'NW'
PASS = 'eshkink0t'
def api(path, data=None, method='GET'):
url = f"{GITEA_API}/repos/{REPO}{path}"
headers = {'Content-Type': 'application/json'}
req = urllib.request.Request(
url,
data=json.dumps(data).encode() if data else None,
headers=headers,
method=method
)
# Basic Auth
import base64
creds = base64.b64encode(f"{USER}:{PASS}".encode()).decode()
req.add_header('Authorization', f'Basic {creds}')
try:
with urllib.request.urlopen(req) as r:
return json.loads(r.read())
except urllib.error.HTTPError as e:
body = e.read().decode()
print(f" HTTP {e.code}: {body}")
return None
LABELS = [
# Phase labels
{"name": "phase::gathering-evidence", "color": "c2e0c6", "description": "Agent is gathering data"},
{"name": "phase::drafting-spec", "color": "0052cc", "description": "Agent is drafting specification"},
{"name": "phase::refining-prompt", "color": "fbca04", "description": "Agent is refining prompts"},
{"name": "phase::awaiting-review", "color": "d93f0b", "description": "Agent awaits review"},
{"name": "phase::executing", "color": "0e8a16", "description": "Agent is executing task"},
{"name": "phase::verifying", "color": "5319e7", "description": "Agent is verifying results"},
# Agent labels
{"name": "agent::orchestrator", "color": "7C3AED", "description": "Owned by orchestrator"},
{"name": "agent::capability-analyst", "color": "6366F1", "description": "Owned by capability-analyst"},
{"name": "agent::agent-architect", "color": "10B981", "description": "Owned by agent-architect"},
{"name": "agent::lead-developer", "color": "DC2626", "description": "Owned by lead-developer"},
{"name": "agent::code-skeptic", "color": "059669", "description": "Owned by code-skeptic"},
{"name": "agent::the-fixer", "color": "D97706", "description": "Owned by the-fixer"},
{"name": "agent::evaluator", "color": "8B5CF6", "description": "Owned by evaluator"},
{"name": "agent::history-miner", "color": "6B7280", "description": "Owned by history-miner"},
{"name": "agent::system-analyst", "color": "2563EB", "description": "Owned by system-analyst"},
{"name": "agent::sdet-engineer", "color": "0891B2", "description": "Owned by sdet-engineer"},
# Budget labels
{"name": "budget::sufficient", "color": "0e8a16", "description": "Token budget sufficient"},
{"name": "budget::warning", "color": "fbca04", "description": "Token budget low"},
{"name": "budget::exhausted", "color": "b60205", "description": "Token budget exhausted"},
# Permission labels
{"name": "permission::read-only", "color": "cfd3d7", "description": "Read-only access"},
{"name": "permission::write-code", "color": "0052cc", "description": "Can write code"},
{"name": "permission::write-config", "color": "5319e7", "description": "Can write config"},
{"name": "permission::evolve-system", "color": "b60205", "description": "Can evolve system"},
{"name": "permission::violation", "color": "b60205", "description": "Security violation"},
# Cascade labels
{"name": "cascade::depth-0", "color": "cfd3d7", "description": "No subagent calls"},
{"name": "cascade::depth-1", "color": "c2e0c6", "description": "1-level subagent calls"},
{"name": "cascade::depth-2", "color": "0052cc", "description": "2-level subagent calls"},
{"name": "cascade::depth-n", "color": "5319e7", "description": "Unlimited subagent calls"},
{"name": "cascade::depth-exceeded", "color": "b60205", "description": "Depth limit exceeded"},
# Quality labels
{"name": "quality::pass", "color": "0e8a16", "description": "Quality check passed"},
{"name": "quality::fail", "color": "b60205", "description": "Quality check failed"},
{"name": "quality::needs-fix", "color": "fbca04", "description": "Needs fixes"},
{"name": "quality::blocked", "color": "d73a4a", "description": "Blocked by quality"},
# Evolution labels
{"name": "evolution::model-change", "color": "8B5CF6", "description": "Model change evolution"},
{"name": "evolution::new-agent", "color": "10B981", "description": "New agent evolution"},
{"name": "evolution::new-skill", "color": "2563EB", "description": "New skill evolution"},
{"name": "evolution::new-workflow", "color": "7C3AED", "description": "New workflow evolution"},
{"name": "evolution::prompt-opt", "color": "D97706", "description": "Prompt optimization evolution"},
# Memory labels
{"name": "memory::checkpoint", "color": "0052cc", "description": "Checkpoint stored"},
{"name": "memory::stale", "color": "fbca04", "description": "Checkpoint stale"},
{"name": "memory::fresh", "color": "0e8a16", "description": "Checkpoint fresh"},
{"name": "memory::recoverable", "color": "c2e0c6", "description": "Checkpoint recoverable"},
]
def main():
print("GNS-2 Label Initialization")
print(f"Target: {REPO}")
print()
existing = api("/labels")
existing_names = {l['name'] for l in (existing or [])}
print(f"Existing labels: {len(existing_names)}")
created = 0
skipped = 0
for label in LABELS:
if label['name'] in existing_names:
print(f" SKIP: {label['name']}")
skipped += 1
continue
result = api("/labels", label, 'POST')
if result:
print(f" CREATE: {label['name']} ({label['color']})")
created += 1
else:
print(f" FAIL: {label['name']}")
print()
print(f"Done: {created} created, {skipped} skipped")
print(f"Total labels: {len(existing_names) + created}")
if __name__ == '__main__':
main()

View File

@@ -0,0 +1,74 @@
#!/usr/bin/env python3
"""
GNS-2 Agent Protocol Validator
Validates that agents follow Gitea-Nervous-System v2.0 protocol.
"""
import re
import sys
import yaml
import glob
CHECKPOINT_PATTERN = re.compile(r'## GNS Checkpoint\s*```yaml\s*(.*?)```', re.DOTALL)
EVENT_PATTERN = re.compile(r'<!-- GNS_EVENT:\s*(.*?)\s*-->', re.DOTALL)
def validate_agent_file(path):
with open(path) as f:
content = f.read()
errors = []
agent_name = path.split('/')[-1].replace('.md', '')
# Check frontmatter
if not content.startswith('---'):
errors.append('Missing YAML frontmatter')
else:
parts = content.split('---')
if len(parts) >= 2:
try:
fm = yaml.safe_load(parts[1])
if not fm.get('description'):
errors.append('Missing description in frontmatter')
if 'mode' not in fm:
errors.append('Missing mode in frontmatter')
if 'task' not in str(fm.get('permission', {})):
errors.append('Missing task permission')
except Exception as e:
errors.append(f'Invalid YAML frontmatter: {e}')
# Check GNS protocol sections
if 'GNS Checkpoint' not in content:
errors.append('Missing GNS Checkpoint section')
if 'GNS_EVENT' not in content:
errors.append('Missing GNS_EVENT footer example')
if 'gns-agent-protocol' not in content.lower() and 'GNS' not in content:
errors.append('Agent not updated for GNS-2 protocol')
return errors
def main():
print("GNS-2 Agent Protocol Validator")
print()
all_valid = True
for path in glob.glob('.kilo/agents/*.md'):
errors = validate_agent_file(path)
agent_name = path.split('/')[-1].replace('.md', '')
if errors:
print(f"{agent_name}: {len(errors)} errors")
for err in errors:
print(f" - {err}")
all_valid = False
else:
print(f"{agent_name}")
print()
if all_valid:
print("All agents pass GNS-2 validation")
return 0
else:
print("Some agents need GNS-2 protocol update")
return 1
if __name__ == '__main__':
sys.exit(main())

View File

@@ -86,6 +86,8 @@ export interface Issue {
created_at: string
updated_at: string
html_url?: string
is_locked?: boolean
milestone?: Milestone | null
}
export interface CreateIssueOptions {
@@ -517,8 +519,192 @@ export class GiteaClient {
)
}
async setIssueMilestone(issueNumber: number, milestoneId: number | null): Promise<Issue> {
return this.updateIssue(issueNumber, { milestone: milestoneId ?? 0 })
// ==================== Issue Assignees ====================
async getAssignee(issueNumber: number): Promise<string | null> {
const issue = await this.getIssue(issueNumber)
return issue.assignees && issue.assignees.length > 0 ? issue.assignees[0].login : null
}
async setAssignee(issueNumber: number, assignee: string | null): Promise<Issue> {
return this.updateIssue(issueNumber, { assignees: assignee ? [assignee] : [] })
}
// ==================== Issue Lock / Circuit Breaker ====================
async lockIssue(issueNumber: number): Promise<Issue> {
return this.updateIssue(issueNumber, { is_locked: true })
}
async unlockIssue(issueNumber: number): Promise<Issue> {
return this.updateIssue(issueNumber, { is_locked: false })
}
async isLocked(issueNumber: number): Promise<boolean> {
const issue = await this.getIssue(issueNumber)
return issue.is_locked || false
}
// ==================== GNS-2 Checkpoint Protocol ====================
private CHECKPOINT_PATTERN = /## GNS Checkpoint\s*```yaml\s*([\s\S]*?)```/
async getCheckpoint(issueNumber: number): Promise<any | null> {
const issue = await this.getIssue(issueNumber)
const match = this.CHECKPOINT_PATTERN.exec(issue.body)
if (!match) return null
try {
// Simple YAML-like parsing - in production use a YAML parser
const yaml = match[1]
const lines = yaml.split('\n').filter(l => l.trim() && !l.trim().startsWith('#'))
const result: any = {}
let current: any = result
let indentStack: { obj: any; indent: number }[] = [{ obj: result, indent: -1 }]
for (const line of lines) {
const indent = line.search(/\S/)
const trimmed = line.trim()
const [key, ...valParts] = trimmed.split(':')
const val = valParts.join(':').trim()
while (indentStack.length > 1 && indent <= indentStack[indentStack.length - 1].indent) {
indentStack.pop()
}
current = indentStack[indentStack.length - 1].obj
if (val === '') {
// Nested object
const newObj: any = {}
current[key.trim()] = newObj
indentStack.push({ obj: newObj, indent: indent })
} else if (val.startsWith('[') && val.endsWith(']')) {
// Array
current[key.trim()] = val.slice(1, -1).split(',').map(s => s.trim())
} else if (val === 'true' || val === 'false') {
current[key.trim()] = val === 'true'
} else if (!isNaN(Number(val))) {
current[key.trim()] = Number(val)
} else {
current[key.trim()] = val
}
}
return result
} catch {
return null
}
}
async updateCheckpoint(issueNumber: number, checkpoint: any): Promise<Issue> {
const issue = await this.getIssue(issueNumber)
const yamlBlock = `## GNS Checkpoint\n\`\`\`yaml\n${this.toYaml(checkpoint)}\n\`\`\``
let newBody: string
if (this.CHECKPOINT_PATTERN.test(issue.body)) {
newBody = issue.body.replace(this.CHECKPOINT_PATTERN, yamlBlock)
} else {
newBody = issue.body + '\n\n' + yamlBlock
}
return this.updateIssue(issueNumber, { body: newBody })
}
private toYaml(obj: any, indent = 0): string {
const spaces = ' '.repeat(indent)
let result = ''
for (const [key, val] of Object.entries(obj)) {
if (val === null || val === undefined) {
result += `${spaces}${key}:\n`
} else if (Array.isArray(val)) {
if (val.length === 0) {
result += `${spaces}${key}: []\n`
} else {
result += `${spaces}${key}:\n`
for (const item of val) {
if (typeof item === 'object') {
result += `${spaces}- ${this.toYaml(item, indent + 1).trimStart()}`
} else {
result += `${spaces}- ${item}\n`
}
}
}
} else if (typeof val === 'object') {
result += `${spaces}${key}:\n`
result += this.toYaml(val, indent + 1)
} else {
result += `${spaces}${key}: ${val}\n`
}
}
return result
}
async clearCheckpoint(issueNumber: number): Promise<Issue> {
const issue = await this.getIssue(issueNumber)
const newBody = issue.body.replace(this.CHECKPOINT_PATTERN, '')
return this.updateIssue(issueNumber, { body: newBody })
}
// ==================== GNS-2 Event Parsing ====================
private GNS_EVENT_PATTERN = /<!-- GNS_EVENT:\s*({[\s\S]*?})\s*-->/g
async getGNSEvents(issueNumber: number): Promise<any[]> {
const comments = await this.getComments(issueNumber)
const events: any[] = []
for (const comment of comments) {
let match
while ((match = this.GNS_EVENT_PATTERN.exec(comment.body)) !== null) {
try {
events.push(JSON.parse(match[1]))
} catch {
// skip malformed events
}
}
}
return events
}
async getLastGNSEvent(issueNumber: number): Promise<any | null> {
const events = await this.getGNSEvents(issueNumber)
return events.length > 0 ? events[events.length - 1] : null
}
// ==================== Polling: Triggered Issues ====================
async getTriggeredIssues(options?: {
labels?: string[]
assignee?: string
milestone?: number
updated_after?: string
is_locked?: boolean
}): Promise<Issue[]> {
const params = new URLSearchParams()
params.set('state', 'open')
if (options?.labels) {
params.set('labels', options.labels.join(','))
}
if (options?.assignee) {
params.set('assignee', options.assignee)
}
if (options?.milestone) {
params.set('milestone', String(options.milestone))
}
if (options?.updated_after) {
params.set('since', options.updated_after)
}
const issues = await this.request<Issue[]>(
'GET',
`/repos/${encodeURIComponent(this.owner)}/${encodeURIComponent(this.repo)}/issues?${params.toString()}`
)
if (options?.is_locked !== undefined) {
return issues.filter(i => (i.is_locked || false) === options.is_locked)
}
return issues
}
}

View File

@@ -1,68 +1,42 @@
// kilocode_change - integrated module
// Pipeline runner - orchestrates agent workflow with Gitea logging
// Pipeline runner - GNS-2 Polling Supervisor for distributed agent workflow
import type { AgentRole } from "./index"
import { decideRouting, formatAgentTag, type IssueContext, type RoutingDecision } from "./router"
import { type IssueStatus } from "./workflow"
import {
saveEfficiencyScore,
type EfficiencyScore,
hasLowScore,
findPromptOptimizationTargets
} from "./prompt-loader"
import {
calculateOverallScore,
generateRecommendations,
type AgentPerformance,
type EvaluationResult
} from "./evaluator"
import {
GiteaClient,
logPipelineStep,
logAgentPerformance,
detectRepository
} from "./gitea-client"
import * as fs from "fs"
import * as path from "path"
export interface PipelineConfig {
giteaToken?: string
giteaApiUrl?: string
efficiencyThreshold?: number
autoLog?: boolean
pollIntervalMs?: number
}
export interface PipelineRunOptions {
issueNumber: number
initialStatus?: IssueStatus
files?: string[]
testResults?: { passed: number; failed: number }
milestone?: number
}
export interface PipelineResult {
success: boolean
finalAgent: AgentRole | null
finalAgent: string | null
finalStatus: string
agentsUsed: AgentRole[]
agentsUsed: string[]
totalSteps: number
errors: string[]
}
export interface Checkpoint {
issueNumber: number
phase: string
agentName: string
filesModified: string[]
status: string
timestamp: string
nextAgent: string | null
}
export class PipelineRunner {
export class PollingSupervisor {
private client: GiteaClient
private efficiencyThreshold: number
private autoLog: boolean
private initialized: boolean = false
private pollInterval: number
constructor(config: PipelineConfig = {}) {
this.client = new GiteaClient({
@@ -71,6 +45,7 @@ export class PipelineRunner {
})
this.efficiencyThreshold = config.efficiencyThreshold ?? 7
this.autoLog = config.autoLog ?? true
this.pollInterval = config.pollIntervalMs ?? 30000 // 30 seconds
}
async initialize(): Promise<void> {
@@ -81,240 +56,236 @@ export class PipelineRunner {
this.initialized = true
}
async run(options: PipelineRunOptions): Promise<PipelineResult> {
/**
* GNS-2 Polling Supervisor
*
* Instead of actively dispatching agents in a while-loop,
* the supervisor periodically polls Gitea for issues that
* need attention based on labels, assignees, and comments.
*/
async supervise(options: PipelineRunOptions): Promise<PipelineResult> {
await this.initialize()
const agentsUsed: AgentRole[] = []
const agentsUsed: string[] = []
const errors: string[] = []
let currentStatus: IssueStatus = options.initialStatus ?? "new"
let currentAgent: AgentRole | null = null
let steps = 0
const maxSteps = 20 // Prevent infinite loops
let ctx: IssueContext = await this.buildIssueContext(options)
const maxSteps = 100 // Safety limit
// Main polling loop
while (steps < maxSteps) {
steps++
const decision = decideRouting(ctx)
if (!decision.nextAgent) {
break
// Check if issue is locked (circuit breaker)
const isLocked = await this.client.isLocked(options.issueNumber)
if (isLocked) {
await this.logEvent(options.issueNumber, '🔒', 'Issue locked by circuit breaker. Manual review required.')
return {
success: false,
finalAgent: null,
finalStatus: 'blocked',
agentsUsed,
totalSteps: steps,
errors: [...errors, 'Issue locked by circuit breaker']
}
}
currentAgent = decision.nextAgent
agentsUsed.push(currentAgent)
// Get current issue state
const issue = await this.client.getIssue(options.issueNumber)
const checkpoint = await this.client.getCheckpoint(options.issueNumber)
const lastEvent = await this.client.getLastGNSEvent(options.issueNumber)
if (this.autoLog) {
await logPipelineStep(
this.client,
// Check if workflow is complete
if (issue.state === 'closed') {
return {
success: errors.length === 0,
finalAgent: lastEvent?.agent || null,
finalStatus: 'completed',
agentsUsed,
totalSteps: steps,
errors,
}
}
// Check budget exhaustion
if (checkpoint?.budget?.remaining !== undefined && checkpoint.budget.remaining <= 0) {
await this.client.addLabels(options.issueNumber, ['budget::exhausted'])
await this.client.lockIssue(options.issueNumber)
await this.logEvent(options.issueNumber, '💰', 'Budget exhausted. Issue locked.')
return {
success: false,
finalAgent: lastEvent?.agent || null,
finalStatus: 'budget_exhausted',
agentsUsed,
totalSteps: steps,
errors: [...errors, 'Budget exhausted']
}
}
// Determine next action based on issue state
const nextAction = await this.determineNextAction(issue, checkpoint, lastEvent)
if (nextAction.type === 'invoke_agent') {
const agentName = nextAction.agent!
if (!agentsUsed.includes(agentName)) {
agentsUsed.push(agentName)
}
await this.logEvent(
options.issueNumber,
`${formatAgentTag(currentAgent)}`,
"started",
decision.instructions
'🚀',
`Invoking ${agentName} (depth: ${checkpoint?.depth || 0}, budget: ${checkpoint?.budget?.remaining || 'unknown'})`
)
// Update assignee to target agent
await this.client.setAssignee(options.issueNumber, agentName)
// In GNS-2, the agent itself will read the issue and act
// The supervisor just marks that the agent has been triggered
// The agent should respond by posting a comment
} else if (nextAction.type === 'wait') {
// Wait for agent to respond
await new Promise(resolve => setTimeout(resolve, this.pollInterval))
continue
} else if (nextAction.type === 'stuck') {
// Issue hasn't been updated in a while
await this.logEvent(options.issueNumber, '⏰', 'Process appears stuck. Last activity older than threshold.')
errors.push('Process stuck')
} else if (nextAction.type === 'complete') {
return {
success: errors.length === 0,
finalAgent: lastEvent?.agent || null,
finalStatus: 'completed',
agentsUsed,
totalSteps: steps,
errors,
}
}
currentStatus = decision.status as IssueStatus
await this.client.setStatus(options.issueNumber, currentStatus)
ctx = await this.buildIssueContext(options)
// Wait before next poll
await new Promise(resolve => setTimeout(resolve, this.pollInterval))
}
return {
success: errors.length === 0,
finalAgent: currentAgent,
finalStatus: currentStatus,
success: false,
finalAgent: null,
finalStatus: 'max_steps_reached',
agentsUsed,
totalSteps: steps,
errors,
errors: [...errors, `Max steps (${maxSteps}) reached`],
}
}
private async buildIssueContext(options: PipelineRunOptions): Promise<IssueContext> {
const issue = await this.client.getIssue(options.issueNumber)
const comments = await this.client.getComments(options.issueNumber)
/**
* Determine what to do next based on issue state
*/
private async determineNextAction(
issue: any,
checkpoint: any | null,
lastEvent: any | null
): Promise<{ type: 'invoke_agent' | 'wait' | 'stuck' | 'complete'; agent?: string }> {
return {
status: issue.labels.find(l => l.name.startsWith("status:"))?.name.replace("status: ", "") ?? "new",
labels: issue.labels.map(l => l.name),
checklists: this.parseChecklists(issue.body),
comments: comments.map(c => c.body),
files: options.files ?? [],
testResults: options.testResults,
const now = new Date()
const lastUpdated = new Date(issue.updated_at)
const minutesSinceUpdate = (now.getTime() - lastUpdated.getTime()) / 60000
// If issue was just updated and it's not by the supervisor, wait
if (minutesSinceUpdate < 1) {
return { type: 'wait' }
}
// If no checkpoint exists, this is a new issue
if (!checkpoint) {
return { type: 'invoke_agent', agent: 'requirement-refiner' }
}
// If last event specifies next_agent, invoke them
if (lastEvent?.next_agent) {
// Check if next agent has already responded
const comments = await this.client.getComments(issue.number)
const hasResponded = comments.some(
c => c.user?.login === lastEvent.next_agent ||
c.body.includes(`## 🔄 ${lastEvent.next_agent}`)
)
if (!hasResponded) {
return { type: 'invoke_agent', agent: lastEvent.next_agent }
}
}
// Check status labels for routing
const statusLabels = issue.labels.filter((l: any) => l.name.startsWith('status::'))
const status = statusLabels[0]?.name.replace('status::', '') || 'new'
// Map status to agent (fallback when checkpoint/event doesn't specify)
const statusToAgent: Record<string, string> = {
'new': 'requirement-refiner',
'planned': 'history-miner',
'researching': 'system-analyst',
'designed': 'sdet-engineer',
'testing': 'lead-developer',
'implementing': 'code-skeptic',
'reviewing': 'performance-engineer',
'fixing': 'the-fixer',
'releasing': 'release-manager',
'evaluated': 'evaluator',
'completed': 'orchestrator',
}
const nextAgent = statusToAgent[status]
if (nextAgent && status !== 'completed') {
return { type: 'invoke_agent', agent: nextAgent }
}
// If completed or no next agent, mark as complete
if (status === 'completed') {
return { type: 'complete' }
}
// If stuck for more than 10 minutes
if (minutesSinceUpdate > 10) {
return { type: 'stuck' }
}
return { type: 'wait' }
}
private parseChecklists(body: string): { completed: number; total: number } {
const lines = body.split("\n")
const checkItems = lines.filter(l => l.match(/- \[[ x]\]/i))
const completed = checkItems.filter(l => l.match(/- \[x\]/i)).length
return { completed, total: checkItems.length }
}
async logEvaluation(
issueNumber: number,
performances: AgentPerformance[],
iterations: number,
durationHours: number
): Promise<void> {
/**
* Poll multiple issues for a milestone
*/
async superviseMilestone(milestoneId: number): Promise<PipelineResult[]> {
await this.initialize()
const agents: Record<string, number> = {}
for (const perf of performances) {
agents[perf.agent] = perf.score
}
const result: EvaluationResult = {
issue: issueNumber,
date: new Date().toISOString(),
agents,
iterations,
duration_hours: durationHours,
summary: calculateOverallScore(performances).toString(),
recommendations: generateRecommendations({
issue: issueNumber,
date: new Date().toISOString(),
agents,
iterations,
duration_hours: durationHours,
summary: "",
recommendations: [],
}),
}
await saveEfficiencyScore({
issue: result.issue,
date: result.date,
agents: result.agents,
iterations: result.iterations,
duration_hours: result.duration_hours,
const triggered = await this.client.getTriggeredIssues({
milestone: milestoneId,
labels: ['status::new', 'status::planned', 'status::researching', 'status::designed', 'status::testing'],
is_locked: false,
})
const results: PipelineResult[] = []
for (const issue of triggered) {
const result = await this.supervise({ issueNumber: issue.number, milestone: milestoneId })
results.push(result)
}
return results
}
private async logEvent(issueNumber: number, emoji: string, message: string): Promise<void> {
if (this.autoLog) {
const overallScore = calculateOverallScore(performances)
const scoreEmoji = overallScore >= 8 ? "🟢" : overallScore >= 5 ? "🟡" : "🔴"
let comment = `## ${scoreEmoji} Pipeline Evaluation Report
**Issue**: #${issueNumber}
**Overall Score**: ${overallScore}/10
**Duration**: ${durationHours.toFixed(1)}h
**Iterations**: ${iterations}
### Agent Scores
| Agent | Score |
|-------|-------|
`
for (const perf of performances) {
const emoji = perf.score >= 8 ? "🟢" : perf.score >= 5 ? "🟡" : "🔴"
comment += `| ${emoji} ${perf.agent} | ${perf.score}/10 |\n`
}
if (result.recommendations.length > 0) {
comment += `\n### Recommendations\n\n`
for (const rec of result.recommendations) {
comment += `- ${rec}\n`
}
}
await this.client.createComment(issueNumber, { body: comment })
const lowScorers = performances.filter(p => p.score < this.efficiencyThreshold)
if (lowScorers.length > 0) {
const targets = lowScorers.map(p => `@${p.agent}`).join(", ")
await this.client.createComment(issueNumber, {
body: `⚠️ **Prompt Optimization Needed**\n\nThe following agents scored below ${this.efficiencyThreshold}/10: ${targets}\n\nConsider running prompt optimization after this issue is closed.`
})
}
await this.client.createComment(issueNumber, {
body: `${emoji} **Supervisor**: ${message}\n\n\`\`\`\nTimestamp: ${new Date().toISOString()}\n\`\`\``
})
}
}
async checkForDuplicates(issueNumber: number, keywords: string[]): Promise<{
hasDuplicates: boolean
relatedIssues: number[]
}> {
await this.initialize()
const recentComments = await this.client.getComments(issueNumber)
const minedIssues: number[] = []
for (const keyword of keywords) {
for (const comment of recentComments) {
const matches = comment.body.matchAll(/#(\d+)/g)
for (const match of matches) {
const num = parseInt(match[1], 10)
if (num !== issueNumber && !minedIssues.includes(num)) {
minedIssues.push(num)
}
}
}
}
return {
hasDuplicates: minedIssues.length > 0,
relatedIssues: minedIssues,
}
}
async saveCheckpoint(checkpoint: Checkpoint): Promise<void> {
// Ensure the checkpoints directory exists
const checkpointDir = path.join(process.cwd(), '.kilo', 'logs', 'checkpoints');
if (!fs.existsSync(checkpointDir)) {
fs.mkdirSync(checkpointDir, { recursive: true });
}
// Save the checkpoint as JSON
const filename = `${checkpoint.issueNumber}-${checkpoint.phase}.json`;
const filepath = path.join(checkpointDir, filename);
fs.writeFileSync(filepath, JSON.stringify(checkpoint, null, 2));
}
async loadCheckpoint(issueNumber: number): Promise<Checkpoint | null> {
const checkpointDir = path.join(process.cwd(), '.kilo', 'logs', 'checkpoints');
// Check if directory exists
if (!fs.existsSync(checkpointDir)) {
return null;
}
// Find the latest checkpoint file for this issue
const files = fs.readdirSync(checkpointDir);
const issueFiles = files.filter(file =>
file.startsWith(`${issueNumber}-`) && file.endsWith('.json')
);
if (issueFiles.length === 0) {
return null;
}
// Sort by modification time to get the latest
const sortedFiles = issueFiles.sort((a, b) => {
const statA = fs.statSync(path.join(checkpointDir, a));
const statB = fs.statSync(path.join(checkpointDir, b));
return statB.mtime.getTime() - statA.mtime.getTime();
});
const latestFile = sortedFiles[0];
const filepath = path.join(checkpointDir, latestFile);
const content = fs.readFileSync(filepath, 'utf8');
return JSON.parse(content) as Checkpoint;
}
async resumeFromCheckpoint(issueNumber: number): Promise<string | null> {
const checkpoint = await this.loadCheckpoint(issueNumber);
return checkpoint ? checkpoint.nextAgent : null;
}
}
export async function createPipelineRunner(config?: PipelineConfig): Promise<PipelineRunner> {
const runner = new PipelineRunner(config)
await runner.initialize()
return runner
export async function createPollingSupervisor(config?: PipelineConfig): Promise<PollingSupervisor> {
const supervisor = new PollingSupervisor(config)
await supervisor.initialize()
return supervisor
}
export { GiteaClient }
export { GiteaClient }