Files
APAW/agent-evolution/archive
Deploy Bot a0604afaf6 chore: archive generated files and clean up runtime outputs
- index.standalone.html → agent-evolution/archive/index.standalone-2026-05-25.html (generated build output)
- tests/visual/dashboard-tabs/current/*.png → tests/visual/archive/dashboard-tabs-current-2026-05-25/ (runtime capture output)
- Cleaned empty tests/visual/dashboard-tabs/current/ directory
2026-05-25 21:23:47 +01:00
..

Agent Evolution Archive

This directory contains temporary scripts, reports, and artifacts from the April 2026 model research and data migration work.

Structure

  • scripts/ — One-off Python/Node scripts used for data migration, diagnostics, and fixes
  • reports/ — Intermediate analysis outputs (JSON, Markdown, text)
  • data/ — Test data files and old dashboard builds

Notable Scripts

Script Purpose
add_current_model_id.py Adds current_model_id string field to agent scores (replaces fragile index lookup)
add_fallback_models.py Injects fallback_models list into model-benchmarks.json agents
add_groq_fallbacks.py Adds Groq provider fallback configuration with rate limits
analyze_if_scores.py Analyzes IF (Inverse FLOPs) scores across model matrix
check_fixes.py Validates fixes applied to model-benchmarks.json
fix_model_benchmarks.py Corrects field name mismatches (before/after vs score_before/score_after)
update_failover_strategies.py Adds failover_strategy blocks to capability-index.yaml
validate_fixes.py Full validation suite for model-benchmarks.json structure
verify_fallback.py Checks fallback model existence in benchmark data
verify_fixes_simple.py Quick structure verification after fixes
_validate_benchmarks.py JSON schema validation runner
rebuild-template.js ES module version of rebuild-template.cjs (superseded)
sync-benchmarks-from-yaml.js ES module version of sync script (superseded)

Reports

Report Description
analysis_results.json IF score analysis output
optimization_summary.md Model assignment optimization recommendations
recommendations_summary.md Condensed recommendation matrix
research_report.md Full research findings before dashboard integration
quick_findings.txt Bullet-point summary for quick reference
patch.json Proposed data patches before application

Data Files

File Description
model-research-test.json Test fixture for dashboard rendering
research-dashboard-2026_04_27.html Previous dashboard build (superseded by 2026_04_29)

When to Reuse

These scripts may be useful for:

  • Future data migrations when model-benchmarks.json schema changes
  • Re-adding fallback configurations after resets
  • Benchmarking new model providers
  • Diagnostic checks when dashboard fails to render

Migration Notes

All fixes applied by these scripts have been merged into the production data:

  • model-benchmarks.json contains current_model_id on every agent
  • capability-index.yaml contains fallback_models and failover_strategy
  • Field names standardized to before/after (not score_before/score_after)