/cmetrics — Workflow Metrics Dashboard
Aggregate all workflow data into a project health and ROI dashboard showing bugs caught, token cost, and trends.
When to Use
- Monthly, to track how the workflow is performing over time.
- Before presenting to stakeholders, to justify the workflow investment with data.
- When deciding whether to increase workflow intensity.
- When something feels off — metrics reveal which phases are pulling weight and which are not.
- Not for: Single-feature summaries (use
/csummary), or checking current workflow state (use/cstatus).
How It Fits in the Workflow
This skill is standalone and project-wide. It reads accumulated data from every feature that has gone through the Correctless workflow — QA findings, verification reports, antipatterns, drift debt, audit history, token logs, and git history. It produces a comprehensive dashboard with actionable health analysis. Run it periodically to spot trends.
What It Does
- Counts features completed, total issues caught (deduplicated across phases), and bug escape rate.
- Breaks down issues by phase (Review, Test Audit, QA, Verify, Audit) to show where value concentrates.
- Tracks antipattern growth by category, flagging systemic issues that may need architectural fixes.
- Monitors drift debt health: open items, staleness, accumulation vs. resolution rate.
- Token ROI Analysis: Reads
.correctless/artifacts/token-log-*.jsonto compute cost per bug caught, tokens per feature by phase, tokens per LOC, and estimated production fix cost avoided. Shows whether token spend is efficient (e.g., “65% of tokens go to TDD, which catches 60% of bugs”). - Session Analytics: Reads Claude Code
session-metaandfacetsdata from~/.claude/usage-data/, filtered to the current project. Reports exact token cost (ground truth), tool distribution, friction rate, user engagement, and outcome rate. - Correctless vs Freeform comparison: Identifies which sessions used Correctless (by checking artifact timestamps and tool patterns) vs. freeform coding, then compares outcome rate, friction, duration, and token usage between the two groups.
- Cross-metric correlation analysis: flags patterns like “specs revised mid-TDD frequently AND antipattern growth accelerating in the same category.”
Example
User: /cmetrics
# Correctless Metrics — my-project
# Generated: 2026-03-29
## Overview
- Features completed: 8
- Total issues caught: 47
- Bug escape rate: 2 escaped / 47 caught (4.1%)
- Workflow active since: 2026-01-15
## Issues by Phase
| Phase | Issues | % of Total | Notes |
|--------------|--------|------------|------------------------------------|
| Review | 12 | 25% | Security checklist added 40% of these |
| QA | 22 | 47% | 5 class fixes added structural tests |
| Verify | 8 | 17% | |
| Audit | 5 | 11% | |
## Token ROI Analysis
- Total tokens tracked: 2.4M across 8 features
- Cost per bug caught: ~51k tokens (47 bugs)
- Estimated production fix cost avoided: 47 bugs x 2-10 hrs = 94-470 hrs
## Session Analytics
- Correctless sessions: 62% fully achieved outcome
- Freeform sessions: 41% fully achieved outcome
## Health Analysis
- QA rounds trending down (3.1 -> 2.3 avg) — workflow getting more effective.
- Error handling antipatterns growing fastest (+4 in 3 months). Consider architectural pattern.
- Spec revision rate: 25% of features — acceptable.
What It Reads / Writes
| Reads | Writes |
|---|---|
.correctless/artifacts/qa-findings-*.json | .correctless/artifacts/metrics-{date}.md |
docs/verification/*-verification.md | |
.correctless/meta/workflow-effectiveness.json | |
.correctless/antipatterns.md | |
.correctless/meta/drift-debt.json | |
.correctless/artifacts/findings/audit-*-history.md | |
.correctless/specs/*.md | |
.correctless/artifacts/summary-*.md | |
.correctless/artifacts/token-log-*.json | |
.correctless/artifacts/audit-trail-*.jsonl | |
~/.claude/usage-data/session-meta/*.json | |
~/.claude/usage-data/facets/*.json | |
docs/decisions/*.md | |
| Git log |
Intensity Levels
- Standard intensity: Omits Olympics convergence analysis and Olympics history table (those features require high+ intensity).
- High/critical intensity: Adds Olympics history, convergence speed analysis, and audit-specific metrics.
Common Issues
- Sparse data on new projects: This is expected. Data accumulates with each feature. The dashboard reports what exists and notes what will appear after more features run.
- Session-meta not found: Claude Code session data may not exist for the project yet. Session analytics will appear after a few sessions.
- Token logs vs. session-meta divergence: If the numbers differ significantly, the dashboard notes it: the gap is orchestrator overhead not captured by subagent tracking.