engineerstoolCommunitysha:c56e42c6d15cb3ebmanual

skill-judge

Use when reviewing, auditing, or improving an agent SKILL.md so the agent scores design quality across multiple dimensions and emits concrete fixes against the official skill spec.

source: https://github.com/softaworks/agent-toolkit/tree/main/skills/skill-judge ↗softaworks/agent-toolkit· ★ 1.9k

Install confidence

curl --create-dirs -fsSL https://skillmake.xyz/i/skill-judge -o ~/.claude/skills/skill-judge/SKILL.md

Pinned content

sha:c56e42c6d15cb3eb

Generated with

manual

Source

github.com

The file served at /api/marketplace/skill-judge-c56e42c6/raw matches this hash. Inspect before install, then copy the command.

2,757 chars · ~689 tokens

---
name: skill-judge
description: Use when reviewing, auditing, or improving an agent SKILL.md so the agent scores design quality across multiple dimensions and emits concrete fixes against the official skill spec.
source: https://github.com/softaworks/agent-toolkit/tree/main/skills/skill-judge
generated: 2026-05-25T05:10:34.966Z
category: tool
audience: ai
---

## When to use

- Auditing a freshly-written SKILL.md before publishing it to a marketplace
- Reviewing a third-party skill to decide whether it's worth installing
- Refactoring an in-house skill that has drifted from the official spec over time
- Comparing two skills covering the same surface to pick the better-designed one

## Key concepts

### Skill as knowledge externalization, not tutorial

Skill Judge enforces the distinction: a skill encodes patterns and triggers the agent activates on, not step-by-step instructions for a human. Mis-framed skills score low here.

### Multi-dimensional scoring

Each skill is graded across activation clarity, trigger specificity, content quality, tool/format hygiene, and anti-pattern detection. The output is a per-dimension score plus an aggregate, not a single thumbs-up.

### Reference library of 17+ examples

Patterns are derived from real official skills, not invented. When the judge flags something, it can point to a reference skill that handles the same problem correctly.

### Actionable fix suggestions

Findings come with the specific change to make — exact wording for triggers, missing gotcha categories, mis-shaped tool blocks — instead of generic 'improve clarity' notes.

### Anti-pattern detection

Catches common drift: tutorial-shaped skills, over-broad triggers, missing failure modes, and skills that duplicate work the base model already does well.

## API reference

```
npx skills add softaworks/agent-toolkit --skill skill-judge
```

Install the skill-judge auditing skill.

```
npx skills add softaworks/agent-toolkit --skill skill-judge
```

## Gotchas

- Don't run the judge against itself or other judge-style skills; the rubric was tuned for action skills
- Low scores don't mean delete; they mean 'rewrite against the spec' — the judge points to which dimension to fix first
- A perfect score is rare and not the goal; aim for high marks on activation and triggers, the highest-leverage dimensions
- Skill versioning matters; re-run the judge after any meaningful edit because triggers drift fast
- Some legitimate niche skills score lower because their patterns are unique; treat the judge as a discussion partner, not a gate

---
Generated by SkillMake from https://github.com/softaworks/agent-toolkit/tree/main/skills/skill-judge on 2026-05-25T05:10:34.966Z.
Verify against source before relying on details.

File: ~/.claude/skills/skill-judge/SKILL.md