SkillBench

Standalone skill test runner for AI agent skills. SkillBench runs *.skillbench.{ts,js,mts,mjs} suites in isolated workspaces and writes deterministic reports.

30-second demo

npx @swarmclawai/skillbench@latest init --write
npx @swarmclawai/skillbench@latest run
npx @swarmclawai/skillbench@latest report

Test File

import { defineSkillSuite, fileContains, exitCode } from "@swarmclawai/skillbench";

export default defineSkillSuite({
  name: "demo-skill",
  cases: [
    {
      name: "writes a note",
      workspace: { "input.txt": "hello\n" },
      command: ["node", "agent.mjs"],
      assertions: [exitCode(0), fileContains("note.md", "hello")]
    }
  ]
});

Commands

Command	Purpose
`skillbench init`	Print or write a sample suite
`skillbench list [path]`	List discovered suites and cases
`skillbench run [path]`	Run suites and write `.skillbench/latest.json`
`skillbench report [path]`	Render a Markdown report
`skillbench help-agents`	Print the machine-readable command catalog

Every data-returning command supports --json.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
.github/workflows		.github/workflows
docs/launch		docs/launch
src		src
tests		tests
.gitignore		.gitignore
.npmignore		.npmignore
AGENTS.md		AGENTS.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
package.json		package.json
pnpm-lock.yaml		pnpm-lock.yaml
tsconfig.json		tsconfig.json
tsup.config.ts		tsup.config.ts
vitest.config.ts		vitest.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SkillBench

30-second demo

Test File

Commands

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

SkillBench

30-second demo

Test File

Commands

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages