← Back to cards
executionA

Benchmark Test Skill

/benchmark-test-skill

Run verify and benchmark tests for one agentic-skills skill, producing pass-rate, latency, cost, and consistency metrics

v0.3

Benchmark Test Skill

Run verify and benchmark tests for one agentic-skills skill, producing pass-rate, latency, cost, and consistency metrics

Platformclaude
Scopepack
Packagentic-skills-bench
Versionv0.3
agentic-skills-benchbenchmarkclaudeexecutionpackskill

Benchmark Test Skill

A
/benchmark-test-skill

Run verify and benchmark tests for one agentic-skills skill, producing pass-rate, latency, cost, and consistency metrics

Type
execution
Platform
claude
Scope
pack
Version
v0.3
Pack
agentic-skills-bench
agentic-skills-benchbenchmarkclaudeexecutionpackskilltest

Benchmark

AgentPass rateCost / run
claude100.0% (3/3 evaluated)$0.25
codex100.0% (3/3 evaluated)$0.25

Part of decks

Not part of any deck.