SkillCheck Methodology · One-Page Poster

Seven phases of rubric construction, from December 2025 to June 2026. Each phase added a defined set of categories; block width encodes how many. Every reference chip is clickable.

Phase I

Build to Spec

v1.0 – v1.4 · Dec 2025

10categories 1–10

R1 R2 R3

Phase II

From Practice

v3.10 – v3.14 · Mar 2026

08categories 11–18

Phase III

From Field

v3.16 – v3.17 · Apr 2026

03categories 19–21

19Design Pattern
20Trigger Collision
21Eval Kit

R5 R6

Phase IV

Ecosystem

v3.18.0 · Apr 2026

01cat. 22

22Knowledge Density

R7 R8 R9

Phase V

Marketplace

v3.20.0 · Apr 2026

03categories 23–25

23Agent Integration
24Mkt. Governance
25Memory Governance

R10 R11 R12

Phase VI

Agentic Safety

v3.21.0 · Jun 2026

01cat. 26

26OWASP Agentic Top 10

R13

Phase VII

Lived-In-Ness

v3.22.0 · Jun 2026

01cat. 27

27Repo Maturity

R14

Hasan et al.

97%

of 856 MCP tools across 103 servers contained at least one description smell.

arXiv:2602.14878 · Phase IV anchor

Wang et al.

72%vs.20%

Across 10,831 MCP servers, standard-compliant descriptions get picked by an agent 72% of the time. Non-compliant descriptions get picked 20% of the time.

arXiv:2602.18914 · Phase IV anchor

Scoring Math

100starting score

Critical −20

Warning −5

Suggestion −1

Strength +0