B+

npm:mcp-eval-runner

https://www.npmjs.com/package/mcp-eval-runner
89/100 · MCP Trust Grade · checked 3h ago · MCP 1.0.0

What it offers — 3 tools · Developer Tools

list_cases

Enumerate all available fixtures with their step counts and assertion types.

regression_report

Compare current state to the last run and return what changed (regressions, fixes, new cases).

compare_results

Diff two named run results by run ID. Shows regressions, fixes, new and removed cases.

Spec / packaging20%100
Security (OWASP MCP)30%90
Maintenance / popularity20%92
Tool hygiene15%70
Transparency / provenance15%90

Findings

INFO Static analysis of npm package mcp-eval-runner@1.0.0 (stdio server — no remote endpoint). Reliability/behavioral signals require running it; not measured.
Grade another server

We re-grade npm:mcp-eval-runner on a schedule and alert your Slack/webhook the moment its tools change or its grade drops — rug-pull insurance for the connection.

Share this report card

A 1200×630 card with the grade + audit — drop it in a post, Slack, or your repo.

MCP Trust report card — npm:mcp-eval-runner grade B+
Share on X Open card image

Embed this grade

A live badge — it re-verifies itself and shows current stability. Static scorecards can't. Paste it in your README or site to show users you're independently audited.

MCP Trust Grade B+ · wmcp.sh
[![MCP Trust Grade B+](https://wmcp.sh/mcp/grade/npm%3Amcp-eval-runner/badge.svg)](https://wmcp.sh/mcp/grade/npm%3Amcp-eval-runner)
<a href="https://wmcp.sh/mcp/grade/npm%3Amcp-eval-runner"><img src="https://wmcp.sh/mcp/grade/npm%3Amcp-eval-runner/badge.svg" alt="MCP Trust Grade B+ · wmcp.sh"></a>

Agents: check this before connecting

Add the wmcp.sh trust oracle as an MCP server and call grade_mcp_server / check_mcp_drift in your agent's pre-connection gate:

https://wmcp.sh/mcp/trust
How this grade is computed. An open, independent rubric — Spec conformance (20%), Security mapped to the OWASP MCP Top 10 (30%), Reliability (20%), Tool hygiene (15%), Transparency (15%) — run by connecting to the server and inspecting its real MCP surface. The grade is free and identical whether or not the operator pays. v1 uses static + spec signals from a single connection; continuous uptime, real latency, and annotation-truthing (declared readOnly vs observed behavior) layer on via the wmcp.sh proxy.