When was npm:mcp-eval-runner last audited?

Last checked 2026-06-03; wmcp.sh re-checks it on a schedule for tool drift and rug-pulls.

B+

npm:mcp-eval-runner

Item: npm:mcp-eval-runner
Rating: 89
Author: wmcp.sh

https://www.npmjs.com/package/mcp-eval-runner

89/100 · MCP Trust Grade · checked 1mo ago · MCP 1.0.0

Is the npm:mcp-eval-runner MCP server safe to use?

Independent trust grade B+ (89/100). Static analysis of npm package mcp-eval-runner@1.0.0 (stdio server — no remote endpoint). Reliability/behavioral signals require running it; not measured. wmcp.sh continuously watches npm:mcp-eval-runner for tool drift and rug-pulls. The grade is free and identical whether or not the operator pays.

What it offers — 3 tools · Developer Tools

list_cases

Enumerate all available fixtures with their step counts and assertion types.

regression_report

Compare current state to the last run and return what changed (regressions, fixes, new cases).

compare_results

Diff two named run results by run ID. Shows regressions, fixes, new and removed cases.

Spec / packaging20%100

✓ depends on an MCP SDK
✓ declares a bin entry (runnable server)
✓ 3 tool(s) detected in source

Security (OWASP MCP)30%90

no high-risk patterns in sampled source
scanned 6 source files

Maintenance / popularity20%92

published within 90 days
1 published version

Tool hygiene15%70

no TypeScript types
4 runtime deps

Transparency / provenance15%90

✓ public repository linked
✓ MIT license
12 weekly downloads

Findings

INFO Static analysis of npm package mcp-eval-runner@1.0.0 (stdio server — no remote endpoint). Reliability/behavioral signals require running it; not measured.

Grade another server

We re-grade npm:mcp-eval-runner on a schedule and alert your Slack/webhook the moment its tools change or its grade drops — rug-pull insurance for the connection.

Share this report card

A 1200×630 card with the grade + audit — drop it in a post, Slack, or your repo.

MCP Trust report card — npm:mcp-eval-runner grade B+

Share on X Open card image

https://wmcp.sh/mcp/grade/npm%3Amcp-eval-runner

Embed this grade

A live badge — it re-verifies itself and shows current stability. Static scorecards can't. Paste it in your README or site to show users you're independently audited.

Markdown (for your README)

[![MCP Trust Grade B+](https://wmcp.sh/mcp/grade/npm%3Amcp-eval-runner/badge.svg)](https://wmcp.sh/mcp/grade/npm%3Amcp-eval-runner)

HTML

<a href="https://wmcp.sh/mcp/grade/npm%3Amcp-eval-runner"><img src="https://wmcp.sh/mcp/grade/npm%3Amcp-eval-runner/badge.svg" alt="MCP Trust Grade B+ · wmcp.sh"></a>

Agents: check this before connecting

Add the wmcp.sh trust oracle as an MCP server and call grade_mcp_server / check_mcp_drift in your agent's pre-connection gate:

https://wmcp.sh/mcp/trust

How this grade is computed. An open, independent rubric — Spec conformance (20%), Security mapped to the OWASP MCP Top 10 (30%), Reliability (20%), Tool hygiene (15%), Transparency (15%) — run by connecting to the server and inspecting its real MCP surface. The grade is free and identical whether or not the operator pays. v1 uses static + spec signals from a single connection; continuous uptime, real latency, and annotation-truthing (declared readOnly vs observed behavior) layer on via the wmcp.sh proxy.