28 CI-safe eval cases.
Public Trust Benchmark
Recommendation eval top-3 hit rate is 1; explanation checks are 12/12; data trust is 81/100 with 12 review queue items.
28 CI-safe eval cases.
Shortlist health for recommendation and search flows.
12/12 explanation checks passed.
high current corpus risk.
Benchmark
Eval cases match expected category classifications.
Benchmark
Eval cases match expected deployment classifications.
Benchmark
Eval cases match expected Cloudflare readiness.
Benchmark
Quality release gate based on warnings and errors.
Benchmark
Tracked taxonomy coverage in the loaded corpus.
Benchmark
Projects needing classification, collection, or signal review.
Known Limitations
Review Focus
Top Review Queue Items