Black-box AI reliability certification via self-consistency sampling and conformal calibration
Updated Mar 28, 2026 - Python
Production RAG evaluation pipeline: RAGAS (faithfulness · context recall · answer relevancy) + DeepEval (hallucination scoring). Lambda-triggered on KB updates. Regression gating blocks deployment at >5% metric drop.
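The regression gate in the description above could look roughly like the following. This is a minimal sketch, not the repository's implementation. It assumes the 5% threshold is a relative drop from a stored baseline and that every metric passed in is higher-is-better, so a lower-is-better score such as a hallucination rate would need to be inverted first. The names `find_regressions`, `deployment_allowed`, and `MAX_RELATIVE_DROP` are hypothetical.

```python
from typing import Dict, List

# Assumed reading of ">5% metric drop": a drop relative to the baseline value.
MAX_RELATIVE_DROP = 0.05


def find_regressions(
    baseline: Dict[str, float],
    candidate: Dict[str, float],
    max_drop: float = MAX_RELATIVE_DROP,
) -> List[str]:
    """Return the names of metrics whose relative drop exceeds max_drop."""
    regressed = []
    for name, base in baseline.items():
        if base <= 0:
            continue  # a relative drop is undefined against a non-positive baseline
        new = candidate.get(name, 0.0)  # a missing metric is treated as a full drop
        if (base - new) / base > max_drop:
            regressed.append(name)
    return regressed


def deployment_allowed(baseline: Dict[str, float], candidate: Dict[str, float]) -> bool:
    """Allow deployment only when no metric regressed past the threshold."""
    return not find_regressions(baseline, candidate)


if __name__ == "__main__":
    # Illustrative numbers only, not measured results.
    baseline = {"faithfulness": 0.90, "context_recall": 0.80, "answer_relevancy": 0.85}
    candidate = {"faithfulness": 0.88, "context_recall": 0.70, "answer_relevancy": 0.85}
    print(find_regressions(baseline, candidate))
    print(deployment_allowed(baseline, candidate))
```

Comparing against a relative rather than absolute drop keeps one threshold meaningful across metrics with different typical ranges. An absolute threshold would be an equally plausible reading of the description.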
Compare Linux hosts against service profiles and produce machine-readable fit decisions