MBT-5 Geometry-Only Output Regulator

MBT-5 Geometry-Only Output Regulator

MBT-5 tests whether AI candidate outputs remain inside a supplied semantic and relational reference manifold.

It runs at inference time:

no training
no fine-tuning
no model-weight inspection
no hidden classifier

The regulator checks candidate outputs and either emits the safest supported candidate or blocks when every candidate is unsafe.

Core Claim

MBT-5 treats hallucination as semantic or relational drift from supplied reference structure. It is not a fact oracle and does not claim direct access to external truth.

Universe does facts.
Humans describe the universe.
AI describes human descriptions.
MBT-5 regulates AI descriptions against supplied human reference structure.

Supported public claim:

MBT-5 v11 blocked hallucinated AI outputs against supplied reference manifolds
and relation constraints. In the frozen EXP20 ledger, it achieved confusion
[[97, 0], [0, 160]] over 257 labelled candidates across 53 cases.

Claims and Scope

See CLAIMS.md for the claim register. The short version: MBT-5 regulates outputs against supplied references and the public offline corpus; it is not claimed to be a universal fact checker.

Continuous integration for the offline corpus lives in .github/workflows/tests.yml.

Current Locked Result

Frozen ledger:

MBT5_EXP20_combined_guarded_master_ledger_v11

Candidates: 257
Cases:      53

Candidate-level confusion:
[[TN, FP], [FN, TP]]
[[97, 0], [0, 160]]

Accuracy:   1.0000
Precision:  1.0000
Recall:     1.0000
F1:         1.0000

Case-level:
Correct: 53 / 53
Emitted: 28
Blocked: 25

The current public claim is limited to the supplied test suites and reference manifolds included in the project.

What MBT-5 Checks

MBT-5 combines:

semantic geometry
internal consistency scoring
token-level shock analysis
literal drift guards
entity, number, and unit protection
overclaim detection
copular relation checks
non-copular relation checks
relation polarity checks
unsupported negation clamps
abstention when every candidate is unsafe

Examples of blocked drift:

The capital of France is London.
The Sun is a planet.
Earth is flat.
Water boils at 90 degrees Celsius at sea level.
Gravity is fully solved by modern physics.
Scientific descriptions do not use measurements.
DNA contains the nucleus.
The Sun orbits Earth.

Core Mechanisms

Semantic Shock

Candidate outputs are embedded into semantic space. MBT-5 measures distance from the reference manifold:

shock = Gamma * ||candidate_embedding - reference_center||^2

Higher shock means stronger semantic drift.

Literal Drift Guards

Geometry alone can miss small but important substitutions. MBT-5 protects numbers, units, named entities, and key content tokens.

Relation Clamps

MBT-5 checks relation structure, not just semantic similarity.

Copular examples:

The Sun is a planet.
A dog is a bird.
Rome is the capital city of France.

Non-copular examples:

Earth orbits the Moon.
The Sun orbits Earth.
DNA contains the nucleus.
Heat produces friction.
Photosynthesis converts oxygen into carbon dioxide and water.

Negation Clamp

MBT-5 blocks unsupported negations of positive reference support.

Water is not liquid at room temperature.
Sound does not need a material medium to travel.
Scientific descriptions do not use measurements or predictions.
General relativity proves gravity has no connection to mass or energy.

Abstention

When every candidate is unsafe, MBT-5 blocks instead of emitting the least-bad candidate.

Installation

Core (offline, no embedding dependency):

pip install -e .

Embedding-enabled mode:

pip install -e .[embeddings]

The optional extra currently installs sentence-transformers>=2.6.0,<3 for model-backed operation.

If sentence-transformers is unavailable, use offline literal/relation-only regulation with --no-embeddings / use_embeddings=False:

from mbt_ai_tools import evaluate_candidate

evaluate_candidate("Paris is the capital city of France.", ["The capital of France is Paris."], use_embeddings=False)

When embedding-backed operation is requested without sentence-transformers, you'll now get a direct error directing to install the dependency or use offline mode.

Python Usage

from mbt_ai_tools import regulate_candidates

references = [
    "The capital of France is Paris.",
    "Paris is the capital city of France.",
]
candidates = [
    "The capital of France is London.",
    "The capital of France is Paris.",
]

result = regulate_candidates(candidates, references, use_embeddings=False)
print(result.action)        # emit
print(result.emitted_text)  # The capital of France is Paris.

CLI Usage

mbt-check \
  --reference "The capital of France is Paris." \
  --candidate "The capital of France is London." \
  --candidate "The capital of France is Paris." \
  --no-embeddings

Expected output:

EMIT | The capital of France is Paris. | score=0.0000
[0] blocked | ...
[1] safe | ...

JSON report output:

mbt-check \
  --reference "The capital of France is Paris." \
  --candidate "The capital of France is London." \
  --candidate "The capital of France is Paris." \
  --no-embeddings \
  --format json

See examples/cli_json_report.md for a complete offline JSON report demo.

Optional token-level shock details can be included in regulation reports when embedding dependencies are installed:

mbt-check \
  --reference "The capital of France is Paris." \
  --candidate "The capital of France is London." \
  --format json \
  --token-shock \
  --token-shock-top-k 5

Batch JSONL evaluation:

mbt-check --input-jsonl examples/batch_input.jsonl --no-embeddings --output batch-report.jsonl

CI guard mode:

mbt-check --input-jsonl examples/batch_input.jsonl --no-embeddings --summary --fail-on-block

--fail-on-block exits with status 2 when a single regulation run blocks or any batch row blocks. --summary appends a final batch summary JSON object.

Markdown audit report:

mbt-check --input-jsonl examples/batch_input.jsonl --no-embeddings --format markdown --output audit.md

See examples/markdown_audit_report.md for a complete Markdown audit demo.

The JSON report schema is documented in docs/report_schema.md.

Regression Corpus

The lightweight public regression corpus lives in examples/regression_corpus.jsonl. It currently contains 220 offline cases covering entity swaps, multi-word capital handling, all-bad abstention, numeric drift, unit drift, role swaps, shared-subject relation repair, unsupported negation, historical-date drift, supported paraphrase, and overclaim blocking.

Regenerate the corpus:

uv run python examples/build_regression_corpus.py

Run the tests:

uv run --with pytest python -m pytest -q

Experiment Lineage

The full EXP01-EXP20 record is in MBT5_EXP01_EXP20_TECHNICAL_LEDGER.md. The expanded CSV experiment exports live in data/csv_exports/.

Key frozen output artifacts:

data/csv_exports/mbt5_exp20_master_candidate_ledger.csv
data/csv_exports/mbt5_exp20_master_case_ledger.csv
data/csv_exports/mbt5_exp20_summary_metrics.csv
data/csv_exports/mbt5_exp20_case_summary.csv
data/csv_exports/mbt5_exp20_clamp_counts.csv
data/csv_exports/mbt5_exp20_failure_table.csv
data/csv_exports/mbt5_exp20_patch_lineage.csv

Project Layout

mbt_ai_tools/
  mbt/
    embeddings.py      SentenceTransformer loader
    geometry.py        geometric median, shock, distance
    stability.py       self-consistency / entropy scoring
    tokens.py          leave-one-out token shock
    consensus.py       multi-agent / council logic
    regulator.py       v11 candidate regulator
  cli.py               mbt-check command
.github/workflows/
  tests.yml            GitHub Actions offline regression test workflow
CHANGELOG.md          release notes
CLAIMS.md             scoped public claims register
data/csv_exports/     expanded EXP01-EXP20 CSV exports
docs/
  report_schema.md
examples/
  batch_input.jsonl
  build_regression_corpus.py
  cli_json_report.md
  markdown_audit_report.md
  regression_corpus.jsonl
tests/
  test_regulator.py
  test_regression_corpus.py
REPLICATION.md        local/GitHub/Colab replication instructions
pyproject.toml
README.md
LICENSE

License

See LICENSE.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MBT-5 Geometry-Only Output Regulator

Core Claim

Claims and Scope

Current Locked Result

What MBT-5 Checks

Core Mechanisms

Semantic Shock

Literal Drift Guards

Relation Clamps

Negation Clamp

Abstention

Installation

Python Usage

CLI Usage

Regression Corpus

Experiment Lineage

Project Layout

License

About

Uh oh!

Releases 2

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
.github/workflows		.github/workflows
data/csv_exports		data/csv_exports
docs		docs
examples		examples
mbt_ai_tools		mbt_ai_tools
tests		tests
.gitignore		.gitignore
Basic testing		Basic testing
CHANGELOG.md		CHANGELOG.md
CLAIMS.md		CLAIMS.md
Colabs demo regulator 1.0		Colabs demo regulator 1.0
Inference-time confidence hallucination detection using geometry and self-consistency		Inference-time confidence hallucination detection using geometry and self-consistency
LICENSE		LICENSE
MBT5_EXP01_EXP20_TECHNICAL_LEDGER.md		MBT5_EXP01_EXP20_TECHNICAL_LEDGER.md
README.md		README.md
REPLICATION.md		REPLICATION.md
pyproject.toml		pyproject.toml
testing .ipynb		testing .ipynb

Folders and files

Latest commit

History

Repository files navigation

MBT-5 Geometry-Only Output Regulator

Core Claim

Claims and Scope

Current Locked Result

What MBT-5 Checks

Core Mechanisms

Semantic Shock

Literal Drift Guards

Relation Clamps

Negation Clamp

Abstention

Installation

Python Usage

CLI Usage

Regression Corpus

Experiment Lineage

Project Layout

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 2

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages