Skip to content

QuantumDrizzy/Blaze

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Blaze — Tensor-Network compression of high-order data

Lossy TT/MPS compression + approximate reconstruction for data with genuine tensor-train structure — primarily quantum states (Cirq) and TT-native scientific tensors — with honest compressibility diagnostics and a load-bearing quantum layer (the MPS ↔ quantum-circuit correspondence).

Honest scope (the Phase-1 fair benchmark settled this): TT only beats a fair matrix SVD on genuinely TT-native data — low rank across every cut, structure spread over modes (a low-bond MPS; a low-entanglement quantum state, guaranteed by physics). On generic "structured" data (smooth fields, hyperspectral-like volumes) a well-chosen single matrix SVD competes at equal parameters — "high-order" or "smooth" is not enough; generic high-entropy data doesn't compress at all ([KNOWN_LIMIT]). Blaze is not a general compressor. Its real, usable value: (1) quantum-state compression + fidelity/sampling, (2) honest diagnostics that tell you whether TT fits your data, (3) fast TT where it does, (4) operations in the compressed representation — inner product, fidelity, distance and exact similarity search (TTIndex) computed on the TT directly, without decompressing (Phase 7, O(nχ³) vs O(2ⁿ); see docs/PHASE7-results.md), (5) a second, composable quantization stage — int8/4-bit core codes on top of the TT rank truncation, error measured and composed (Phase 8: TFIM paramagnet 77× → 447× at fidelity 0.9999; see docs/PHASE8-results.md). Lossy; compared against the best matrix SVD at matched params, never against lossless Zstd.

Architecture, build sequence, and the honesty contract: docs/ADR-0001.

Stack (each language where it fits)

Rust ingestion · CLI · persistence · C++/CUDA TT kernels (hot core) · Python TT algorithms + benchmarks · Cirq MPS↔circuit bridge

Status — 8 phases complete (measured on RTX 5060 Ti, sm_120)

Every number below is from a reproducible gate in tests/ and written up in docs/. Honest baselines, documented [KNOWN_LIMIT]s, no overclaim — the contract of docs/ADR-0001 and docs/ADR-0002.

# Phase Headline result (measured) Detail
1 Honest classical benchmark TT beats a fair matrix SVD only on TT-native data; generic "structured"/high-entropy data does not compress ([KNOWN_LIMIT], documented not hidden) PHASE2 §0
2 Rust core blaze-core 1:1 parity with the Python reference (CPU, faer/nalgebra); same ranks, rel_error within fp lib.rs
3 CUDA SVD offload (cuSOLVER) ~2.9× (f64) / ~3.8× (c64) above the n≈24 crossover; below it the CPU wins (transfer/launch-bound — stated, not hidden) PHASE3
4 Approximate reconstruction monotone error↔rank dial; .blz roundtrip 75× on a TFIM paramagnet at fidelity 1.000000000000 PHASE4
5 MPS → circuit synthesis sequential state-prep (Schön 2005) reproduces the MPS at fidelity 1.0 (GHZ, product, physical TFIM); ancilla disentangles PHASE5
6 Rust CLI + .blz persistence blaze compress / reconstruct end-to-end, c64 verified on disk blz-format
7 Ops in compressed space inner/fidelity/distance/TTIndex via the MPS zipper, exact to 1e-15, O(nχ³); runs at n=40 (dense 2⁴⁰ = 17.6 TB, impossible) in 3.6 ms; TFIM search recovers the quantum phase transition PHASE7
8 Core quantization int8/4-bit codes on top of TT, error composed and measured; TFIM paramagnet 76.6× → 447× at fidelity 0.99994 (int8), up to 705× at 4-bit PHASE8

Flagship validation — real physics (SUBSTRATE). TFIM ground states (n=16) compress 19× (critical) → 77× (paramagnet) losslessly to ~1e-7, with four independent physics cross-checks (Page value, entropy monotonicity, TT rank = Schmidt rank, honest decline). The Haar-random control does not compress (0.38× — TT larger than dense); Blaze reports this instead of hiding it.

Benchmarks, visualized

Every figure is regenerated live from real runs — python docs/plots.py (no hardcoded numbers: TFIM states via quimb, compressed / quantized / overlapped by blaze itself), on the RTX 5060 Ti (sm_120).

Ground-state fidelity reveals the quantum phase transition — each overlap computed in compressed space (Phase 7), never decompressing a 2¹⁶ statevector. The ordered phase (h<1) is nearly orthogonal to the paramagnet (h>1); the sharp drop is the QPT:

TFIM ground-state fidelity matrix — the quantum phase transition

Compression tracks physical entanglement. Area-law TFIM ground states compress 19–77×; the Haar volume-law control falls below break-even (0.38×) and Blaze declines instead of hiding it:

compression ratio vs entanglement entropy

The lossy dial is monotone (Phase 4, left) and the second quantization stage buys ratio for a measured fidelity cost (Phase 8, right):

error vs rank quantization tradeoff

Overlaps run on the TT directlyO(nχ³), not O(2ⁿ). The zipper still answers at n=40, where a dense statevector (17.6 TB) cannot even exist (Phase 7):

overlap scaling: zipper vs dense

Quickstart

pip install -e '.[quantum]'                              # numpy/scipy + cirq/quimb
pytest -q                                                # 33 gates

python -m blaze.examples.substrate_quantum_state         # flagship: 19–77× on TFIM
python -m blaze.examples.quantum_similarity_search       # Phase 7: search in compressed space
python -m blaze.examples.quantize_sweep                  # Phase 8: TT × quantization, with fidelity
import numpy as np, blaze
psi = np.zeros(2**12, dtype=complex); psi[0] = psi[-1] = 2**-0.5   # a GHZ state
tt  = blaze.compress(psi.reshape((2,)*12), rel_tol=1e-10)          # TT/MPS
q   = blaze.quantize_tt(tt, bits=8)                                # second-stage codes
print(blaze.fidelity(tt, q.dequantize()))                         # ~1.0, no decompression

Stack status

Python reference — all 8 phases · Rust blaze-core — Phases 2–6 (parity-gated) · C++/CUDA — Phase 3 SVD offload (cuSOLVER, f64+c64) · Cirq — Phase 5 bridge. Phase 7–8 are the NumPy reference; their Rust port + CUDA batched overlap are the documented next steps (ADR-0002).

About

Tensor-Train / MPS compression for high-order scientific & quantum-state data — a specialist (not general) compressor with an honest compressibility diagnostic, GPU SVD, and an MPS↔circuit bridge.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors