Every claim Z-15 makes is classed here. The machine-readable mirror is data/z15/claims.jsonl in the public repo. Contradict a row by filing a falsification (see truth engine).
| Claim | Status | Source | Channels |
|---|---|---|---|
| Z-15 has never been independently benchmarked. | public-safe | operator statement | site / github / x / hn / email / nda |
| Z-15 proof engine is under construction. | public-safe | git log on this repo | site / github / x / hn / email / nda |
| Benchmark and falsification are invited. | public-safe | /z15/truth-engine | site / github / x / hn / email / nda |
| 59.9% token reduction (micro-benchmark). | simulated | z15/memo/z15_results.md, March 2026 | nda |
| 2.49x RU-ROI (micro-benchmark). | simulated | z15/memo/z15_results.md, March 2026 | nda |
| 40-75% compute reduction at scale. | unproven | historical projection only | nda only when marked unproven |
| 38-42% runtime token reduction at scale. | unproven | historical projection only | nda only when marked unproven |
| Crossover near 1.9B parameters. | needs replication | extrapolation only | none until reproduced |
| Every Z-15 proof packet's BUNDLE_SHA256 verifies against the artefact set. | public-safe | z15_engine/trace.py; tests/test_trace_no_secret_leak.py | site / github / x / hn / email / nda |
| The Z-15 gap-finder requires two consecutive empty searches before declaring a branch closed. | public-safe | z15_engine/gap_finder.py; tests/test_gap_finder.py | site / github / x / hn / email / nda |
| All Z-15 model calls and GPU rentals default to dry-run; live mode requires explicit env flag plus in-session operator approval. | public-safe | z15_engine/model_router.py; ops/scripts/z15-gpu-budget-check.sh | site / github / x / hn / email / nda |
| Frontier-scale superiority. | do-not-use | no measurement exists | forbidden everywhere |
public-safe may appear on the site, GitHub, X, HN, email, NDA.
simulated exists as a measurement on toy/synthetic data; not asserted as frontier proof.
unproven historical projection or extrapolation; not a measurement.
needs replication requires an independent re-run before publication.
do-not-use never appears in public artefacts.