Release
More reliable proof packs and calmer long runs
Proof packs gain a deterministic bash test suite and better runtime helpers, window selection becomes deterministic and offline, and perplexity runs handle out-of-range token IDs safely.
Release: InvarLock 0.3.5 — Deterministic windows, safer perplexity evaluation
Highlights
- Proof pack bash test suite + runtime helpers for capturing artifacts during long runs.
- WikiText-2 window stratification switches to a deterministic offline byte-level n-gram scorer.
- Perplexity sanitizes out-of-range token IDs instead of crashing device-side.
0.3.5 is a “trust the machinery” release. Proof packs now have their own bash test suite with deterministic mocks, which is exactly the kind of unglamorous work that prevents subtle breakage later. Runtime helpers also make it easier to capture the right artifacts during long runs without falling back on ad-hoc scripts.
On evaluation stability: window stratification for WikiText-2 moves to a deterministic offline byte-level n-gram scorer. That keeps window selection consistent across model families and avoids implicit downloads, both of which matter when you need runs to be comparable.
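To make the idea concrete, here is a minimal sketch of what a deterministic, offline byte-level n-gram scorer could look like. It is not InvarLock's implementation: the function names, the trigram size, the add-one smoothing, and the equal-sized strata are all assumptions chosen for illustration. The point it shows is that scoring raw bytes needs no tokenizer and no downloaded model, so the same text always lands in the same stratum.

```python
import math
from collections import Counter


def byte_ngram_counts(text: str, n: int = 3) -> Counter:
    """Count overlapping byte n-grams of the UTF-8 encoding of `text`."""
    data = text.encode("utf-8")
    return Counter(data[i:i + n] for i in range(len(data) - n + 1))


def build_reference(corpus, n: int = 3) -> Counter:
    """Aggregate byte n-gram counts over a local corpus (hypothetical helper)."""
    total = Counter()
    for doc in corpus:
        total.update(byte_ngram_counts(doc, n))
    return total


def difficulty_score(window: str, reference: Counter, n: int = 3) -> float:
    """Mean negative log-probability of the window's n-grams.

    Probabilities come from add-one smoothed reference counts, so unseen
    n-grams get a small but non-zero probability. Higher means rarer text.
    """
    grams = byte_ngram_counts(window, n)
    num = sum(grams.values())
    if num == 0:
        return 0.0  # window shorter than n bytes
    denom = sum(reference.values()) + 256 ** n
    nll = -sum(c * math.log((reference[g] + 1) / denom) for g, c in grams.items())
    return nll / num


def stratify(windows, reference: Counter, n_strata: int = 3, n: int = 3):
    """Split window indices into `n_strata` groups ordered by score.

    Ties are broken by the original index, so the result is fully
    determined by the inputs.
    """
    ranked = sorted(
        range(len(windows)),
        key=lambda i: (difficulty_score(windows[i], reference, n), i),
    )
    strata = [[] for _ in range(n_strata)]
    for rank, idx in enumerate(ranked):
        strata[rank * n_strata // len(ranked)].append(idx)
    return strata
```

Calling `stratify(windows, build_reference(corpus))` twice on the same inputs returns identical strata, regardless of which model is later evaluated on those windows.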
And for anyone who’s had a run crash on out-of-range token IDs: perplexity evaluation is now more defensive, masking those IDs rather than triggering device-side asserts.
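The masking approach can be sketched in a few lines of plain Python. This is an illustration under stated assumptions, not the code shipped in 0.3.5: `sanitize_labels`, `masked_perplexity`, and the `-100` sentinel are hypothetical names and values, and a real evaluation would apply the same idea to tensors. Out-of-range IDs are replaced with a sentinel before any lookup happens, and positions carrying the sentinel are excluded from the average.

```python
import math

IGNORE_INDEX = -100  # hypothetical sentinel for "skip this position"


def sanitize_labels(token_ids, vocab_size, ignore_index=IGNORE_INDEX):
    """Replace any ID outside [0, vocab_size) with the ignore sentinel."""
    return [t if 0 <= t < vocab_size else ignore_index for t in token_ids]


def masked_perplexity(log_probs, labels, ignore_index=IGNORE_INDEX):
    """Perplexity over positions whose label is not the ignore sentinel.

    `log_probs[i]` is a list of log-probabilities over the vocabulary at
    position i. Returns NaN if every position was masked.
    """
    total_nll = 0.0
    count = 0
    for row, label in zip(log_probs, labels):
        if label == ignore_index:
            continue  # masked position: never used as an index
        total_nll -= row[label]
        count += 1
    if count == 0:
        return float("nan")
    return math.exp(total_nll / count)
```

For example, with a vocabulary of 4 tokens, `sanitize_labels([0, 7, -3, 3], vocab_size=4)` masks the second and third positions, and `masked_perplexity` then averages over the remaining two instead of indexing past the end of a row.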
For more details, see CHANGELOG.md.