Back to blog

Release

More reliable proof packs and calmer long runs

Ink/charcoal doodle: a proof-pack folder carries a stable test checklist.

Proof packs gain a deterministic bash test suite and better runtime helpers, window selection becomes stable/offline, and perplexity runs get safer around bad token IDs.

1 min read
InvarLock Team

Release: InvarLock 0.3.5 — Deterministic windows, safer perplexity evaluation

Highlights

  • Proof pack bash test suite + runtime helpers for capturing artifacts during long runs.
  • WikiText-2 window stratification switches to a deterministic offline byte-level n-gram scorer.
  • Perplexity sanitizes out-of-range token IDs instead of crashing device-side.

0.3.5 is a “trust the machinery” release. Proof packs now have their own bash test suite with deterministic mocks, which is exactly the kind of unglamorous work that prevents subtle breakage later. Runtime helpers also make it easier to capture the right artifacts during long runs without improvising ad-hoc scripts.

On evaluation stability: window stratification for WikiText-2 moves to a deterministic offline scorer. That keeps window selection consistent across model families and avoids implicit downloads—two things that matter a lot when you’re trying to make runs comparable.

And for anyone who’s had a run explode due to weird token IDs: perplexity evaluation is now more defensive, masking out-of-range IDs rather than triggering asserts.

For more details, see CHANGELOG.md.

More from the blog

Continue through recent releases and implementation notes.