Release
Fail-closed baseline pairing with lower-memory retries
CI/release baseline pairing is fail-closed (pairing evidence is required), and adapters reduce peak memory during retries via chunked snapshot/restore.
Release: InvarLock 0.3.4 - Required pairing evidence and chunked adapter snapshots
Highlights
- CI/Release baseline pairing now fails closed with strict parity checks.
- Report generation requires paired-windows evidence (non-perfect pairing is rejected).
- Chunked snapshot/restore and proof-pack workflow helpers reduce retry and long-run friction.
This release makes baseline pairing a checked contract instead of a convention. If a CI/release run says it is paired to a baseline, it now has to carry valid evaluation_windows evidence, and the system enforces dataset, tokenizer, and masking parity instead of letting ambiguous comparisons slip through.
Report generation is equally strict: paired_windows evidence is required, and non-perfect pairing is rejected. That gives reviewers a clearer answer to the most important question: when a report says it is paired, it really is paired.
On the practical side, chunked HF adapter snapshot/restore helps reduce peak memory during retries, and the new proof-pack workflow helpers (run_suite, scheduler/queue utilities, and model creation tooling) make longer evidence runs easier to operate without weakening the pairing rules.
For the immutable release record, read the tagged CHANGELOG.md for v0.3.4.
More from the blog
Continue through recent releases and implementation notes.
Synthesis
The Minimum Evidence Surface for Trustworthy Weight-Edit Results
A trustworthy weight-edit result needs more than a benchmark delta. It needs a bounded claim, an exactly paired comparison, and verification that rejects incomplete evidence.
Release
Evidence packs and explicit runtime provenance
InvarLock 0.8.0 moves the public bundle surface to evidence packs, pins docs to versioned release paths, and makes container-vs-host runtime provenance explicit across evaluate and verify.
Research Note
Fail-Closed Verification for Weight-Edit Evaluation
A verifier is only useful if it rejects incomplete evidence. InvarLock's verification path is designed to stop stronger claims when the evidence bundle is missing or inconsistent.