Evidence pack showcase coverage and reproducible CI
Evidence packs add new showcase and evidence artifacts, while CI and release flows become more deterministic and easier to validate repeatedly.
Release: InvarLock 0.3.10 - Guard showcases, VE sidecars, and deterministic verification
Highlights
- Evidence packs add a guard showcase suite, expanded scenario filtering/overrides, and new evidence artifacts like verdict table generation plus a VE probe sidecar.
- CI and release verification are tighter and more reproducible, with deterministic
verify-full, Python 3.12 smoke coverage, and scheduled weekly verification. - Mixture-of-Experts and variance/VE paths are hardened for broader model compatibility, with additional reliability fixes across evidence-pack packaging and evaluation handling.
0.3.10 grows the practical surface area of evidence-pack validation while tightening failure handling and evidence quality. The new guard showcase suite, scenario filtering/errors-only mode, suite-scoped scenarios, and model override support make it easier to run targeted evidence checks without turning every evidence-pack exercise into a full sweep.
The evidence surface gets richer too: verdict table generation, a VE ve_probe sidecar, intervention summaries, and additional RMT/spectral/variance showcase injections give reviewers more concrete artifacts to inspect. At the same time, verdict/task failures fail closed more consistently.
CI and release verification become more reproducible through deterministic verify-full, Python 3.12 smoke coverage, scheduled weekly verification, on-demand docs CI, and dependency preflight hardening for net-enabled installs. The model path broadens through Mixture-of-Experts compatibility work, including fused expert weight layouts, broader VE layer discovery, Mixtral block_sparse_moe support, and safer behavior when reports exist but evaluation exits nonzero.
For the immutable release record, read the tagged CHANGELOG.md for v0.3.10.
More in Release
Continue through nearby posts in the same reading thread.
Release
Quantization, spectral, and report-schema hardening
A focused hardening release: safer AWQ plugin discovery, stronger quantization clipping behavior, and broader report-schema acceptance for edge payloads.
Release
Report rename cleanup and offline evidence-pack hardening
A stability-focused release: cleaner report output, safer offline evidence-pack flows, and CI/test hardening after the report rename.
Release
Coverage floors and fail-closed CLI/reporting paths
Split-module coverage thresholds now protect critical CLI/reporting paths while config, plugin, report, overhead, and observability edge cases fail closed more reliably.