Context & Compression
If you arrived here from an older link, here’s what’s true now:
- Every Ishvana agent feature runs on local handlers with deterministic inputs and outputs. There’s no token budget to compress around because there’s no token billing.
- Long-running analyses (manuscript scans, full-project consistency checks, Lore ML training) handle scale through streaming progress chunks, not through context pruning. See Etherforce overview → Streaming results.
- Per-content-hash caching means re-running the same analysis on unchanged content is free, regardless of size. See Etherforce overview → Handler-result cache.
Etherforce overview The dispatch layer that replaced the external-LLM routing layer.
Engine overview How the Divinity Engine fits with the rest of the backend.
This page stays here as a redirect. The conceptual content from the original Context & Compression page has been retired because the system it described is no longer in the product.