A
Architecture-aware dispatch
- Detection
- Per-host detection of memory topology (unified / single-GPU / multi-GPU) and deployment shape (DMG / Container / HostNative). Pluggable
Architecture + Deployment protocols.
- Dispatch
- Tick-based parallel DAG dispatcher: each tick asks the detected arch which steps to run concurrently vs defer. Falls back to single-step-per-tick on workflows without
depends_on.
B
Per-step keep_alive + telemetry
- Resolution
- Four-tier resolver: step config > workflow default > arch-detected default >
OLLAMA_KEEP_ALIVE. Defaults are 30m on CPU (reload dominates), 0 on single-GPU NVIDIA (VRAM is scarce).
- Observability
- Ollama load / prompt-eval / eval durations captured per workflow step. Live snapshot at
/api/system/architecture.
C
Installable Python package
- Surface
- Wheel + sdist attached to every release. Console scripts
enclave, enclave-api, enclave-chat, enclave-workflow.
- Install
pip install <wheel-url> — no PyPI account required.
D
GHCR mirror, tarball, n8n
- Container
- Every Docker Hub push mirrored to
ghcr.io/hankthebldr/enclave. Linux source tarball with SHA-256 + SHA-512 sidecars on every stable release.
- Automation
- Importable n8n workflow under
workflows/n8n/ drives release prep through a local Ollama instance — no cloud LLM calls.