Audience: Security architects, procurement technical reviewers, platform engineers evaluating deployable inference artifacts.
Length: This document is intentionally short; it states what the product is, what it guarantees, what it does not promise, and the design stance that errors must be visible—not smoothed over.
COELANOX is an inference packaging and runtime system that turns supported source models into a single .cnox binary container: serialized graph (Universal IR), weights (optionally compressed), optional ahead-of-time code (CLF blobs / execution plans), and a SHA-256 integrity hash over the artifact. A single coelanox CLI supports package, verify (integrity without executing), and run (verify-then-execute by default).
Execution is not defined by ONNX at runtime. Universal IR plus a versioned IR semantics contract is the behavioral contract; ONNX is one import path. See RUNTIME_SPECIFICATION.md and IR semantics.
| Guarantee | Meaning |
|---|---|
| Artifact integrity | A SHA-256 hash covers the container content (with the hash field zeroed for calculation). coelanox verify and coelanox run (unless --no-verify) detect tampering or corruption. |
| Semantics versioning | Containers carry ir_semantics_version. Loaders reject unknown or incompatible versions with an explicit error—no silent execution across incompatible toolchains. |
| Explicit failure on unknown graph ops at package time | If translation leaves Custom("OnnxOpName") nodes, coelanox package fails until the graph is expressible in IR or the translator is extended. The product does not “best effort” ship undefined ops. |
| Configurable bounds | Runtime configuration can cap container size, input size, workspace, execution time, and path policy (see Operations). |
| Optional provenance signing | Ed25519 signing at package time; verification with a trusted public key; optional --require-signature on consume paths. |
These are software guarantees around integrity, versioning, and policy—not a certification of model correctness or fairness.
| Non-goal | Detail |
|---|---|
| Training | COELANOX is inference-only. It does not train or fine-tune models. |
| Full ONNX parity | Only what the in-tree translator lowers to Universal IR is supported. Unsupported ONNX ops become Custom and block packaging until addressed. |
| Bit-identical outputs across all backends | Scalar, SIMD/CLF plan, and future paths may differ within floating-point tolerance. See RUNTIME_SPECIFICATION.md. |
| Generic quantized production path | Quantized ONNX paths are not a first-class, documented guarantee today. See KNOWN_LIMITATIONS.md. |
| Compliance program | COELANOX strengthens artifact integrity and technical evidence (audit logs, fingerprints); it is not a substitute for organizational compliance (e.g. EU AI Act) or formal certification. |
| Managed cloud / HTTP API | The shipped coelanox serve mode is stdin/stdout framed IPC for one loaded container—not a full HTTP/gRPC product. See Operations. |
| Source | Notes |
|---|---|
| ONNX | Opset 13 translator in-tree. Coverage is not all of ONNX; see ONNX_SUPPORTED_OPS.md and the full decomposition tree. |
| BERT bundle | Custom bundle format (e.g. bert_base_uncased style) via translator plugin. |
| ResNet-tiny-MNIST demo | Demo bundle for CV path; not a general PyTorch/TensorFlow importer. |
Roadmap (not commitments): broader framework import, more ops, vendor backends—see Reference and project roadmaps.
| Aspect | Today |
|---|---|
| Primary execution | CPU. Scalar fallback is always available and runs wherever the runtime binary runs. |
| SIMD / CLF | x86_64 CLF blobs and SIMD execution paths are the expected fast path when packaged and discovered; discovery uses ~/.coelanox/clf/ layout by default (see Operations). |
| Portability | --fallback-only packages contain no embedded native code; execution uses the scalar path for maximum portability at the cost of performance. |
| Other ISAs | Not a guaranteed parity target in this brief; treat as project-specific until documented per release. |
COELANOX is intentionally strict at boundaries:
Custom ops → package fails until fixed or translated.The goal is observable failure and auditability, not hiding errors behind ambiguous behaviour.