Wire.Band
Semantic Compression Middleware for Any Structured Data Domain
The Problem
Modern infrastructure is hemorrhaging bandwidth on redundant structured data. IoT fleets send thousands of near-identical sensor readings per hour. AI agent pipelines shuffle large payloads between services. DeFi protocols emit repetitive transaction events at millisecond intervals. In every case, the underlying semantic content is highly repetitive — the same field names, the same value ranges, the same patterns — yet generic tools like gzip and Protocol Buffers either fail on small payloads or require fragile schema negotiation. The result: over-provisioned bandwidth, bloated infrastructure costs, and AI communication that bottlenecks on serialization overhead.
The Solution
Wire.Band is **semantic compression middleware** that encodes structured data using a shared vocabulary of meaning rather than compressing bytes. It replaces verbose JSON field names and string values with compact 2-byte semantic symbols drawn from the Theta Language — a frozen ontology spanning 650 constants across 30 domain families. Sequential payloads are delta-filtered; known patterns hit a semantic cache. The result is up to 60x compression with sub-millisecond latency, zero schema negotiation, and 100% lossless round-trip fidelity.
Encode meaning, not bytes. 60x compression. 0.11ms latency. Zero schema negotiation. Transparent middleware — plain JSON in, plain JSON out.
Core Innovation
Wire.Band's differentiation is the **Theta Language Spec v1.2.0 (frozen)** — a pre-shared semantic ontology that eliminates the overhead of per-connection schema negotiation:
| Layer | Mechanism | Effect |
| Symbol encoding | 2-byte Theta symbol replaces field name + type string | 60–90% payload reduction per field |
| Delta filtering | Unchanged fields stripped across sequential readings | Near-zero cost for high-frequency sensor data |
| Semantic cache | SHA-256 fingerprint; known patterns skip re-encoding | Sub-millisecond cache-hit path |
| Frame encryption | AES-256-GCM / ChaCha20-Poly1305 + symbol permutation + HKDF per-frame keys | Zero-trust transport without key handshake overhead |
Market Opportunity
| Market Segment | Size (2026) | Growth Rate | Application |
| IoT Infrastructure | $18B | 22% CAGR | Edge bandwidth optimization, sensor fleet compression |
| AI/ML Infrastructure | $35B | 40% CAGR | Agent-to-agent communication, embedding pipelines |
| Industrial Automation | $22B | 15% CAGR | Factory floor data, predictive maintenance, supply chain |
| DeFi & Web3 Infrastructure | $12B | 30% CAGR | On-chain event compression, oracle data feeds |
| Total Addressable Market | $87B |
Business Model
Three independent revenue streams:
| Stream | Model | Customer | 2027 Target |
| Infrastructure Licensing | Per-node or per-GB | Enterprise IoT, AI platforms | $1.2M ARR |
| Semantic Intelligence | API subscription by domain family | Quant funds, research, government | $2.5M ARR |
| Silicon Roadmap | ISA licensing or chip sales | OEM device manufacturers | TBD |
The public edge clients (PyPI, crates.io) drive adoption; the private server is the monetized core. The intelligence product is architecturally separated from tenant data — it runs against public sources and explicit opt-ins only.
Technical Architecture
Three-package mono-repo:
| Package | Version | Distribution | Description |
| `wire-band` | 0.1.0 | Private | FastAPI server — semantic cache, multi-tenancy, MQTT bridge, SSE stream, crypto stack |
| `wire-band-edge` | 0.3.0 | Public (PyPI) | Python edge client — ring buffer, async flush, MQTT connector, zero private deps |
| `wireband-edge` | 0.4.0 | Public (crates.io) | Rust edge client — MQTT, BLE, CoAP, Modbus, serial, ONNX/TFLite inference, PyO3 bindings |
**Theta Language Spec v1.2.0 (frozen)** — ~650 symbol constants:
- >Primary families (1-byte): SYS, NOUS, ERGON, WORK, SWARM, IDENTITY, GOVERNANCE, AUTHORITY, THETA, HIVEMIND, INGEST, EMIT, TRANSFORM, ORACLE, KAPPA, PSI
- >Extended families (2-byte): AI, DEFI, LEGAL, GEOGRAPHIC, HARDWARE, NETWORK, SECURITY, STORAGE, METRICS, TEMPORAL, UI, PROTOCOL, EXT_THETA, PHRONESIS
**Binary frame format:** magic header `0xAE05`, 2-byte symbol, optional AEAD-encrypted payload, optional symbol permutation layer, HKDF-SHA256 per-frame key derivation.
Crypto stack (three stackable independent layers):
- >`SymbolRemapper` — Fisher-Yates keyed bijection of all 65,536 symbol values
- >`ThetaEnvelopeCipher` — AES-256-GCM or ChaCha20-Poly1305 AEAD; symbol bytes as AAD
- >`ContextualKeyDeriver` — HKDF-SHA256 per-frame key from (symbol, time-window); forward secrecy with no handshake
Technical Differentiation
| Capability | Wire.Band | gzip/zstd | Protocol Buffers | IoT protocols (CBOR/LwM2M) |
| Compression on single small payloads | 60x | Poor (< 1.2x) | 3–5x | 2–4x |
| Delta filtering across time | Yes | No | No | Partial |
| Schema negotiation required | No (pre-shared ontology) | No | Yes | Yes |
| Frame-level encryption | 3-layer AEAD | No | No | Optional |
| Rust embedded (no-std) | Yes | Platform-dependent | Generated code | Partial |
| Multi-protocol connectors | MQTT, BLE, CoAP, Modbus, serial | N/A | N/A | Protocol-specific |
| Semantic intelligence byproduct | Yes | No | No | No |
Traction & Milestones
- >Now: Server deployed on production VPS with Caddy reverse proxy; both public packages live (PyPI v0.3.0, crates.io v0.4.0)
- >Test coverage: 331 tests across unit (275), E2E (30), and stress (26) suites
- >Theta Spec: v1.2.0 frozen — backward compatibility guaranteed; 650 symbol constants stable
- >Partnership pipeline: Active conversations with IoT hardware vendors for embedded Rust integration
- >Q2 2026: First reference customer case study with quantified bandwidth metrics
- >Q3 2026: Semantic Intelligence product alpha — public data sources, domain-family API
- >Q4 2026: Enterprise licensing pipeline; OEM partner program launch
- >2027: Silicon roadmap — native Theta encoding ISA licensing to device OEMs
Competitive Advantages
1. **Frozen pre-shared ontology**: No schema negotiation, no versioning churn — the Theta Language Spec is a solved problem clients embed once
2. **Cross-time compression**: Delta filtering across sequential payloads captures redundancy that byte-pattern compressors fundamentally cannot
3. **Multi-protocol Rust client**: Single library covers MQTT, BLE, CoAP, Modbus, and serial — rare in embedded systems compression
4. **Intelligence product moat**: The semantic cache accumulates a cross-domain pattern dataset that cannot be retroactively constructed; first-mover accumulation is the moat
5. **Open edge / closed core**: Public edge clients drive organic developer adoption; private server and proprietary ontology protect the business
6. **Zero-trust frame encryption**: Three independent stackable crypto layers with no handshake overhead — purpose-built for constrained edge devices
Why Now
- >AI agent communication: As autonomous agents multiply, efficient structured messaging becomes infrastructure, not optimization
- >Edge compute maturity: Rust on embedded is production-ready; no-std Rust compression middleware is now viable at scale
- >Bandwidth cost sensitivity: Cellular and satellite IoT deployments at scale face genuine bandwidth economics pressure
- >Regulatory tailwinds: GDPR and EU data governance frameworks create demand for privacy-preserving intelligence products (semantic fingerprints, not raw content)
- >Ontology timing: The Theta Language was designed for correctness, not surveillance — this is the strongest legal and ethical foundation for a data product built on top of it
The Ask
Building the semantic layer for structured data infrastructure.
Wire.Band is positioned at the intersection of three large markets — IoT, AI/ML infrastructure, and data intelligence — with a technically defensible moat (frozen ontology, proprietary TTL policy, intelligence dataset accumulation) and a clear path from open-source edge adoption to enterprise licensing to a silicon play.
**Opportunity**: Capture the semantic compression layer before AI agent communication and IoT edge infrastructure consolidates around verbose, inefficient legacy formats.
**Rising Sun** · risingsun.name · March 2026