RX Data Store

RX is an embedded data store for JSON-shaped data. Encode once, then query the encoded document in place — no parsing, no object graph, no GC pressure. Think of it as no-SQL SQLite: unstructured data with database-style random access.

[ { "color": "red", "fruits": [ "apple", "cherry" ] },
  { "color": "yellow", "fruits": [ "apple", "banana" ] } ]

The encoder deduplicates automatically using pointers: ^z reuses "apple", and ^h reuses the shared key layout. The encoded form is queryable as-is, with no parsing step, just direct reads from the buffer.

banana,6apple,5;ffruits,6yellow,6color,5:Echerry,6^z;ared,3^h:j;_

Benchmarked on a real 92 MB deployment manifest with 35,000 route keys:

	JSON	RX
Size	92 MB	5.1 MB
Look up one route	69 ms (full parse)	0.003 ms (~16 index hops)
Heap allocations	2,598,384	~10

When to use RX

RX sits in a specific gap: your data is too large for JSON's parse-everything model, but too unstructured for SQLite or Protobuf.

Good fits:

Build manifests, route tables, deployment artifacts — written once, read sparsely
Embedded datasets in browsers, edge runtimes, or worker processes
Any workflow where full-document parsing is the bottleneck

Bad fits:

Small documents where JSON parsing is already cheap
Human-authored config files
Write-heavy or mutable data (use a real database)
Minimizing compressed transfer size (gzip/zstd will beat RX)
Data that maps cleanly to tables (use SQLite) or a fixed schema (use Protobuf)

Typical workflow

A build or deploy step produces a large JSON-shaped artifact.
Encode it to RX once.
Runtimes read only the values they need — O(1) array access, O(log n) object key lookup.
When debugging at 3 AM, copy-paste the RX text into rx.run to inspect it. No binary tooling needed.
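
To make the O(log n) figure concrete, here is a minimal sketch of a binary search over sorted keys that counts its probes. It is a generic illustration, not RX's index layout; the `lookup` helper and the `/route/NNNNN` keys are hypothetical. With 35,000 keys the worst case is 16 probes, which lines up with the "~16 index hops" in the benchmark table.

```typescript
// Generic binary search over sorted keys, counting how many keys are probed.
// This is NOT RX's index format; it only illustrates the O(log n) bound.
function lookup(sortedKeys: string[], target: string): { index: number; probes: number } {
  let lo = 0;
  let hi = sortedKeys.length - 1;
  let probes = 0;
  while (lo <= hi) {
    const mid = (lo + hi) >>> 1;
    probes++;
    const key = sortedKeys[mid];
    if (key === target) return { index: mid, probes };
    if (key < target) lo = mid + 1;
    else hi = mid - 1;
  }
  return { index: -1, probes };
}

// 35,000 hypothetical route keys, zero-padded so string order is sorted order.
const keys = Array.from({ length: 35_000 }, (_, i) => `/route/${String(i).padStart(5, "0")}`);

// Worst case over every key in the set.
let worst = 0;
for (const k of keys) worst = Math.max(worst, lookup(keys, k).probes);
console.log(worst); // 16
```

A full JSON parse touches every byte of the document before the first lookup can happen; a sorted index touches a handful of keys.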

Install

npm install @creationix/rx     # library
npm install -g @creationix/rx  # CLI (global)
npx @creationix/rx data.rx     # CLI (one-off)

Quick start

Encode

import { stringify } from "@creationix/rx";

const rx = stringify({ users: ["alice", "bob"], version: 3 });
// Returns a string — store it anywhere you'd store JSON text

Decode

import { parse } from "@creationix/rx";

const data = parse(rx) as any;
data.users[0]         // "alice" (decoded on access, read directly from the encoded data)
data.version          // 3
Object.keys(data)     // ["users", "version"]
JSON.stringify(data)  // works — full JS interop

The returned value is a read-only Proxy. It supports property access, Object.keys(), Object.entries(), for...of, for...in, Array.isArray(), .map(), .filter(), .find(), .reduce(), spread, destructuring, and JSON.stringify(). Existing read paths usually work unchanged.

Uint8Array API

For performance-critical paths, skip the string conversion:

import { encode, open } from "@creationix/rx";

const buf = encode({ path: "/api/users", status: 200 });
const data = open(buf) as any;
data.path    // "/api/users"
data.status  // 200

stringify/parse work with strings. encode/open work with Uint8Array. Same options, same Proxy behavior.

A more realistic example

The quick start above is tiny — JSON would be fine for it. RX pays off on larger data with sparse reads. Here's a site manifest (see samples/ for full files):

// site-manifest.json — 15 routes, repeated structure, shared prefixes
{
  "routes": {
    "/": { "title": "Home", "component": "LandingPage", "auth": false },
    "/docs": { "title": "Documentation", "component": "DocsIndex", "auth": false },
    "/docs/getting-started": { "title": "Getting Started", "component": "DocsPage", "auth": false },
    "/dashboard": { "title": "Dashboard", "component": "Dashboard", "auth": true },
    "/dashboard/projects": { "title": "Projects", "component": "ProjectList", "auth": true },
    // ... 10 more routes
  }
}

import { readFileSync } from "fs";
import { parse } from "@creationix/rx";

// The RX file is already smaller on disk (shared schemas, deduplicated
// component names, chain-compressed "/docs/..." and "/dashboard/..." prefixes).
// But the real win is at read time:

const manifest = parse(readFileSync("site-manifest.rx", "utf-8")) as any;
const route = manifest.routes["/dashboard/projects"];
route.title      // "Projects"
route.component  // "ProjectList"
route.auth       // true
// Only these three values were decoded. Everything else was skipped.

At the scale of the benchmark above (35,000 route keys), the difference is 69 ms for a full JSON parse vs 0.003 ms for one RX lookup.

The samples/ directory has four datasets showing different access patterns — route manifests, RPG game state, emoji metadata, and sensor telemetry. Start with site-manifest and quest-log if you're evaluating the format.

Encoding options

stringify(data, {
  // Add sorted indexes to containers with >= N entries (enables O(log n) lookup)
  indexes: 10,       // default threshold; use 0 for all, false to disable

  // External refs — shared dictionary of known values
  refs: { R: ["/api/users", "/api/teams"] },

  // Streaming — receive chunks as they're produced
  onChunk: (chunk, offset) => process.stdout.write(chunk),
});

If the encoder used external refs, pass the same dictionary to the decoder:

const data = parse(payload, { refs: { R: ["/api/users", "/api/teams"] } });

CLI

rx data.rx                         # pretty-print as tree
rx data.rx -j                      # convert to JSON
rx data.json -r                    # convert to RX
cat data.rx | rx                   # read from stdin (auto-detect)
rx data.rx -s key 0 sub            # select a sub-value
rx data.rx -o out.json             # write to file
rx data.rx --ast                   # output encoding structure as JSON

Tip: Add a shell function for quick paged, colorized viewing:
p() { rx "$1" -t -c | less -RFX; }

Full CLI reference

Flag	Description
`<file>`	Input file (format auto-detected by contents)
`-`	Read from stdin explicitly
`-j`, `--json`	Output as JSON
`-r`, `--rexc`	Output as RX
`-t`, `--tree`	Output as tree (default on TTY)
`-a`, `--ast`	Output encoding structure
`-s`, `--select <seg>...`	Select a sub-value
`-w`, `--write`	Write converted file (`.json`↔`.rx`)
`-o`, `--out <path>`	Write to file instead of stdout
`-c`, `--color` / `--no-color`	Force or disable ANSI color
`--index-threshold <n>`	Index containers above n values (default: 16)
`--string-chain-threshold <n>`	Split strings longer than n into chains (default: 64)
`--string-chain-delimiter <s>`	Delimiter for string chains (default: `/`)
`--key-complexity-threshold <n>`	Max object complexity for dedupe keys (default: 100)

Shell completions:

rx --completions setup [zsh|bash]

Format

RX is a text encoding — not human-readable like JSON, but safe to copy-paste, embed in strings, and move through tools that choke on binary.

Every value is read right-to-left. The parser scans left past base64 digits to find a tag character, then uses the tag to interpret any body bytes further left:

[body][tag][b64 varint]
            ◄── read this way ──

Railroad diagram coming soon — see format spec for all diagrams.

JSON	RX	What you're reading
`42`	`+1k`	tag `+` (integer), b64 `1k` = 84, zigzag → 42
`"hi"`	`hi,2`	tag `,` (string), b64 `2` = byte length, body `hi` to the left
`true`	`'t`	tag `'` (ref), name `t` → built-in literal
`[1,2,3]`	`+6+4+2;6`	tag `;` (array), b64 `6` = content size, three children to the left
`{"a":1,"b":2}`	`+4b,1+2a,1:a`	tag `:` (object), b64 `a` = content size, interleaved keys/values

Tags: + integer · * decimal · , string · ' ref/literal · : object · ; array · ^ pointer · . chain · # index
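
The right-to-left rule is easier to see in code. Below is a toy reader for the four tags used in the table rows above (+ , ; :). It is a sketch, not the library's decoder: pointers, chains, indexes, decimals, and refs are not handled, and the digit alphabet is an assumption inferred from the examples (`1k` = 84, `a` = 10). The names `readValue` and `decode` are hypothetical.

```typescript
// Assumed base64 digit alphabet, inferred from the examples in the table.
const B64 = "0123456789abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ-_";

type Value = number | string | Value[] | { [key: string]: Value };

// Read the value whose right edge is at `right` (exclusive).
// Returns the decoded value and the offset of its left edge.
function readValue(buf: Uint8Array, right: number): { value: Value; left: number } {
  // 1. Scan left over base64 digits, accumulating the varint.
  let pos = right;
  let n = 0;
  let scale = 1;
  while (pos > 0) {
    const digit = B64.indexOf(String.fromCharCode(buf[pos - 1]));
    if (digit < 0) break; // not a digit, so this byte is the tag
    n += digit * scale;
    scale *= 64;
    pos--;
  }
  // 2. The byte just left of the digits is the tag.
  const tagAt = pos - 1;
  if (tagAt < 0) throw new Error("no tag found");
  const tag = String.fromCharCode(buf[tagAt]);

  // 3. The tag decides how to interpret the varint and any body to the left.
  switch (tag) {
    case "+": // integer: the varint is zigzag-encoded, no body
      return { value: n % 2 === 0 ? n / 2 : -(n + 1) / 2, left: tagAt };
    case ",": { // string: the varint is the byte length of the body
      const body = buf.subarray(tagAt - n, tagAt);
      return { value: new TextDecoder().decode(body), left: tagAt - n };
    }
    case ";": { // array: the varint is the content size, first child is rightmost
      const items: Value[] = [];
      const left = tagAt - n;
      for (let cursor = tagAt; cursor > left; ) {
        const child = readValue(buf, cursor);
        items.push(child.value);
        cursor = child.left;
      }
      return { value: items, left };
    }
    case ":": { // object: key, value, key, value, ... going right to left
      const obj: { [key: string]: Value } = {};
      const left = tagAt - n;
      for (let cursor = tagAt; cursor > left; ) {
        const key = readValue(buf, cursor);
        const val = readValue(buf, key.left);
        obj[String(key.value)] = val.value;
        cursor = val.left;
      }
      return { value: obj, left };
    }
    default:
      throw new Error(`tag ${tag} is not handled by this sketch`);
  }
}

function decode(text: string): Value {
  const buf = new TextEncoder().encode(text);
  return readValue(buf, buf.length).value;
}

console.log(decode("+1k"));          // 42
console.log(decode("hi,2"));         // hi
console.log(decode("+6+4+2;6"));     // [ 1, 2, 3 ]
console.log(decode("+4b,1+2a,1:a")); // { a: 1, b: 2 }
```

Because every container stores its content size, a reader can step over a whole subtree by subtracting that size from the cursor instead of decoding it, which is how values that are never requested can be skipped.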

The encoder automatically deduplicates values, shares object schemas, compresses shared string prefixes, and adds sorted indexes. See the format spec for the full grammar, railroad diagrams, and a walkthrough of how a complete object is encoded byte by byte.
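
For contrast, here is a toy encoder with none of those optimizations. It is only a sketch that reproduces the four integer, string, array, and object rows from the table above; `toyEncode` and `digits` are hypothetical names, the digit alphabet is inferred from the examples, and the handling of zero (no digits) is an assumption rather than something the table confirms.

```typescript
// Assumed base64 digit alphabet, inferred from the examples in the table.
const B64 = "0123456789abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ-_";
const utf8 = new TextEncoder();

type Json = number | string | Json[] | { [key: string]: Json };

// Base64 varint, most significant digit first.
// Assumption: zero is written as an empty digit string.
function digits(n: number): string {
  let out = "";
  for (; n > 0; n = Math.floor(n / 64)) out = B64[n % 64] + out;
  return out;
}

function toyEncode(value: Json): string {
  if (typeof value === "number") {
    if (!Number.isInteger(value)) throw new Error("toy encoder: integers only");
    // Zigzag: 0, -1, 1, -2, 2, ... map to 0, 1, 2, 3, 4, ...
    return "+" + digits(value >= 0 ? value * 2 : -value * 2 - 1);
  }
  if (typeof value === "string") {
    // Body first, then the tag, then the body's byte length.
    return value + "," + digits(utf8.encode(value).length);
  }
  // Containers: collect children in logical order (keys and values
  // interleaved for objects), then write them reversed so that a
  // right-to-left reader meets them in the original order.
  const parts: string[] = [];
  if (Array.isArray(value)) {
    for (const item of value) parts.push(toyEncode(item));
  } else {
    for (const [k, v] of Object.entries(value)) parts.push(toyEncode(k), toyEncode(v));
  }
  const content = parts.reverse().join("");
  const tag = Array.isArray(value) ? ";" : ":";
  return content + tag + digits(utf8.encode(content).length);
}

console.log(toyEncode(42));             // +1k
console.log(toyEncode("hi"));           // hi,2
console.log(toyEncode([1, 2, 3]));      // +6+4+2;6
console.log(toyEncode({ a: 1, b: 2 })); // +4b,1+2a,1:a
```

The real encoder's output diverges from this as soon as values repeat or containers grow, since that is where pointers, shared schemas, chains, and indexes come in.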

To inspect real data, paste RX or JSON into the live viewer at rx.run.

Inspect API

inspect() returns a lazy AST that maps 1:1 to the byte encoding — pointers stay as pointers, chains as chains, indexes as indexes:

import { encode, inspect } from "@creationix/rx";

const buf = encode({ name: "alice", scores: [10, 20, 30] });
const root = inspect(buf);

root.tag          // ":"
root[0].tag       // "," (a string key)
root[0].value     // "name"
root.length       // 4 (key, value, key, value)

for (const child of root) {
  console.log(child.tag, child.b64);
}

Each node exposes: tag, b64, left, right, size, data, and value (lazy). Nodes with children (:, ;, ., *, #) are iterable and support indexed access. Children are parsed lazily and cached.

Semantic helpers for object nodes:

for (const [key, val] of root.entries()) { ... }
for (const [key, val] of root.filteredKeys("/api/")) { ... }  // O(log n + m) on indexed objects
const node = root.index("name");   // key lookup
const elem = root.index(2);        // array index

Low-level cursor API

For zero-allocation traversal without the Proxy layer, see docs/cursor-api.md.

Proxy behavior

The value returned by parse/open is read-only:

obj.newKey = 1;      // throws TypeError
delete obj.key;      // throws TypeError
"key" in obj;        // works (zero-alloc key search)
obj.nested === obj.nested  // true (container Proxies are memoized)
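
The same four behaviors can be reproduced with a standard JavaScript Proxy. The sketch below wraps plain in-memory objects rather than an encoded buffer, so it says nothing about how RX implements its traps; `readOnly` is a hypothetical helper used only to show throwing write traps and memoized container Proxies.

```typescript
// One Proxy per underlying container, so repeated access returns the same object.
const cache = new WeakMap<object, object>();

function readOnly<T extends object>(target: T): T {
  const cached = cache.get(target);
  if (cached) return cached as T;
  const proxy = new Proxy(target, {
    get(t, key, receiver) {
      const value = Reflect.get(t, key, receiver);
      // Wrap nested containers lazily, only when they are actually read.
      return typeof value === "object" && value !== null ? readOnly(value) : value;
    },
    set(): boolean {
      throw new TypeError("read-only");
    },
    deleteProperty(): boolean {
      throw new TypeError("read-only");
    },
    defineProperty(): boolean {
      throw new TypeError("read-only");
    },
  });
  cache.set(target, proxy);
  return proxy;
}

const doc = readOnly({ nested: { n: 1 }, list: [1, 2, 3] });
console.log(doc.nested === doc.nested);  // true
console.log("nested" in doc);            // true
console.log(doc.list.map((x) => x * 2)); // [ 2, 4, 6 ]
```

Memoizing the container Proxies keeps identity checks and caches keyed on object identity working the way they would with plain parsed JSON.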

Escape hatch to the underlying buffer:

import { handle } from "@creationix/rx";
const h = handle(obj.nested);
// h.data: Uint8Array, h.right: byte offset

More

docs/rx-format.md — format spec, grammar, and railroad diagrams
docs/cursor-api.md — low-level zero-allocation cursor API
rx-perf.md — cursor internals, Proxy design, allocation profile
samples/ — example datasets with JSON/RX pairs
rx.run — live web viewer

License

MIT
